DOI

In the paper vector-space semantic models based on Word2Vec word embeddings algorithm and a count-based association-oriented algorithm are evaluated and compared by measuring association strength between Russian nouns and adjectives. A dataset of nouns and associated adjectives is used as the test set for pseudodisambiguation task. Models are trained with corpora of Russian fiction. A measure of lexical association anomaly is applied evaluating similarity between the initial noun and the resulting attributive phrase. Results of association strength are reported for models characterized by different parameter values; the best parameter value combinations are proposed. The test exemplars producing the error rate are manually annotated, and the model errors are categorized in terms of their linguistic nature and compositionality features.

Язык оригиналаанглийский
Название основной публикацииAnalysis of Images, Social Networks and Texts - 5th International Conference, AIST 2016, Revised Selected Papers
РедакторыNatalia Loukachevitch, Alexander Panchenko, Konstantin Vorontsov, Valeri G. Labunets, Andrey V. Savchenko, Dmitry I. Ignatov, Sergey I. Nikolenko, Mikhail Yu. Khachay
ИздательSpringer Nature
Страницы236-247
Число страниц12
ISBN (печатное издание)9783319529196
DOI
СостояниеОпубликовано - 1 янв 2017
Событие5th International Conference on Analysis of Images, Social Networks and Texts, AIST 2016 - Yekaterinburg, Российская Федерация
Продолжительность: 7 апр 20169 апр 2016

Серия публикаций

НазваниеCommunications in Computer and Information Science
Том661
ISSN (печатное издание)1865-0929

конференция

конференция5th International Conference on Analysis of Images, Social Networks and Texts, AIST 2016
Страна/TерриторияРоссийская Федерация
ГородYekaterinburg
Период7/04/169/04/16

    Предметные области Scopus

  • Компьютерные науки (все)
  • Математика (все)

ID: 47480880