Recent advances in deep leaming for natural language processing achieve and improve over state of the art results in many natural language processing tasks. One problem with neural network models, however, is that they require large datasets, including large labeled datasets for the corresponding problems. In this work, we suggest a dala augmentation method based on extending a given dataset with synonyms for the words appearing there. We apply this approach to the morphologically rich Russian language and show improvements for modem neural network NLP models on standard tasks such as sentiment analysis.

Original languageEnglish
Title of host publicationProceedings of the AINL FRUCT 2016 Conference
EditorsAndrey Filchenkov, Jan Zizka, Lidia Pivovarova, Sergey Balandin
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9789526839783
StatePublished - 3 Apr 2017
Event5th Artificial Intelligence and Natural Language FRUCT Conference, AINL FRUCT 2016 - Saint-Petersburg, Russian Federation
Duration: 10 Nov 201612 Nov 2016

Publication series

NameProceedings of the AINL FRUCT 2016 Conference

Conference

Conference5th Artificial Intelligence and Natural Language FRUCT Conference, AINL FRUCT 2016
Country/TerritoryRussian Federation
CitySaint-Petersburg
Period10/11/1612/11/16

    Scopus subject areas

  • Artificial Intelligence

ID: 95167988