DOI

The paper is devoted to processing parallel and comparable corpora by means of topic modelling. We focus our attention on Russian and English parallel and comparable texts. We use Latent Dirichlet Allocation (LDA) algorithm for building topic models of fiction texts, evaluation of compatibility for the original text and its translation(s), selection of possible translation equivalents.

Original languageEnglish
Title of host publicationProceedings of the International Conference on Internet and Modern Society, IMS 2017
EditorsIrina I. Tolstikova, Nikolai V. Borisov, Victor P. Zakharov, Nikolai V. Borisov, Leonid V. Smorgunov, Radomir V. Bolgov
PublisherAssociation for Computing Machinery
Pages175-180
Number of pages6
ISBN (Electronic)9781450354370
DOIs
StatePublished - 21 Jun 2017
Event2017 International Conference on Internet and Modern Society, IMS 2017: международная объединенная конференция - Университет ИТМО, Санкт-Петербург, Russian Federation
Duration: 21 Jun 201723 Jun 2017
Conference number: XX
http://icims.ifmo.ru/
http://ims.ifmo.ru/ru/pages/28/IMS_2017.htm

Publication series

NameACM International Conference Proceeding Series

Conference

Conference2017 International Conference on Internet and Modern Society, IMS 2017
Abbreviated titleIMS 2017
Country/TerritoryRussian Federation
CityСанкт-Петербург
Period21/06/1723/06/17
Internet address

    Scopus subject areas

  • Human-Computer Interaction
  • Computer Networks and Communications
  • Computer Vision and Pattern Recognition
  • Software

    Research areas

  • Comparable Texts, English, Fiction, Parallel, Russian, Text Corpora, Topic Modelling

ID: 41188336