Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Research › peer-review
The paper is devoted to processing parallel and comparable corpora by means of topic modelling. We focus our attention on Russian and English parallel and comparable texts. We use Latent Dirichlet Allocation (LDA) algorithm for building topic models of fiction texts, evaluation of compatibility for the original text and its translation(s), selection of possible translation equivalents.
Original language | English |
---|---|
Title of host publication | Proceedings of the International Conference on Internet and Modern Society, IMS 2017 |
Editors | Irina I. Tolstikova, Nikolai V. Borisov, Victor P. Zakharov, Nikolai V. Borisov, Leonid V. Smorgunov, Radomir V. Bolgov |
Publisher | Association for Computing Machinery |
Pages | 175-180 |
Number of pages | 6 |
ISBN (Electronic) | 9781450354370 |
DOIs | |
State | Published - 21 Jun 2017 |
Event | 2017 International Conference on Internet and Modern Society, IMS 2017: международная объединенная конференция - Университет ИТМО, Санкт-Петербург, Russian Federation Duration: 21 Jun 2017 → 23 Jun 2017 Conference number: XX http://icims.ifmo.ru/ http://ims.ifmo.ru/ru/pages/28/IMS_2017.htm |
Name | ACM International Conference Proceeding Series |
---|
Conference | 2017 International Conference on Internet and Modern Society, IMS 2017 |
---|---|
Abbreviated title | IMS 2017 |
Country/Territory | Russian Federation |
City | Санкт-Петербург |
Period | 21/06/17 → 23/06/17 |
Internet address |
ID: 41188336