Результаты исследований: Публикации в книгах, отчётах, сборниках, трудах конференций › статья в сборнике материалов конференции › Рецензирование
The goal of the current work is to evaluate semantic feature aggregation techniques in a task of gender classification of public social media texts in Russian. We collect Facebook posts of Russian-speaking users and apply them as a dataset for two topic modelling techniques and a distributional clustering approach. The output of the algorithms is applied as a feature aggregation method in a task of gender classification based on a smaller Facebook sample. The classification performance of the best model is favorably compared against the lemmas baseline and the state-of-the-art results reported for a different genre or language. The resulting successful features are exemplified, and the difference between the three techniques in terms of classification performance and feature contents are discussed, with the best technique clearly outperforming the others.
Язык оригинала | английский |
---|---|
Название основной публикации | Artificial Intelligence and Natural Language - 6th Conference, AINL 2017, Revised Selected Papers |
Издатель | Springer Nature |
Страницы | 3-15 |
Число страниц | 13 |
Том | 789 |
ISBN (печатное издание) | 9783319717456 |
DOI | |
Состояние | Опубликовано - 2018 |
Событие | Conference on Artificial Intelligence and Natural Language - St. Petersburg, Российская Федерация Продолжительность: 19 сен 2017 → 22 сен 2017 Номер конференции: 6 http://ainlconf.ru/2017 |
Название | Communications in Computer and Information Science |
---|---|
Том | 789 |
ISSN (печатное издание) | 1865-0929 |
конференция | Conference on Artificial Intelligence and Natural Language |
---|---|
Сокращенное название | AINL 2017 |
Страна/Tерритория | Российская Федерация |
Город | St. Petersburg |
Период | 19/09/17 → 22/09/17 |
Сайт в сети Internet |
ID: 13395534