Research output: Publications in books, reports, collections, conference proceedings › Article in conference proceedings › Research › Peer-reviewed
In this paper, we describe the second stage of a study that uses machine learning algorithms to identify the factors that influence the phonetic reduction of words in Russian speech. We discuss the limitations of the first stage of our study and try to overcome some of them by enlarging the dataset and using new algorithms such as random forest, gradient boosting, and perceptron. We used texts from the Corpus of Russian Speech as the data. The dataset was divided into two separate datasets: one consisted of single words and the other contained multiword units from our corpus. According to the results, for single words the most important features turned out to be the number of syllables and whether the word is an adjective, since all algorithms selected them. For the multiword units, the main features were the number of syllables, frequency in Russian spoken texts (in ipm), and token frequency in a given text. In our further research, we are going to expand the dataset and look more closely at such features as text type and token frequency in a given text.
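The three numeric features named in the abstract (number of syllables, frequency in ipm, and token frequency in a given text) can be illustrated with a minimal sketch. The record does not say how the authors computed them, so everything here is an assumption: the function names are hypothetical, syllables are approximated by counting vowel letters, and the sample text and corpus figures are invented.

```python
from collections import Counter

# Russian vowel letters; each written vowel is treated as one syllable nucleus.
# This is an approximation chosen for the sketch, not the authors' procedure.
RUSSIAN_VOWELS = set("аеёиоуыэюя")


def count_syllables(word: str) -> int:
    """Approximate the syllable count of a Russian word by counting vowel letters."""
    return sum(1 for ch in word.lower() if ch in RUSSIAN_VOWELS)


def token_frequencies(tokens: list[str]) -> Counter:
    """Count how often each lowercased token occurs in a single text."""
    return Counter(t.lower() for t in tokens)


def ipm(count: int, corpus_size: int) -> float:
    """Convert a raw corpus count to instances per million tokens (ipm)."""
    if corpus_size <= 0:
        raise ValueError("corpus_size must be positive")
    return count * 1_000_000 / corpus_size


if __name__ == "__main__":
    # Toy text and invented corpus figures, for illustration only.
    text = "сейчас я сейчас приду потому что сейчас некогда".split()
    freqs = token_frequencies(text)
    for word in ("сейчас", "потому"):
        print(word, count_syllables(word), freqs[word])
    print(ipm(count=5, corpus_size=2_000_000))
```

Values like these would form the columns of a feature table that a classifier such as a random forest could then rank by importance.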
Original language | English |
---|---|
Title of host publication | Speech and Computer - 23rd International Conference, SPECOM 2021, Proceedings |
Editors | Alexey Karpov, Rodmonga Potapova |
Publisher | Springer Nature |
Pages | 146-156 |
Number of pages | 11 |
ISBN (print) | 9783030878016 |
DOI | |
Publication status | Published - 2021 |
Event | 23rd International Conference on Speech and Computer - Virtual, Online, Russian Federation. Duration: 27 Sep 2021 → 30 Sep 2021. Conference number: 23. http://specom.nw.ru/2021/ |
Series name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 12997 LNAI |
ISSN (print) | 0302-9743 |
ISSN (electronic) | 1611-3349 |
Conference | 23rd International Conference on Speech and Computer |
---|---|
Abbreviated title | SPECOM 2021 |
Country/Territory | Russian Federation |
City | Virtual, Online |
Period | 27/09/21 → 30/09/21 |
Website |
ID: 87566335