Результаты исследований: Публикации в книгах, отчётах, сборниках, трудах конференций › статья в сборнике материалов конференции › научная › Рецензирование
Prepositional Phrase Classification in Russian with Transformer Models. / Belyi, A.; Boitsova, D.; Botvineva, E.
Internet and Modern Society. Human-Computer Communication (IMS 2024). Springer Nature, 2026. стр. 130-139 (Communications in Computer and Information Science; Том 2534 CCIS).Результаты исследований: Публикации в книгах, отчётах, сборниках, трудах конференций › статья в сборнике материалов конференции › научная › Рецензирование
}
TY - GEN
T1 - Prepositional Phrase Classification in Russian with Transformer Models
AU - Belyi, A.
AU - Boitsova, D.
AU - Botvineva, E.
N1 - Conference code: XXVII
PY - 2026
Y1 - 2026
N2 - In this paper we discuss the task of prepositional phrase classification in the Russian annotated corpus of prepositional phrases. As previous research has shown, differentiation of highly confused classes, namely THEME and OBJECT classes, remains a problem waiting for the computational solution. Since simple classifier architecture demonstrates significant performance on these classes, we propose a tree-based classifier architecture to improve performance on the whole and these classes specifically. This architecture consists of a main classifier validating its decisions concerning troublesome classes with another supporting classifier, trained to differentiate between the classes causing performance dropdown. We experiment with various types of classifiers inside of our architecture and various embedding models for the Russian language, which we use for encoding the dataset. The best result that we managed to achieve is an overall F1-score of 0.76 on the validation set using the classifier trained with DeepPavlov/rubert-base-cased model and SVM (Support Vector Machines) classifiers. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
AB - In this paper we discuss the task of prepositional phrase classification in the Russian annotated corpus of prepositional phrases. As previous research has shown, differentiation of highly confused classes, namely THEME and OBJECT classes, remains a problem waiting for the computational solution. Since simple classifier architecture demonstrates significant performance on these classes, we propose a tree-based classifier architecture to improve performance on the whole and these classes specifically. This architecture consists of a main classifier validating its decisions concerning troublesome classes with another supporting classifier, trained to differentiate between the classes causing performance dropdown. We experiment with various types of classifiers inside of our architecture and various embedding models for the Russian language, which we use for encoding the dataset. The best result that we managed to achieve is an overall F1-score of 0.76 on the validation set using the classifier trained with DeepPavlov/rubert-base-cased model and SVM (Support Vector Machines) classifiers. © The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
KW - phrase embeddings
KW - prepositional phrases
KW - text classification
KW - transformers
KW - word sense disambiguation
KW - Architecture
KW - Classification (of information)
KW - Computational linguistics
KW - Embeddings
KW - Natural language processing systems
KW - Text processing
KW - Computational solutions
KW - Performance
KW - Phrase embedding
KW - Prepositional phrase
KW - Simple++
KW - Text classification
KW - Transformer
KW - Transformer modeling
KW - Word Sense Disambiguation
KW - Support vector machines
UR - https://www.mendeley.com/catalogue/f4761c81-3361-3178-a630-e4a532e274d4/
U2 - 10.1007/978-3-031-96177-9_11
DO - 10.1007/978-3-031-96177-9_11
M3 - статья в сборнике материалов конференции
SN - 9783031961762
T3 - Communications in Computer and Information Science
SP - 130
EP - 139
BT - Internet and Modern Society. Human-Computer Communication (IMS 2024)
PB - Springer Nature
Y2 - 24 June 2024 through 26 June 2024
ER -
ID: 151442632