This paper presents models for solving the problem of multiclass classification of user posts in a social media. These models are based on embeddings extracted from messages using the RuBert language model and a fully connected neural network built over it. The models presented are compared to a baseline model using long-Term short-Term memory neurons (LSTM). The results will improve the accuracy of the classification posts, which in turn will improve the accuracy of assessing the psychological characteristics users.

Original languageEnglish
Title of host publicationProceedings of 2022 25th International Conference on Soft Computing and Measurements, SCM 2022
Subtitle of host publicationProceedings
EditorsS. Shaposhnikov
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages31-33
Number of pages3
ISBN (Electronic)9781665496698
ISBN (Print)9781665496698
DOIs
StatePublished - 25 May 2022
Event25th International Conference on Soft Computing and Measurements, SCM 2022 - St. Petersburg, Russian Federation
Duration: 25 May 202227 May 2022

Conference

Conference25th International Conference on Soft Computing and Measurements, SCM 2022
Country/TerritoryRussian Federation
CitySt. Petersburg
Period25/05/2227/05/22

    Scopus subject areas

  • Computer Science Applications
  • Computational Mathematics
  • Control and Optimization
  • Numerical Analysis
  • Instrumentation

    Research areas

  • analysis of social media posts, fully connected neural network, LSTM, machine learning, multi-class classification, RuBert, sentence embedding

ID: 99228938