This paper proposes a linguistically-rich approach to hidden community detection which was tested in experiments with the Russian corpus of VKontakte posts. Modern algorithms for hidden community detection are based on graph theory, these procedures leaving out of account the linguistic features of analyzed texts. The authors have developed a new hybrid approach to the detection of hidden communities, combining author-topic modeling and automatic topic labeling. Specific linguistic parameters of Russian posts were revealed for correct language processing. The results justify the use of the algorithm that can be further integrated with already developed graph methods.
|Name||Communications in Computer and Information Science|
|Conference||9th Conference on Artificial Intelligence and Natural Language|
|Abbreviated title||AINL 2020|
|Period||7/10/20 → 9/10/20|