Topic modeling has emerged over the last decade as a powerful tool for analyzing large text corpora, including Web-based user-generated texts. Topic stability, however, remains a concern: topic models have a very complex optimization landscape with many local maxima, and even different runs of the same model can yield very different topics. To add stability to topic modeling, we propose an approach based on local density regularization, in which words in the local context window of a given word have a higher probability of being assigned the same topic as that word. We compare several models with local density regularizers and show that they can improve topic stability while remaining on par with classical models in terms of quality metrics.
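
The abstract does not spell out the regularizer, but the research-area keywords point to latent Dirichlet allocation and Gibbs sampling, so the following minimal Python sketch shows one way a local density regularizer of this kind could enter a collapsed Gibbs sampler. The function name, the window size `window`, the strength `lam`, and the multiplicative exponential boost are all illustrative assumptions, not the paper's actual formulation.

```python
import numpy as np

def gibbs_lda_local(docs, V, K, alpha=0.1, beta=0.01,
                    window=2, lam=1.0, iters=200, seed=0):
    """Collapsed Gibbs sampling for LDA with an illustrative local
    density regularizer: the sampling weight of topic k at position i
    is boosted by exp(lam * n_win[k]), where n_win[k] counts the
    neighbours of i (within `window` positions) currently on topic k.
    This exact form is an assumption, not the paper's formulation."""
    rng = np.random.default_rng(seed)
    D = len(docs)
    ndk = np.zeros((D, K))              # document-topic counts
    nkw = np.zeros((K, V))              # topic-word counts
    nk = np.zeros(K)                    # total words per topic
    z = [rng.integers(K, size=len(doc)) for doc in docs]
    for d, doc in enumerate(docs):      # initialize count tables
        for i, w in enumerate(doc):
            k = z[d][i]
            ndk[d, k] += 1; nkw[k, w] += 1; nk[k] += 1
    for _ in range(iters):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                k = z[d][i]             # remove current assignment
                ndk[d, k] -= 1; nkw[k, w] -= 1; nk[k] -= 1
                # standard collapsed-Gibbs sampling weights
                p = (ndk[d] + alpha) * (nkw[:, w] + beta) / (nk + V * beta)
                # count topics of neighbouring words in the local window
                lo, hi = max(0, i - window), min(len(doc), i + window + 1)
                n_win = np.zeros(K)
                for j in range(lo, hi):
                    if j != i:
                        n_win[z[d][j]] += 1
                p *= np.exp(lam * n_win)   # local density boost (assumed form)
                p /= p.sum()
                k = rng.choice(K, p=p)     # resample this word's topic
                z[d][i] = k
                ndk[d, k] += 1; nkw[k, w] += 1; nk[k] += 1
    phi = (nkw + beta) / (nkw.sum(axis=1, keepdims=True) + V * beta)
    theta = (ndk + alpha) / (ndk.sum(axis=1, keepdims=True) + K * alpha)
    return phi, theta, z

# Toy usage: two tiny documents over a 6-word vocabulary, 2 topics.
docs = [[0, 1, 2, 0, 1], [3, 4, 5, 3, 4]]
phi, theta, z = gibbs_lda_local(docs, V=6, K=2, iters=100)
print(theta)   # each document should concentrate on one topic
```

With `lam = 0` the boost vanishes and the sampler reduces to standard collapsed Gibbs for LDA; increasing `lam` makes neighbouring words more likely to share a topic, which is the intuition behind the stability effect described in the abstract.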

Original language: English
Title of host publication: Internet Science - 3rd International Conference, INSCI 2016, Proceedings
Editors: Anna Satsiou, Yanina Welp, Thanassis Tiropanis, Dominic DiFranzo, Ioannis Stavrakakis, Franco Bagnoli, Paolo Nesi, Giovanna Pacini
Publisher: Springer Nature
Pages: 176-188
Number of pages: 13
ISBN (Print): 9783319459813
DOIs
State: Published - 2016
Event: 3rd International Conference on Internet Science, INSCI 2016 - Florence, Italy
Duration: 12 Sep 2016 – 14 Sep 2016

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 9934 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 3rd International Conference on Internet Science, INSCI 2016
Country/Territory: Italy
City: Florence
Period: 12/09/16 – 14/09/16

Scopus subject areas

• Theoretical Computer Science
• Computer Science (all)

Research areas

• Gibbs sampling, Latent Dirichlet allocation, Topic modeling

ID: 7604879