Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Research › peer-review
Topic modeling has emerged over the last decade as a powerful tool for analyzing large text corpora, including Web-based usergenerated texts. Topic stability, however, remains a concern: topic models have a very complex optimization landscape with many local maxima, and even different runs of the same model yield very different topics. Aiming to add stability to topic modeling, we propose an approach to topic modeling based on local density regularization, where words in a local context window of a given word have higher probabilities to get the same topic as that word. We compare several models with local density regularizers and show how they can improve topic stability while remaining on par with classical models in terms of quality metrics.
Original language | English |
---|---|
Title of host publication | Internet Science - 3rd International Conference, INSCI 2016, Proceedings |
Editors | Anna Satsiou, Yanina Welp, Thanassis Tiropanis, Dominic DiFranzo, Ioannis Stavrakakis, Franco Bagnoli, Paolo Nesi, Giovanna Pacini |
Publisher | Springer Nature |
Pages | 176-188 |
Number of pages | 13 |
ISBN (Print) | 9783319459813 |
DOIs | |
State | Published - 2016 |
Event | 3rd International Conference on Internet Science, INSCI 2016 - Florence, Italy Duration: 12 Sep 2016 → 14 Sep 2016 |
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 9934 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference | 3rd International Conference on Internet Science, INSCI 2016 |
---|---|
Country/Territory | Italy |
City | Florence |
Period | 12/09/16 → 14/09/16 |
ID: 7604879