DOI

Topic modeling is a powerful tool for analyzing large collections of user-generated web content, but it still suffers from problems with topic stability, which are especially important for social sciences. We evaluate stability for differenttopic models and propose a new model, granulated LDA,that samples short sequences of neighboring words at once. We show that gLDA exhibits very stable results.

Original languageEnglish
Title of host publicationWebSci 2016 - Proceedings of the 2016 ACM Web Science Conference
PublisherAssociation for Computing Machinery
Pages342-343
Number of pages2
ISBN (Electronic)9781450342087
DOIs
StatePublished - 22 May 2016
Event8th ACM Web Science Conference, WebSci 2016 - Hannover, Germany
Duration: 22 May 201625 May 2016

Publication series

NameWebSci 2016 - Proceedings of the 2016 ACM Web Science Conference

Conference

Conference8th ACM Web Science Conference, WebSci 2016
Country/TerritoryGermany
CityHannover
Period22/05/1625/05/16

    Scopus subject areas

  • Computer Networks and Communications

    Research areas

  • Gibbs sampling, Latent Dirichlet allocation, Topic modeling

ID: 7604123