Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › Research › peer-review
Usually, text documents are represented as a vector of n-dimensional Euclidean space. One of the main it the problem of the typology of texts using cluster analysis is to determine the number of clusters. In this article was researched the agglomerative clustering algorithm in Euclidean space. A statistical criterion for completing the clustering process was deriving as the Markov moment. Was considered the problem of cluster stability. As an example, it was considered retrieval of the harmful content.
Original language | English |
---|---|
Title of host publication | Internet Science - INSCI 2018 International Workshops |
Subtitle of host publication | Conference proceedings |
Editors | S.S. Bodrunova, et al. |
Publisher | Springer Nature |
Pages | 19-32 |
ISBN (Print) | 9783030177041 |
DOIs | |
State | Published - 2019 |
Event | 5th International Conference on Internet Science, INSCI 2018: Internet in World Regions: Digital Freedoms and Citizen Empowerment - СПбГУ, Институт "Высшая школа журналистики и массовых коммуникаций", St. Petersburg, Russian Federation Duration: 24 Oct 2018 → 26 Oct 2018 Conference number: 5th http://insci2018.org/ http://insci2018.org |
Name | Lecture Notes in Computer Science |
---|---|
Volume | 11551 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference | 5th International Conference on Internet Science, INSCI 2018 |
---|---|
Abbreviated title | INSCI 2018 |
Country/Territory | Russian Federation |
City | St. Petersburg |
Period | 24/10/18 → 26/10/18 |
Internet address |
ID: 41713635