Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review
The paper describes a process of clustering of article abstracts, taken from the largest bibliographic life sciences and biomedical information MEDLINE database into categories that correspond to types of medical interventions - types of patient treatments. Experiments were carried out to evaluate the quality of clustering for the following algorithms: K-means; K- means++; Hierarchical clustering, SIB (Sequential information bottleneck) together with the LSA (Latent Semantic Analysis) methods and MI (Mutual Information) which allow selecting feature vectors. Best results of clustering were achieved by K- means++ together with LSA then 210- dimensional space was chosen: Purity = 0.5719, Entropy = 1.3841, Normalized Entropy = 0.6299.
Original language | English |
---|---|
Title of host publication | 2015 INTERNATIONAL CONFERENCE "STABILITY AND CONTROL PROCESSES" IN MEMORY OF V.I. ZUBOV (SCP) |
Editors | LA Petrosyan, AP Zhabko |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 555-557 |
Number of pages | 3 |
ISBN (Print) | 9781467376983 |
DOIs | |
State | Published - 2015 |
Event | International Conference on "Stability and Control Processes" in Memory of V.I. Zubov, SCP 2015 - Петергоф, St. Petersburg, Russian Federation Duration: 5 Oct 2015 → 9 Oct 2015 http://www.apmath.spbu.ru/scp2015/openconf.php |
Conference | International Conference on "Stability and Control Processes" in Memory of V.I. Zubov, SCP 2015 |
---|---|
Abbreviated title | SCP 2015 |
Country/Territory | Russian Federation |
City | St. Petersburg |
Period | 5/10/15 → 9/10/15 |
Internet address |
ID: 3983135