Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review
The paper describes a process of clustering of article abstracts, taken from the largest bibliographic life sciences and biomedical information MEDLINE database into categories that correspond to types of medical interventions - types of patient treatments. Experiments were carried out to evaluate the quality of clustering for the following algorithms: K-means; K- means++; Hierarchical clustering, SIB (Sequential information bottleneck) together with the LSA (Latent Semantic Analysis) methods and MI (Mutual Information) which allow selecting feature vectors. Best results of clustering were achieved by K- means++ together with LSA then 210- dimensional space was chosen: Purity = 0.5719, Entropy = 1.3841, Normalized Entropy = 0.6299.
| Original language | English |
|---|---|
| Title of host publication | 2015 INTERNATIONAL CONFERENCE "STABILITY AND CONTROL PROCESSES" IN MEMORY OF V.I. ZUBOV (SCP) |
| Editors | LA Petrosyan, AP Zhabko |
| Publisher | Institute of Electrical and Electronics Engineers Inc. |
| Pages | 555-557 |
| Number of pages | 3 |
| ISBN (Print) | 9781467376983 |
| DOIs | |
| State | Published - 2015 |
| Event | International Conference on "Stability and Control Processes" in Memory of V.I. Zubov, SCP 2015 - Петергоф, St. Petersburg, Russian Federation Duration: 5 Oct 2015 → 9 Oct 2015 http://www.apmath.spbu.ru/scp2015/openconf.php |
| Conference | International Conference on "Stability and Control Processes" in Memory of V.I. Zubov, SCP 2015 |
|---|---|
| Abbreviated title | SCP 2015 |
| Country/Territory | Russian Federation |
| City | St. Petersburg |
| Period | 5/10/15 → 9/10/15 |
| Internet address |
ID: 3983135