The stability of clustering methods is a commonly used approach in cluster analysis for determining the "true" number of groupings. The acceptable clustering is such data sample grouping that is robust to random perturbations of investigated data. In this paper, we propose an algorithm for determining the number of clusters based on the introduction of the initial dataset which are expanded by adding the set of perturbated initial dataset.

Original languageRussian
Pages (from-to)28-37
Number of pages10
Journal ВЕСТНИК САНКТ-ПЕТЕРБУРГСКОГО УНИВЕРСИТЕТА. ПРИКЛАДНАЯ МАТЕМАТИКА. ИНФОРМАТИКА. ПРОЦЕССЫ УПРАВЛЕНИЯ
Volume12
Issue number1
StatePublished - 2016

    Research areas

  • clustering, cluster stability, optimal cluster number

ID: 85909530