One of the important problems arising in cluster analysis is the estimation of the appropriate number of clusters. In the case when the expected number of clusters is sufficiently large, the majority of the existing methods involve high complexity computations. This difficulty can be avoided by using a suitable confidence interval to estimate the number of clusters. Such a method is proposed in the current chapter.

The main idea is to allocate the jump position of the within-cluster dispersion function using Chebyshev polynomial approximations. The confidence interval for the true number of clusters can be obtained in this way by means of a comparatively small number of the distortion calculations. a significant computational complexity decreasing is proven. Several examples are given to demonstrate the high ability of the proposed methodology.

Язык оригиналаАнглийский
Название основной публикацииDATA MINING: FOUNDATIONS AND INTELLIGENT PARADIGMS, VOL 1: CLUSTERING, ASSOCIATION AND CLASSIFICATION
РедакторыDE Holmes, LC Jain
ИздательSpringer Nature
Страницы131-155
Число страниц25
ISBN (печатное издание)978-3-642-23165-0
СостояниеОпубликовано - 2012

Серия публикаций

НазваниеIntelligent Systems Reference Library
ИздательSPRINGER-VERLAG BERLIN
Том23
ISSN (печатное издание)1868-4394

ID: 4420519