Validity studies in clustering methodologies

作者:

Highlights:

摘要

Clustering algorithms tend to generate clusters even when applied to random data. This paper provides a semi-tutorial review of the state-of-the-art in cluster validity, or the verification of results from clustering algorithms. The paper covers ways of measuring clustering tendency, the fit of hierarchical and partitional structures and indices of compactness and isolation for individual clusters. Included are structural criteria for validating clusters and the factors involved in choosing criteria, according to which the literature of cluster validity is classified. An application to speaker identification demonstrates several indices. The development of new clustering techniques and the wide availability of clustering programs necessitates vigorous research in cluster validity.

论文关键词:Clustering,Cluster validity,Hierarchical structure,Clustering tendency,Compactness,Isolation,Intrinsic dimensionality,Global fit

论文评审过程:Received 19 October 1978, Revised 16 February 1979, Available online 19 May 2003.

论文官网地址:https://doi.org/10.1016/0031-3203(79)90034-7