Noise, histogram and cluster validity for Gaussian-mixtured data

作者:

Highlights:

摘要

In this study, a critique of the clustering methodology is carried out for the definition of a cluster, determination of the number of clusters and evaluation of heuristic partitional clustering algorithms, when the data is a noisy Gaussian Mixture. The effects of noise in determining the number of clusters and the clustering parameters are investigated. Two cluster validity criteria, namely, the likelihood information criterion and the sum of squared error are described. It is concluded that these criteria can be used as a guide in deciding on the number of valid clusters. By using the proposed sum of squared error criterion, an improvement algorithm which reduces the effect of noise on the results of heuristic clustering algorithms is described.

论文关键词:Cluster,Noise,Histogram,LIC

论文评审过程:Received 15 August 1986, Revised 30 September 1986, Available online 19 May 2003.

论文官网地址:https://doi.org/10.1016/0031-3203(87)90063-X