Estimating the number of clusters in a numerical data set via quantization error modeling

作者:

Highlights:

• A parameterized model for the clustering error is introduced.

• The model parameter is a measure of the data dimension and homogeneity.

• A new cost criterion is derived from the properties of the model.

• The method demonstrates good results for numerical data sets.

摘要

•A parameterized model for the clustering error is introduced.•The model parameter is a measure of the data dimension and homogeneity.•A new cost criterion is derived from the properties of the model.•The method demonstrates good results for numerical data sets.

论文关键词:Clustering,Number of clusters,Vector quantization,Color quantization,Dominant colors,Fractal dimensions

论文评审过程:Received 13 April 2014, Revised 10 August 2014, Accepted 15 September 2014, Available online 30 September 2014.

论文官网地址:https://doi.org/10.1016/j.patcog.2014.09.017