A genetic approach to the automatic clustering problem

作者:

Highlights:

摘要

In solving the clustering problem, traditional methods, for example, the K-means algorithm and its variants, usually ask the user to provide the number of clusters. Unfortunately, the number of clusters in general is unknown to the user. Therefore, clustering becomes a tedious trial-and-error work and the clustering result is often not very promising especially when the number of clusters is large and not easy to guess. In this paper, we propose a genetic algorithm for the clustering problem. This algorithm is suitable for clustering the data with compact spherical clusters. It can be used in two ways. One is the user-controlled clustering, where the user may control the result of clustering by varying the values of the parameter, w. A small value of w results in a larger number of compact clusters, while a large value of w results in a smaller number of looser clusters. The other is an automatic clustering, where a heuristic strategy is applied to find a good clustering. Experimental results are given to illustrate the effectiveness of this genetic clustering algorithm.

论文关键词:Clustering,Single-linkage algorithm,Genetic clustering algorithm

论文评审过程:Received 13 October 1999, Revised 7 December 1999, Accepted 7 December 1999, Available online 7 June 2001.

论文官网地址:https://doi.org/10.1016/S0031-3203(00)00005-4