An efficient hybrid data clustering method based on K-harmonic means and Particle Swarm Optimization

作者:

Highlights:

摘要

Clustering is the process of grouping data objects into set of disjoint classes called clusters so that objects within a class are highly similar with one another and dissimilar with the objects in other classes. K-means (KM) algorithm is one of the most popular clustering techniques because it is easy to implement and works fast in most situations. However, it is sensitive to initialization and is easily trapped in local optima. K-harmonic means (KHM) clustering solves the problem of initialization using a built-in boosting function, but it also easily runs into local optima. Particle Swarm Optimization (PSO) algorithm is a stochastic global optimization technique. A hybrid data clustering algorithm based on PSO and KHM (PSOKHM) is proposed in this research, which makes full use of the merits of both algorithms. The PSOKHM algorithm not only helps the KHM clustering escape from local optima but also overcomes the shortcoming of the slow convergence speed of the PSO algorithm. The performance of the PSOKHM algorithm is compared with those of the PSO and the KHM clustering on seven data sets. Experimental results indicate the superiority of the PSOKHM algorithm.

论文关键词:Data clustering,K-means,K-harmonic means,Particle Swarm Optimization*∗

论文评审过程:Available online 12 February 2009.

论文官网地址:https://doi.org/10.1016/j.eswa.2009.02.003