Newtonian clustering: An approach based on molecular dynamics and global optimization

作者:

Highlights:

摘要

Given a data set, a dynamical procedure is applied to the data points in order to shrink and separate, possibly overlapping clusters. Namely, Newton's equations of motion are employed to concentrate the data points around their cluster centers, using an attractive potential, constructed specially for this purpose. During this process, important information is gathered concerning the spread of each cluster. In succession this information is used to create an objective function that maps each cluster to a local maximum. Global optimization is then used to retrieve the positions of the maxima that correspond to the locations of the cluster centers. Further refinement is achieved by applying the EM-algorithm to a Gaussian mixture model whose construction and initialization is based on the acquired information. To assess the effectiveness of our method, we have conducted experiments on a plethora of benchmark data sets. In addition we have compared its performance against four clustering techniques that are well established in the literature.

论文关键词:Clustering,Molecular dynamics,Global optimization,Order statistics

论文评审过程:Received 22 September 2005, Revised 2 May 2006, Accepted 28 July 2006, Available online 25 January 2007.

论文官网地址:https://doi.org/10.1016/j.patcog.2006.07.012