Fast agglomerative clustering using information of k-nearest neighbors

Authors:

Highlights:

Abstract

In this paper, we develop a method to lower the computational complexity of the pairwise nearest neighbor (PNN) algorithm. After each cluster merge, our approach determines a set of candidate clusters that may need to be updated. For the candidates that do require an update, the k-nearest neighbors are found. The number of distance calculations for our method is O(N^2), where N is the number of data points. To further reduce the computational cost of the proposed algorithm, existing fast search approaches are used. Compared with available approaches, the proposed algorithm significantly reduces both the computing time and the number of distance calculations. For a data set taken from a real image, it reduces the computing time by a factor of about 26.8 compared with FPNN and by a factor of about 3.8 compared with PMLFPNN.
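For orientation, the following is a minimal sketch of the baseline PNN merge loop that the abstract sets out to accelerate. It is not the paper's k-nearest-neighbor method, whose details the abstract does not give. The sketch assumes the usual Ward-style PNN merge cost, n_a * n_b / (n_a + n_b) * ||c_a - c_b||^2, and uses a simple cached-nearest-neighbor update in which only the merged cluster and clusters whose cached partner disappeared are searched again. The names `merge_cost` and `pnn_cluster` are hypothetical and chosen for illustration only.

```python
def merge_cost(ca, na, cb, nb):
    """Assumed Ward-style PNN cost: squared-error increase if the clusters merge."""
    sq = sum((x - y) ** 2 for x, y in zip(ca, cb))
    return na * nb / (na + nb) * sq


def pnn_cluster(points, m):
    """Merge N singleton clusters down to m; returns (labels, cost evaluations)."""
    # cluster id -> (centroid, size, member point indices)
    clusters = {i: (tuple(float(v) for v in p), 1, [i])
                for i, p in enumerate(points)}
    nn = {}      # cluster id -> (cost, id of cheapest merge partner)
    evals = 0    # number of merge-cost (distance) calculations

    def search(i):
        """Full search for the cheapest merge partner of cluster i."""
        nonlocal evals
        ci, ni, _ = clusters[i]
        best_cost, best_j = float("inf"), None
        for j, (cj, nj, _) in clusters.items():
            if j == i:
                continue
            evals += 1
            cost = merge_cost(ci, ni, cj, nj)
            if cost < best_cost:
                best_cost, best_j = cost, j
        nn[i] = (best_cost, best_j)

    for i in clusters:
        search(i)

    next_id = len(points)
    while len(clusters) > m:
        # Pick the globally cheapest pair from the cached partners.
        a = min(nn, key=lambda i: nn[i][0])
        b = nn[a][1]
        ca, na, ma = clusters.pop(a)
        cb, nb, mb = clusters.pop(b)
        del nn[a], nn[b]
        n = na + nb
        centroid = tuple((na * x + nb * y) / n for x, y in zip(ca, cb))
        clusters[next_id] = (centroid, n, ma + mb)
        # Only clusters whose cached partner vanished need a new search,
        # plus the newly created cluster itself.
        stale = [i for i in nn if nn[i][1] in (a, b)]
        for i in stale + [next_id]:
            search(i)
        next_id += 1

    labels = [0] * len(points)
    for label, (_, _, members) in enumerate(clusters.values()):
        for p in members:
            labels[p] = label
    return labels, evals


if __name__ == "__main__":
    pts = [(0, 0), (0, 1), (10, 10), (10, 11), (0, 0.5)]
    labels, evals = pnn_cluster(pts, 2)
    print(labels, evals)
```

The counter `evals` corresponds to the "number of distance calculations" that the abstract uses as a cost measure. In this sketch each stale cluster still triggers a full search; the abstract indicates that the proposed method instead limits the work to a candidate set and k-nearest-neighbor information.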

Keywords: Nearest neighbor, Agglomerative clustering, Vector quantization

Review history: Received 2 November 2009, Revised 13 June 2010, Accepted 27 June 2010, Available online 1 July 2010.

Official paper URL: https://doi.org/10.1016/j.patcog.2010.06.021