A computationally efficient technique for data-clustering
作者:
Highlights:
•
摘要
A computationally efficient agglomerative clustering algorithm based on multilevel theory is presented. Here, the data set is divided randomly into a number of partitions. The samples of each such partition are clustered separately using hierarchical agglomerative clustering algorithm to form sub-clusters. These are merged at higher levels to get the final classification. This algorithm leads to the same classification as that of hierarchical agglomerative clustering algorithm when the clusters are well separated. The advantages of this algorithm are short run time and small storage requirement. It is observed that the savings, in storage space and computation time, increase nonlinearly with the sample size.
论文关键词:Nonparametric,Agglomerative,Multilevel theory,Partitioning,Relabeling,Representative samples,Well separated clusters
论文评审过程:Received 10 December 1979, Available online 19 May 2003.
论文官网地址:https://doi.org/10.1016/0031-3203(80)90039-4