A framework for semi-supervised and unsupervised optimal extraction of clusters from hierarchies

作者:R. J. G. B. Campello, D. Moulavi, A. Zimek, J. Sander

摘要

We introduce a framework for the optimal extraction of flat clusterings from local cuts through cluster hierarchies. The extraction of a flat clustering from a cluster tree is formulated as an optimization problem and a linear complexity algorithm is presented that provides the globally optimal solution to this problem in semi-supervised as well as in unsupervised scenarios. A collection of experiments is presented involving clustering hierarchies of different natures, a variety of real data sets, and comparisons with specialized methods from the literature.

论文关键词:Hierarchical clustering, Optimal selection of clusters , Should-link and should-not-link constraints, Cluster quality

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10618-013-0311-4