Local equivalences of distances between clusterings—a geometric perspective

作者:Marina Meilă

摘要

In comparing clusterings, several different distances and indices are in use. We prove that the Misclassification Error distance, the Hamming distance (equivalent to the unadjusted Rand index), and the χ 2 distance between partitions are equivalent in the neighborhood of 0. In other words, if two partitions are very similar, then one distance defines upper and lower bounds on the other and viceversa. The proofs are geometric and rely on the concavity of the distances. The geometric intuitions themselves advance the understanding of the space of all clusterings. To our knowledge, this is the first result of its kind.

论文关键词:Clustering, Comparing partitions, χ 2 divergence, Misclassification error, Rand index, Convexity

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10994-011-5267-2