Generalized Adjusted Rand Indices for cluster ensembles
作者:
Highlights:
•
摘要
In this paper, Adjusted Rand Index (ARI) is generalized to two new measures based on matrix comparison: (i) Adjusted Rand Index between a similarity matrix and a cluster partition (ARImp), to evaluate the consistency of a set of clustering solutions with their corresponding consensus matrix in a cluster ensemble, and (ii) Adjusted Rand Index between similarity matrices (ARImm), to evaluate the consistency between two similarity matrices. Desirable properties of ARI are preserved in the two new measures, and new properties are discussed. These properties include: (i) detection of uncorrelatedness; (ii) computation of ARImp/ARImm in a distributed environment; and (iii) characterization of the degree of uncertainty of a consensus matrix. All of these properties are investigated from both the perspectives of theoretical analysis and experimental validation. We have also performed a number of experiments to show the usefulness and effectiveness of the two proposed measures in practical applications.
论文关键词:Cluster ensembles,Clustering evaluation,Adjusted Rand Index
论文评审过程:Received 17 June 2011, Revised 1 September 2011, Accepted 24 November 2011, Available online 2 December 2011.
论文官网地址:https://doi.org/10.1016/j.patcog.2011.11.017