Measures of Diversity in Classifier Ensembles and Their Relationship with the Ensemble Accuracy
作者:Ludmila I. Kuncheva, Christopher J. Whitaker
摘要
Diversity among the members of a team of classifiers is deemed to be a key issue in classifier combination. However, measuring diversity is not straightforward because there is no generally accepted formal definition. We have found and studied ten statistics which can measure diversity among binary classifier outputs (correct or incorrect vote for the class label): four averaged pairwise measures (the Q statistic, the correlation, the disagreement and the double fault) and six non-pairwise measures (the entropy of the votes, the difficulty index, the Kohavi-Wolpert variance, the interrater agreement, the generalized diversity, and the coincident failure diversity). Four experiments have been designed to examine the relationship between the accuracy of the team and the measures of diversity, and among the measures themselves. Although there are proven connections between diversity and accuracy in some special cases, our results raise some doubts about the usefulness of diversity measures in building classifier ensembles in real-life pattern recognition problems.
论文关键词:pattern recognition, multiple classifiers ensemble/committee of learners, dependency and diversity, majority vote
论文评审过程:
论文官网地址:https://doi.org/10.1023/A:1022859003006