Exceptionally monotone models—the rank correlation model class for Exceptional Model Mining

作者:Lennart Downar, Wouter Duivesteijn

摘要

Exceptional Model Mining strives to find coherent subgroups of the dataset where multiple target attributes interact in an unusual way. One instance of such an investigated form of interaction is Pearson’s correlation coefficient between two targets. EMM then finds subgroups with an exceptionally linear relation between the targets. In this paper, we enrich the EMM toolbox by developing the more general rank correlation model class. We find subgroups with an exceptionally monotone relation between the targets. Apart from catering for this richer set of relations, the rank correlation model class does not necessarily require the assumption of target normality, which is implicitly invoked in the Pearson’s correlation model class. Furthermore, it is less sensitive to outliers. We provide pseudocode for the employed algorithm and analyze its computational complexity, and experimentally illustrate what the rank correlation model class for EMM can find for you on six datasets from an eclectic variety of domains.

论文关键词:Rank correlation, Exceptional Model Mining, Monotonicity, Subgroup Discovery, Data mining

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10115-016-0979-z