A New Conceptual Clustering Framework

作者:Nina Mishra, Dana Ron, Ram Swaminathan

摘要

We propose a new formulation of the conceptual clustering problem where the goal is to explicitly output a collection of simple and meaningful conjunctions of attributes that define the clusters. The formulation differs from previous approaches since the clusters discovered may overlap and also may not cover all the points. In addition, a point may be assigned to a cluster description even if it only satisfies most, and not necessarily all, of the attributes in the conjunction. Connections between this conceptual clustering problem and the maximum edge biclique problem are made. Simple, randomized algorithms are given that discover a collection of approximate conjunctive cluster descriptions in sublinear time.

论文关键词:conceptual clustering, maximum edge biclustering

论文评审过程:

论文官网地址:https://doi.org/10.1023/B:MACH.0000033117.77257.41