Compressed constrained spectral clustering framework for large-scale data sets

作者:

Highlights:

摘要

The method of incorporating constraint information into spectral clustering, i.e., \constrained spectral clustering (CSC), can greatly improve clustering accuracy, and thus has been widely employed in the machine learning literature. In this paper, we propose a compressed CSC framework by combining specific graph constructions with a recently introduced CSC model. Particularly, our framework has ability to avoid losing the main partition information in the compression process. By presenting a theoretical analysis and empirical results, we demonstrate that our new framework can achieve the same clustering solution as that of the original model with the specific graph structure. In addition, because our framework utilizes landmark-based graph construction and the approximate matrix decomposition simultaneously, it can be applied to both feature and graph data in a more general way. Moreover, the parameter setting in our framework is rather simple, and therefore it is very practical. Experimental results indicate that our framework has advantages in terms of efficiency and effectiveness.

论文关键词:Constrained spectral clustering,Landmark,Matrix decomposition,Efficiency,Effectiveness

论文评审过程:Received 15 December 2016, Revised 26 July 2017, Accepted 4 August 2017, Available online 5 August 2017, Version of Record 22 September 2017.

论文官网地址:https://doi.org/10.1016/j.knosys.2017.08.003