Non-linear metric learning using pairwise similarity and dissimilarity constraints and the geometrical structure of data

作者:

Highlights:

摘要

The problem of clustering with side information has received much recent attention and metric learning has been considered as a powerful approach to this problem. Until now, various metric learning methods have been proposed for semi-supervised clustering. Although some of the existing methods can use both positive (must-link) and negative (cannot-link) constraints, they are usually limited to learning a linear transformation (i.e., finding a global Mahalanobis metric). In this paper, we propose a framework for learning linear and non-linear transformations efficiently. We use both positive and negative constraints and also the intrinsic topological structure of data. We formulate our metric learning method as an appropriate optimization problem and find the global optimum of this problem. The proposed non-linear method can be considered as an efficient kernel learning method that yields an explicit non-linear transformation and thus shows out-of-sample generalization ability. Experimental results on synthetic and real-world data sets show the effectiveness of our metric learning method for semi-supervised clustering tasks.

论文关键词:Metric learning,Positive and negative constraints,Semi-supervised clustering,Optimization problem,Non-linear,Topological structure,Kernel

论文评审过程:Received 18 December 2008, Revised 18 January 2010, Accepted 26 February 2010, Available online 4 March 2010.

论文官网地址:https://doi.org/10.1016/j.patcog.2010.02.022