Learning a Mahalanobis distance metric for data clustering and classification

作者:

Highlights:

摘要

Distance metric is a key issue in many machine learning algorithms. This paper considers a general problem of learning from pairwise constraints in the form of must-links and cannot-links. As one kind of side information, a must-link indicates the pair of the two data points must be in a same class, while a cannot-link indicates that the two data points must be in two different classes. Given must-link and cannot-link information, our goal is to learn a Mahalanobis distance metric. Under this metric, we hope the distances of point pairs in must-links are as small as possible and those of point pairs in cannot-links are as large as possible. This task is formulated as a constrained optimization problem, in which the global optimum can be obtained effectively and efficiently. Finally, some applications in data clustering, interactive natural image segmentation and face pose estimation are given in this paper. Experimental results illustrate the effectiveness of our algorithm.

论文关键词:Distance metric learning,Mahalanobis distance,Global optimization,Data clustering,Interactive image segmentation,Face pose estimation

论文评审过程:Received 7 October 2007, Revised 27 February 2008, Accepted 16 May 2008, Available online 28 May 2008.

论文官网地址:https://doi.org/10.1016/j.patcog.2008.05.018