Active semi-supervised fuzzy clustering

作者:

Highlights:

摘要

Clustering algorithms are increasingly employed for the categorization of image databases, in order to provide users with database overviews and make their access more effective. By including information provided by the user, the categorization process can produce results that come closer to user's expectations. To make such a semi-supervised categorization approach acceptable for the user, this information must be of a very simple nature and the amount of information the user is required to provide must be minimized. We propose here an effective semi-supervised clustering algorithm, active fuzzy constrained clustering (AFCC), that minimizes a competitive agglomeration cost function with fuzzy terms corresponding to pairwise constraints provided by the user. In order to minimize the amount of constraints required, we define an active mechanism for the selection of candidate constraints. The comparisons performed on a simple benchmark and on a ground truth image database show that with AFCC the results of clustering can be significantly improved with few constraints, making this semi-supervised approach an attractive alternative in the categorization of image databases.

论文关键词:Semi-supervised clustering,Image database categorization,Pairwise constraints,Active learning

论文评审过程:Received 19 July 2005, Revised 2 October 2007, Accepted 3 October 2007, Available online 12 October 2007.

论文官网地址:https://doi.org/10.1016/j.patcog.2007.10.004