Semi-supervised learning based on nearest neighbor rule and cut edges

作者:

Highlights:

摘要

In this paper, we propose a novel semi-supervised learning approach based on nearest neighbor rule and cut edges. In the first step of our approach, a relative neighborhood graph based on all training samples is constructed for each unlabeled sample, and the unlabeled samples whose edges are all connected to training samples from the same class are labeled. These newly labeled samples are then added into the training samples. In the second step, standard self-training algorithm using nearest neighbor rule is applied for classification until a predetermined stopping criterion is met. In the third step, a statistical test is applied for label modification, and in the last step, the remaining unlabeled samples are classified using standard nearest neighbor rule. The main advantages of the proposed method are: (1) it reduces the error reinforcement by using relative neighborhood graph for classification in the initial stages of semi-supervised learning; (2) it introduces a label modification mechanism for better classification performance. Experimental results show the effectiveness of the proposed approach.

论文关键词:Semi-supervised learning,Nearest neighbor rule,Cut edges

论文评审过程:Received 27 February 2009, Revised 16 March 2010, Accepted 18 March 2010, Available online 23 March 2010.

论文官网地址:https://doi.org/10.1016/j.knosys.2010.03.012