An efficient instance selection algorithm to reconstruct training set for support vector machine

作者:

Highlights:

摘要

Support vector machine is a classification model which has been widely used in many nonlinear and high dimensional pattern recognition problems. However, it is inefficient or impracticable to implement support vector machine in dealing with large scale training set due to its computational difficulties as well as the model complexity. In this paper, we study the support vector recognition problem mainly in the context of the reduction methods to reconstruct training set for support vector machine. We focus on the fact of uneven distribution of instances in the vector space to propose an efficient self-adaption instance selection algorithm from the viewpoint of geometry-based method. Also, we conduct an experimental study involving eleven different sizes of datasets from UCI repository for measuring the performance of the proposed algorithm as well as six competitive instance selection algorithms in terms of accuracy, reduction capabilities, and runtime. The extensive experimental results show that the proposed algorithm outperforms most of competitive algorithms due to its high efficiency and efficacy.

论文关键词:Support vector machine,Instance selection,Machine learning

论文评审过程:Received 31 July 2016, Revised 18 October 2016, Accepted 31 October 2016, Available online 2 November 2016, Version of Record 14 December 2016.

论文官网地址:https://doi.org/10.1016/j.knosys.2016.10.031