Speeding up incremental wrapper feature subset selection with Naive Bayes classifier

Authors:

Highlights:

Abstract

This paper addresses the problem of wrapper feature subset selection (FSS) in classification-oriented datasets with a very large number of attributes. In high-dimensional datasets with thousands of variables, wrapper FSS becomes computationally laborious because of the CPU time it requires. We study how, under certain circumstances, the wrapper FSS process can be sped up by embedding the classifier into the wrapper algorithm instead of treating it as a black box. Our proposal combines the Naive Bayes (NB) classifier (which is known to be largely beneficial for FSS) with incremental wrapper FSS algorithms. The merit of this approach is analyzed both theoretically and experimentally, and the results show an impressive speed-up for the embedded FSS process.
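To illustrate the general idea of embedding the classifier rather than calling it as a black box, here is a minimal sketch; it is not the authors' algorithm. Several things in it are assumptions made for illustration: the features are discrete, probabilities are Laplace-smoothed, candidates are scored on a single hold-out split instead of cross-validation, the search is a plain greedy forward selection, and the names `fit_tables` and `embedded_forward_selection` are hypothetical. The point it shows is that, because the NB log-score is a sum over features, the per-instance scores of the current subset can be cached and each candidate feature only adds its own term, so no classifier is retrained per candidate.

```python
import math
from collections import defaultdict


def fit_tables(X, y, n_features, alpha=1.0):
    """Estimate log P(c) and a smoothed log P(x_j = v | c) from discrete training data."""
    classes = sorted(set(y))
    class_count = {c: y.count(c) for c in classes}
    log_prior = {c: math.log(class_count[c] / len(y)) for c in classes}
    values = [{row[j] for row in X} for j in range(n_features)]
    counts = defaultdict(float)  # (feature index, value, class) -> frequency
    for row, c in zip(X, y):
        for j in range(n_features):
            counts[(j, row[j], c)] += 1

    def log_cond(j, v, c):
        # Laplace smoothing (assumed here, not taken from the paper).
        num = counts.get((j, v, c), 0.0) + alpha
        den = class_count[c] + alpha * len(values[j])
        return math.log(num / den)

    return classes, log_prior, log_cond


def accuracy(scores, y_true, classes):
    """Fraction of instances whose highest-scoring class matches the label."""
    hits = 0
    for row_scores, label in zip(scores, y_true):
        best = max(range(len(classes)), key=lambda k: row_scores[k])
        hits += classes[best] == label
    return hits / len(y_true)


def embedded_forward_selection(X_tr, y_tr, X_val, y_val):
    """Greedy forward selection that reuses cached NB log-scores."""
    n_features = len(X_tr[0])
    classes, log_prior, log_cond = fit_tables(X_tr, y_tr, n_features)
    # Cached per-instance, per-class log-scores of the current subset
    # (only the class prior while the subset is empty).
    scores = [[log_prior[c] for c in classes] for _ in X_val]
    best_acc = accuracy(scores, y_val, classes)
    selected, remaining = [], set(range(n_features))
    while remaining:
        best_j, best_j_acc, best_j_scores = None, best_acc, None
        for j in sorted(remaining):
            # Incremental step: add only feature j's term to the cached scores.
            cand = [
                [s + log_cond(j, row[j], c) for s, c in zip(row_scores, classes)]
                for row_scores, row in zip(scores, X_val)
            ]
            acc = accuracy(cand, y_val, classes)
            if acc > best_j_acc:
                best_j, best_j_acc, best_j_scores = j, acc, cand
        if best_j is None:  # no candidate strictly improves accuracy: stop
            break
        selected.append(best_j)
        remaining.discard(best_j)
        scores, best_acc = best_j_scores, best_j_acc
    return selected, best_acc
```

In this sketch, evaluating one candidate costs a single pass over the validation instances and classes regardless of how many features are already selected, whereas a black-box wrapper would recompute the full NB score over the whole candidate subset each time.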

Keywords: Wrapper feature subset selection, Incremental algorithms, Naive Bayes, High-dimensional data

Review history: Received 13 February 2013, Revised 11 October 2013, Accepted 13 October 2013, Available online 24 October 2013.

Official paper URL: https://doi.org/10.1016/j.knosys.2013.10.016