A hybrid filter/wrapper approach of feature selection using information theory

作者:

Highlights:

摘要

We focus on a hybrid approach of feature selection. We begin our analysis with a filter model, exploiting the geometrical information contained in the minimum spanning tree (MST) built on the learning set. This model exploits a statistical test of relative certainty gain, used in a forward selection algorithm. In the second part of the paper, we show that the MST can be replaced by the 1 nearest-neighbor graph without challenging the statistical framework. This leads to a feature selection algorithm belonging to a new category of hybrid models (filter-wrapper). Experimental results on readily available synthetic and natural domains are presented and discussed.

论文关键词:Machine learning,Data mining,Information theory,Feature selection,Wrapper models,Filter models

论文评审过程:Received 4 November 1999, Accepted 12 April 2001, Available online 17 December 2001.

论文官网地址:https://doi.org/10.1016/S0031-3203(01)00084-X