Maximum weight and minimum redundancy: A novel framework for feature subset selection

作者:

Highlights:

摘要

Feature subset selection is often required as a preliminary work for many pattern recognition problems. In this paper, a novel filter framework is presented to select optimal feature subset based on a maximum weight and minimum redundancy (MWMR) criterion. Since the weight of each feature indicates its importance for some ad hoc tasks (such as clustering and classification) and the redundancy represents the correlations among features. Through the proposed MWMR, we can select the feature subset in which the features are most beneficial to the subsequent tasks while the redundancy among them is minimal. Moreover, a pair-wise updating based iterative algorithm is introduced to solve our framework effectively. In the experiments, three feature weighting algorithms (Laplacian score, Fisher score and Constraint score) are combined with two redundancy measurement methods (Pearson correlation coefficient and Mutual information) to test the performances of proposed MWMR. The experimental results on five different databases (CMU PIE, Extended YaleB, Colon, DLBCL and PCMAC) demonstrate the advantage and efficiency of our MWMR.

论文关键词:Feature selection,Maximum weight and minimum redundancy,Face recognition,Microarray classification,Text categorization

论文评审过程:Received 30 June 2012, Revised 17 October 2012, Accepted 27 November 2012, Available online 10 December 2012.

论文官网地址:https://doi.org/10.1016/j.patcog.2012.11.025