A decision rule-based method for feature selection in predictive data mining

作者:

Highlights:

摘要

Algorithms for feature selection in predictive data mining for classification problems attempt to select those features that are relevant, and are not redundant for the classification task. A relevant feature is defined as one which is highly correlated with the target function. One problem with the definition of feature relevance is that there is no universally accepted definition of what it means for a feature to be ‘highly correlated with the target function or highly correlated with the other features’. A new feature selection algorithm which incorporates domain specific definitions of high, medium and low correlations is proposed in this paper. The proposed algorithm conducts a heuristic search for the most relevant features for the prediction task.

论文关键词:Feature selection,Feature subset search,Predictive data mining

论文评审过程:Available online 27 June 2009.

论文官网地址:https://doi.org/10.1016/j.eswa.2009.06.031