Hybrid attribute reduction based on a novel fuzzy-rough model and information granulation

作者:

Highlights:

摘要

Feature subset selection has become an important challenge in areas of pattern recognition, machine learning and data mining. As different semantics are hidden in numerical and categorical features, there are two strategies for selecting hybrid attributes: discretizing numerical variables or numericalize categorical features. In this paper, we introduce a simple and efficient hybrid attribute reduction algorithm based on a generalized fuzzy-rough model. A theoretic framework of fuzzy-rough model based on fuzzy relations is presented, which underlies a foundation for algorithm construction. We derive several attribute significance measures based on the proposed fuzzy-rough model and construct a forward greedy algorithm for hybrid attribute reduction. The experiments show that the technique of variable precision fuzzy inclusion in computing decision positive region can get the optimal classification performance. Number of the selected features is the least but accuracy is the best.

论文关键词:Numerical feature,Categorical feature,Feature selection,Attribute reduction,Fuzzy set,Rough set,Inclusion degree

论文评审过程:Received 2 April 2006, Revised 15 January 2007, Accepted 15 March 2007, Available online 30 March 2007.

论文官网地址:https://doi.org/10.1016/j.patcog.2007.03.017