Testing Different Ensemble Configurations for Feature Selection
作者:Borja Seijo-Pardo, Verónica Bolón-Canedo, Amparo Alonso-Betanzos
摘要
In recent years, ensemble learning has become a prolific area of study in pattern recognition, based on the assumption that using and combining different learning models in the same problem could lead to better performance results than using a single model. This idea of ensemble learning has traditionally been used for classification tasks, but has more recently been adapted to other machine learning tasks such as clustering and feature selection. We propose several feature selection ensemble configurations based on combining rankings of features from individual rankers according to the combination method and threshold value used. The performance of each proposed ensemble configuration was tested for synthetic datasets (to assess the adequacy of the selection), real classical datasets (with more samples than features), and DNA microarray datasets (with more features than samples). Five different classifiers were studied in order to test the suitability of the proposed ensemble configurations and assess the results.
论文关键词:Ensemble learning, Feature selection, Ranking aggregation, Classification, Fisher’s ratio, DNA microarray
论文评审过程:
论文官网地址:https://doi.org/10.1007/s11063-017-9619-1