A survey on feature selection approaches for clustering

作者:Emrah Hancer, Bing Xue, Mengjie Zhang

摘要

The massive growth of data in recent years has led challenges in data mining and machine learning tasks. One of the major challenges is the selection of relevant features from the original set of available features that maximally improves the learning performance over that of the original feature set. This issue attracts researchers’ attention resulting in a variety of successful feature selection approaches in the literature. Although there exist several surveys on unsupervised learning (e.g., clustering), lots of works concerning unsupervised feature selection are missing in these surveys (e.g., evolutionary computation based feature selection for clustering) for identifying the strengths and weakness of those approaches. In this paper, we introduce a comprehensive survey on feature selection approaches for clustering by reflecting the advantages/disadvantages of current approaches from different perspectives and identifying promising trends for future research.

论文关键词:Clustering, Feature selection, Data mining, Evolutionary computation

论文评审过程:

论文官网地址:https://doi.org/10.1007/s10462-019-09800-w