Unsupervised feature selection via maximum projection and minimum redundancy

作者:

Highlights:

摘要

Dimensionality reduction is an important and challenging task in machine learning and data mining. It can facilitate data clustering, classification and information retrieval. As an efficient technique for dimensionality reduction, feature selection is about finding a small feature subset preserving the most relevant information. In this paper, we propose a new criterion, called maximum projection and minimum redundancy feature selection, to address unsupervised learning scenarios. First, the feature selection is formalized with the use of the projection matrices and then characterized equivalently as a matrix factorization problem. Second, an iterative update algorithm and a greedy algorithm are proposed to tackle this problem. Third, kernel techniques are considered and the corresponding algorithm is also put forward. Finally, the proposed algorithms are compared with four state-of-the-art feature selection methods. Experimental results reported for six publicly datasets demonstrate the superiority of the proposed algorithms.

论文关键词:Machine learning,Feature selection,Unsupervised learning,Matrix factorization,Kernel method,Minimum redundancy

论文评审过程:Received 11 February 2014, Revised 23 October 2014, Accepted 8 November 2014, Available online 25 November 2014.

论文官网地址:https://doi.org/10.1016/j.knosys.2014.11.008