A survey on online feature selection with streaming features

作者:Xuegang Hu, Peng Zhou, Peipei Li, Jing Wang, Xindong Wu

摘要

In the era of big data, the dimensionality of data is increasing dramatically in many domains. To deal with high dimensionality, online feature selection becomes critical in big data mining. Recently, online selection of dynamic features has received much attention. In situations where features arrive sequentially over time, we need to perform online feature selection upon feature arrivals. Meanwhile, considering grouped features, it is necessary to deal with features arriving by groups. To handle these challenges, some state-of-the-art methods for online feature selection have been proposed. In this paper, we first give a brief review of traditional feature selection approaches. Then we discuss specific problems of online feature selection with feature streams in detail. A comprehensive review of existing online feature selection methods is presented by comparing with each other. Finally, we discuss several open issues in online feature selection.

论文关键词:big data, feature selection, online feature selection, feature stream

论文评审过程:

论文官网地址:https://doi.org/10.1007/s11704-016-5489-3