An iterative model-free feature screening procedure: Forward recursive selection

作者:

Highlights:

摘要

Many researchers have studied the combinations of machine learning techniques and traditional statistical strategies, and proposed effective procedures for complicated data sets. Yet, there is still some lack of running time and prediction accuracy. In this paper, we propose an iterative feature screening procedure, named forward recursive selection. We combine the random forest and forward selection to address the model-based limitations and the related requirements. We also use the forward strategy with a limited number of iterations to improve the computational efficiency. To provide the theoretical guarantees of this method, we calculate functions of the permutation importance of this algorithm in different models and data with group structures. Numerical comparisons and empirical analysis support our results, and the proposed procedure works well.

论文关键词:Random forest,Forward selection,Iterative algorithm,Statistical modeling

论文评审过程:Received 18 October 2021, Revised 24 February 2022, Accepted 2 April 2022, Available online 8 April 2022, Version of Record 23 April 2022.

论文官网地址:https://doi.org/10.1016/j.knosys.2022.108745