Feature selection in bankruptcy prediction

作者:

Highlights:

摘要

For many corporations, assessing the credit of investment targets and the possibility of bankruptcy is a vital issue before investment. Data mining and machine learning techniques have been applied to solve the bankruptcy prediction and credit scoring problems. As feature selection is an important step to select more representative data from a given dataset in data mining to improve the final prediction performance, it is unknown that which feature selection method is better. Therefore, this paper aims at comparing five well-known feature selection methods used in bankruptcy prediction, which are t-test, correlation matrix, stepwise regression, principle component analysis (PCA) and factor analysis (FA) to examine their prediction performance. Multi-layer perceptron (MLP) neural networks are used as the prediction model. Five related datasets are used in order to provide a reliable conclusion. Regarding the experimental results, the t-test feature selection method outperforms the other ones by the two performance measurements.

论文关键词:Feature selection,Data mining,Bankruptcy prediction,Neural networks

论文评审过程:Received 9 January 2008, Revised 14 July 2008, Accepted 7 August 2008, Available online 14 August 2008.

论文官网地址:https://doi.org/10.1016/j.knosys.2008.08.002