Two-stage consumer credit risk modelling using heterogeneous ensemble learning

作者:

Highlights:

• A two-stage credit risk model is proposed to predict expected loss.

• Combines class-imbalanced ensemble credit scoring with regression ensemble

• Predicts consumer credit risk of a non-bank financial institution and P2P lending

• Multi-objective evolutionary feature selection is used in both stages.

• Measure of misclassification cost is proposed for loans with fixed exposure.

摘要

Modelling consumer credit risk is a crucial task for banks and non-bank financial institutions to support decision-making on granting loans. To model the overall credit risk of a consumer loan in terms of expected loss (EL), three key credit risk parameters must be estimated: probability of default (PD), loss given default (LGD) and exposure at default (EAD). Research to date has tended to model these parameters separately. Moreover, a neglected area in the field of LGD/EAD modelling is the application of ensemble learning, which by benefitting from diverse base learners reduces the over-fitting problem and enables modelling diverse risk profiles of defaulted loans. To overcome these problems, this paper proposes a two-stage credit risk model that integrates (1) class-imbalanced ensemble learning for predicting PD (credit scoring), and (2) an EAD prediction using a regression ensemble. Furthermore, multi-objective evolutionary feature selection is used to minimize both the misclassification cost (root mean squared error) of the PD and EAD models and the number of attributes necessary for modelling. For this task, we propose a misclassification cost metric suitable for consumer loans with fixed exposure because it combines opportunity cost and LGD. We show that the proposed credit risk model is not only more effective than single-stage credit risk models but also outperforms state-of-the-art methods used to model credit risk in terms of prediction and economic performance.

论文关键词:Credit risk,Ensemble learning,Credit scoring,Expected loss,Exposure at default

论文评审过程:Received 9 October 2018, Revised 24 December 2018, Accepted 7 January 2019, Available online 9 January 2019, Version of Record 12 January 2019.

论文官网地址:https://doi.org/10.1016/j.dss.2019.01.002