Cost-sensitive Global Model Trees applied to loan charge-off forecasting
作者:
Highlights:
• Cost-sensitive global model trees are proposed for asymmetric regression problems.
• The CGMT system considers the cost of errors during the evolutionary learning phase.
• Model trees induced by CGMT are simple and can have direct applicability.
摘要
Regression learning methods in real world applications often require cost minimization instead of the reduction of various metrics of prediction errors. Currently in the literature, there is a lack of white box solutions that can deal with forecasting problems where under-prediction and over-prediction errors have different consequences. To fill this gap, we introduced the Cost-sensitive Global Model Tree (CGMT), which applies a fitness function that minimizes an average misprediction cost. Proposed specialized genetic operators improve searching for optimal tree structure and cost-sensitive linear regression models in the leaves. Experimental validation is performed on loan charge-off data. It is known to be a difficult forecasting problem for banks due to the asymmetric cost structure. Obtained results show that specialized evolutionary algorithm applied to model tree induction finds significantly more accurate predictions than tested competitors. Decisions generated by the CGMT are simple, easy to interpret, and can be applied directly.
论文关键词:Cost-sensitive regression,Model trees,Evolutionary algorithms,Asymmetric costs,Loan charge-off forecasting
论文评审过程:Received 17 April 2014, Revised 10 February 2015, Accepted 30 March 2015, Available online 9 April 2015.
论文官网地址:https://doi.org/10.1016/j.dss.2015.03.009