Interpretable high-stakes decision support system for credit default forecasting

被引:4
|
作者
Sun, Weixin [1 ]
Zhang, Xuantao [2 ]
Li, Minghao [1 ]
Wang, Yong [1 ,3 ]
机构
[1] Dongbei Univ Finance & Econ, Sch Stat, Dalian 116023, Peoples R China
[2] Dongbei Univ Finance & Econ, Sch Management Sci & Engn, Dalian 116023, Peoples R China
[3] Dalian Municipal Res Inst Publ Hlth, Dalian 116037, Peoples R China
基金
中国国家自然科学基金;
关键词
High-stakes decision forecasting; Credit default forecasting; Interpretable machine learning; Imbalanced datasets; Resampling methods; PREDICTION;
D O I
10.1016/j.techfore.2023.122825
中图分类号
F [经济];
学科分类号
02 ;
摘要
Methods for forecasting credit default have long been research focus for financial institutions. In this study, we propose an interpretable high-stakes decision support system for credit default forecasting called CDFS. Because of the high-stake nature of credit default prediction, the proposed CDFS adheres to the principle of "people in the loop." The proposed CDFS comprises six modules: data processing, feature selection, data balancing, forecasting, evaluation, and interpretation. A feature selection method (permutation importance method), nine resampling methods, and six high-performance forecasting methods were employed in the proposed CDFS. The China Taiwan credit card default dataset and South Germany credit dataset were used to test the interpretability and predictive performance of the proposed CDFS. Experiments showed that the feature selection and data balancing modules of the CDFS effectively improve the prediction performance. A comparison with traditional logistic regression models demonstrated that the CDFS can provide decision-makers with satisfactory explanations for prediction results. In summary, the CDFS proposed in this study exhibited excellent predictive performance and satisfactory interpretability. This study contributes to improving the accuracy of credit default forecasting and reducing credit risk in financial institutions.
引用
收藏
页数:15
相关论文
共 50 条