Feature Enhanced Ensemble Modeling With Voting Optimization for Credit Risk Assessment

Cited by: 3
Authors
Yang, Dongqi [1 ]
Xiao, Binqing [1 ]
Affiliations
[1] Nanjing Univ, Sch Management & Engn, Nanjing 210008, Peoples R China
Source
IEEE ACCESS | 2024 / Vol. 12
Funding
National Natural Science Foundation of China;
Keywords
Risk management; Predictive models; Data models; Adaptation models; Accuracy; Training; Soft sensors; Credit risk; ensemble modeling; feature enhancement; model interpretability; voting optimization; PERFORMANCE; PREDICTION;
DOI
10.1109/ACCESS.2024.3445499
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
Machine learning methods are widely used in small and micro enterprise credit risk assessment. However, their practical application faces a trade-off between accuracy and interpretability. In this study, a multi-stage ensemble model is proposed to enhance the model's interpretability. To strengthen predictive portraits, a multi-feature enhancement method is proposed to integrate non-financial behavioral information and soft information on credit rating into the annual loan ledger data, thereby bolstering the explanatory capacity of the features. To rectify data imbalance and avoid information loss, a new bagging-based oversampling method is proposed to oversample the minority class samples in multiple parallelized subsets divided by the bagging strategy. To unleash the performance potential of base classifiers, a new voting-weight optimization method is proposed to optimize the soft voting weights of the candidate base classifiers. Experimental results on an annual loan ledger dataset from a commercial bank in China (with an accuracy of 97.9%, an area under the curve of 0.97, a logistic loss of 0.07, a Brier score of 0.01, and a Kolmogorov-Smirnov statistic of 0.38) and on five other public datasets indicate excellent model fit. By focusing on the widespread soft information and data structures characteristic of SME loan risk assessment data, an additional SHAP model explanation method enhances interpretability. This method reveals that the enhanced 'debt-to-income ratio', non-financial behavioral information, and features derived from soft information are essential for predicting loan defaults. Such enhancements help to alleviate the issue of information asymmetry in SME loan risk assessment.
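The abstract does not say how the soft voting weights are optimized, so the following is only a minimal illustrative sketch, not the authors' method. It assumes a coarse grid search over weights that sum to 1, scored by logistic loss on a validation set. The function names (`soft_vote`, `optimize_weights`), the toy labels, and the toy probabilities are all hypothetical.

```python
from itertools import product
from math import log


def soft_vote(probs, weights):
    """Weighted average of each classifier's predicted default probability."""
    total = sum(weights)
    n_samples = len(probs[0])
    return [
        sum(w * p[i] for w, p in zip(weights, probs)) / total
        for i in range(n_samples)
    ]


def log_loss(y_true, y_prob, eps=1e-12):
    """Binary logistic loss; probabilities are clipped to avoid log(0)."""
    clipped = [min(max(p, eps), 1 - eps) for p in y_prob]
    return -sum(
        y * log(p) + (1 - y) * log(1 - p) for y, p in zip(y_true, clipped)
    ) / len(y_true)


def brier_score(y_true, y_prob):
    """Mean squared difference between predicted probability and outcome."""
    return sum((p - y) ** 2 for y, p in zip(y_true, y_prob)) / len(y_true)


def optimize_weights(probs, y_true, steps=4):
    """Assumed search strategy: try every weight vector on a grid of
    multiples of 1/steps that sums to 1, and keep the lowest log loss."""
    best_w, best_loss = None, float("inf")
    for combo in product(range(steps + 1), repeat=len(probs)):
        if sum(combo) != steps:
            continue
        weights = [c / steps for c in combo]
        loss = log_loss(y_true, soft_vote(probs, weights))
        if loss < best_loss:
            best_w, best_loss = weights, loss
    return best_w, best_loss


# Hypothetical validation labels (1 = default) and base-classifier outputs.
y_val = [0, 0, 1, 1, 0, 1]
val_probs = [
    [0.1, 0.4, 0.8, 0.6, 0.2, 0.9],  # classifier A
    [0.3, 0.2, 0.6, 0.9, 0.4, 0.7],  # classifier B
    [0.5, 0.5, 0.5, 0.5, 0.5, 0.5],  # uninformative classifier C
]
best_w, best_loss = optimize_weights(val_probs, y_val)
ensemble_brier = brier_score(y_val, soft_vote(val_probs, best_w))
```

Because the grid includes the weight vectors that put all mass on a single classifier, the selected ensemble is never worse on the validation set than the best individual base classifier under the chosen loss. A finer grid or a continuous optimizer could replace the search without changing the voting step.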
Pages: 115124 - 115136
Number of pages: 13
Related papers
50 in total
  • [31] Ensemble Voting for Enhanced Robustness in DarkNet Traffic Detection
    Shinde, Varun
    Singhal, Kartik
    Almogren, Ahmad
    Dhanawat, Vineet
    Karande, Vishal
    Rehman, Ateeq Ur
    IEEE ACCESS, 2024, 12 : 177064 - 177079
  • [32] Modeling Cloud Computing Risk Assessment Using Ensemble Methods
    Ahmed, Nada
    Abraham, Ajith
    PATTERN ANALYSIS, INTELLIGENT SECURITY AND THE INTERNET OF THINGS, 2015, 355 : 261 - 274
  • [33] MSEs Credit Risk Assessment Model Based on Federated Learning and Feature Selection
    Xu, Zhanyang
    Cheng, Jianchun
    Cheng, Luofei
    Xu, Xiaolong
    Bilal, Muhammad
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 75 (03): : 5573 - 5595
  • [34] Genetic algorithm-based heuristic for feature selection in credit risk assessment
    Oreski, Stjepan
    Oreski, Goran
    EXPERT SYSTEMS WITH APPLICATIONS, 2014, 41 (04) : 2052 - 2064
  • [35] A comparative assessment of ensemble learning for credit scoring
    Wang, Gang
    Hao, Jinxing
    Ma, Jian
    Jiang, Hongbing
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (01) : 223 - 230
  • [36] Stacking ensemble method for personal credit risk assessment in Peer-to-Peer lending
    Yin, Wei
    Kirkulak-Uludag, Berna
    Zhu, Dongmei
    Zhou, Zixuan
    APPLIED SOFT COMPUTING, 2023, 142
  • [37] A hybrid ensemble approach for enterprise credit risk assessment based on Support Vector Machine
    Wang, Gang
    Ma, Jian
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (05) : 5325 - 5331
  • [38] Credit risk assessment
    Gustafson, Cole R.
    Pederson, Glenn D.
    Gloy, Brent A.
    AGRICULTURAL FINANCE REVIEW, 2005, 65 (02) : 201 - +
  • [39] Credit Risk Assessment
    Pokorna, Martina
    Sponer, Miroslav
    EUROPEAN FINANCIAL SYSTEMS 2015: PROCEEDINGS OF THE 12TH INTERNATIONAL SCIENTIFIC CONFERENCE, 2015, : 455 - 461
  • [40] Enhancing credit risk prediction based on ensemble tree-based feature transformation and logistic regression
    Liu, Jiaming
    Liu, Jiajia
    Wu, Chong
    Wang, Shouyang
    JOURNAL OF FORECASTING, 2024, 43 (02) : 429 - 455