Feature Enhanced Ensemble Modeling With Voting Optimization for Credit Risk Assessment

被引:3
|
作者
Yang, Dongqi [1 ]
Xiao, Binqing [1 ]
机构
[1] Nanjing Univ, Sch Management & Engn, Nanjing 210008, Peoples R China
来源
IEEE ACCESS | 2024年 / 12卷
基金
中国国家自然科学基金;
关键词
Risk management; Predictive models; Data models; Adaptation models; Accuracy; Training; Soft sensors; Credit risk; ensemble modeling; feature enhancement; model interpretability; voting optimization; PERFORMANCE; PREDICTION;
D O I
10.1109/ACCESS.2024.3445499
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Machine learning methods have gained widespread utilization in small and micro enterprise credit risk assessment. However, the practical application of these methods encounters a conundrum involving accuracy and interpretability. In this study, a multi-stage ensemble model is proposed to enhance the model's interpretability. To strengthen predictive portraits, a multi-feature enhancement method is proposed to integrate non-financial behavioral information and soft information on credit rating into the annual loan ledger data, thereby bolstering the explanatory capacity of the features. To rectify the issue of data imbalance and avoid information loss, a new bagging-based oversampling method is proposed to oversample the minority class samples in multiple parallelized subsets divided by the bagging strategy. To unleash the performance potential of base classifiers, a new voting-weight optimization method is proposed to optimize the soft voting weights of the candidate base classifiers. The experiment results of an annual loan ledger dataset of a commercial bank in China (with an accuracy of 97.9%, an area under the curve of 0.97, a logistic loss of 0.07, a Brier score of 0.01, and a Kolmogorov-Smirnov statistic of 0.38) and the other five public datasets indicating excellent model fit. By focusing on the widespread soft information and data structures characteristic of SME loan risk assessment data, an additional SHAP model explanation method enhances interpretability. This method reveals that the enhanced 'debt-to-income ratio,' along with non-financial behavioral information and features derived from soft information, are essential for predicting loan defaults. Such enhancements help to alleviate the issue of information asymmetry in SME loan risk assessment.
引用
收藏
页码:115124 / 115136
页数:13
相关论文
共 50 条
  • [1] A new hybrid credit scoring ensemble model with feature enhancement and soft voting weight optimization
    Yang, Dongqi
    Xiao, Binqing
    Cao, Mengya
    Shen, Huaqi
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 238
  • [2] A Sliding Window Based Meta-Majority of Voting Ensemble for Credit Risk Assessment
    Pradhan, Lopamudra
    Wang, Gi-Nam
    Dehuri, Satchidananda
    2013 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2013, : 2090 - 2094
  • [3] A multi-level classification based ensemble and feature extractor for credit risk assessment
    Wang, Yuanyuan
    Wu, Zhuang
    Gao, Jing
    Liu, Chenjun
    Guo, Fangfang
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [4] Enterprise Credit Risk Assessment Using Feature Selection Approach and Ensemble Learning Technique
    Wang, Di
    Zhang, Zuoquan
    2020 16TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND SECURITY (CIS 2020), 2020, : 228 - 233
  • [5] Feature selection in credit risk modeling: an international evidence
    Zhou, Ying
    Uddin, Mohammad Shamsu
    Habib, Tabassum
    Chi, Guotai
    Yuan, Kunpeng
    ECONOMIC RESEARCH-EKONOMSKA ISTRAZIVANJA, 2021, 34 (01): : 3064 - 3091
  • [6] A hybrid model with novel feature selection method and enhanced voting method for credit scoring
    Yao, Jianrong
    Wang, Zhongyi
    Wang, Lu
    Zhang, Zhebin
    Jiang, Hui
    Yan, Surong
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2022, 42 (03) : 2565 - 2579
  • [7] ANALYSIS OF FEATURE SELECTION TECHNIQUES IN CREDIT RISK ASSESSMENT
    Ramya, R. S.
    Kumaresan, S.
    ICACCS 2015 PROCEEDINGS OF THE 2ND INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING & COMMUNICATION SYSTEMS, 2015,
  • [8] Personal Credit Risk Assessment Based on Stacking Ensemble Model
    Wang, Maoguang
    Yu, Jiayu
    Ji, Zijian
    INTELLIGENT INFORMATION PROCESSING IX, 2018, 538 : 328 - 333
  • [9] Credit Risk Assessment Using Learning Algorithms for Feature Selection
    Hassani, Zeinab
    Meybodi, Mohsen Alambardar
    Hajihashemi, Vahid
    FUZZY INFORMATION AND ENGINEERING, 2020, 12 (04) : 529 - 544
  • [10] An ensemble wrapper feature selectionfor credit scoring
    Bouaguel, Waad
    Limam, Mohamed
    Advances in Intelligent Systems and Computing, 2015, 335 : 619 - 631