Tree-based heterogeneous cascade ensemble model for credit scoring

被引:12
|
作者
Liu, Wanan [1 ]
Fan, Hong [1 ]
Xia, Meng [2 ]
机构
[1] Donghua Univ, Glorious Sun Sch Business & Management, Shanghai 200051, Peoples R China
[2] Donghua Univ, Coll Informat Sci & Technol, Shanghai, Peoples R China
基金
上海市自然科学基金; 中国国家自然科学基金;
关键词
Credit scoring; Ensemble algorithm; Heterogeneous deep forest; Weighted voting mechanism; Interpretability; ART CLASSIFICATION ALGORITHMS; BANKRUPTCY PREDICTION; FEATURE-SELECTION; IMPACT; PERFORMANCE; MACHINES;
D O I
10.1016/j.ijforecast.2022.07.007
中图分类号
F [经济];
学科分类号
02 ;
摘要
Credit scoring is an important tool to guard against commercial risks for banks and lending companies and provides good conditions for the construction of individual personal credit. Ensemble algorithms have shown appealing progress for the improvement of credit scoring. In this study, to meet the challenge of large-scale credit scoring, we propose a heterogeneous deep forest model (Heter-DF), which is established based on considerations ranging from base learner selection, encouragement of the diversity of base learners, and ensemble strategies, for credit scoring. Heter-DF is designed as a scalable cascading framework that can increase its complexity with the scale of the credit dataset. Moreover, each level of Heter-DF is built by multiple heterogeneous tree-based ensembled base learners, avoiding the homogeneous prediction of the ensemble framework. In addition, a weighted voting mechanism is introduced to highlight important information and suppress irrelevant features, making Heter-DF a robust model for credit scoring. Experimental results on four credit scoring datasets and six evaluation metrics show that the cascading framework a good choice for the ensemble of tree-based base learners. A comparison among homogeneous ensembles and heterogeneous ensembles further demonstrates the effectiveness of Heter-DF. Experiments on different training sets indicate that Heter-DF is a scalable framework which not only deals with large-scale credit scoring but also satisfies the condition where small-scale credit scoring is desirable. Finally, based on the good interpretability of a tree-based structure, the global interpretation of Heter-DF is preliminarily explored. (c) 2022 International Institute of Forecasters. Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:1593 / 1614
页数:22
相关论文
共 50 条
  • [1] A novel tree-based dynamic heterogeneous ensemble method for credit scoring
    Xia, Yufei
    Zhao, Junhao
    He, Lingyun
    Li, Yinguo
    Niu, Mengyi
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 159
  • [2] A novel heterogeneous ensemble credit scoring model based on bstacking approach
    Xia, Yufei
    Liu, Chuanzhe
    Da, Bowen
    Xie, Fangming
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 93 : 182 - 199
  • [3] An ensemble credit scoring model based on logistic regression with heterogeneous balancing and weighting effects
    Runchi, Zhang
    Liguo, Xue
    Qin, Wang
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 212
  • [4] An interpretable decision tree ensemble model for imbalanced credit scoring datasets
    My, Bui T. T.
    Ta, Bao Q.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 45 (06) : 10853 - 10864
  • [5] A heterogeneous ensemble credit scoring model based on adaptive classifier selection: An application on imbalanced data
    Zhang, Tong
    Chi, Guotai
    INTERNATIONAL JOURNAL OF FINANCE & ECONOMICS, 2021, 26 (03) : 4372 - 4385
  • [6] A New Dynamic Credit Scoring Model Based on Clustering Ensemble
    Gao Wei
    Cheng Mingshu
    2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 421 - 425
  • [7] Enhancing credit risk prediction based on ensemble tree-based feature transformation and logistic regression
    Liu, Jiaming
    Liu, Jiajia
    Wu, Chong
    Wang, Shouyang
    JOURNAL OF FORECASTING, 2024, 43 (02) : 429 - 455
  • [8] A systematic credit scoring model based on heterogeneous classifier ensembles
    Ala'raj, Maher
    Abbod, Maysam
    2015 INTERNATIONAL SYMPOSIUM ON INNOVATIONS IN INTELLIGENT SYSTEMS AND APPLICATIONS (INISTA) PROCEEDINGS, 2015, : 119 - 125
  • [9] Decision tree-based technology credit scoring for start-up firms: Korean case
    Sohn, So Young
    Kim, Ji Won
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (04) : 4007 - 4012
  • [10] A novel method for credit scoring based on feature transformation and ensemble model
    Li, Hongxiang
    Feng, Ao
    Lin, Bin
    Su, Houcheng
    Liu, Zixi
    Duan, Xuliang
    Pu, Haibo
    Wang, Yifei
    PEERJ COMPUTER SCIENCE, 2021,