A novel tree-based dynamic heterogeneous ensemble method for credit scoring

被引:66
|
作者
Xia, Yufei [1 ]
Zhao, Junhao [2 ]
He, Lingyun [3 ]
Li, Yinguo [1 ]
Niu, Mengyi [4 ]
机构
[1] Jiangsu Normal Univ, Business Sch, Xuzhou 221116, Jiangsu, Peoples R China
[2] Jiangsu Normal Univ, Sino Russian Inst, Xuzhou 221116, Jiangsu, Peoples R China
[3] China Univ Min & Technol, Sch Econ & Management, Xuzhou 221116, Jiangsu, Peoples R China
[4] Jiangsu Normal Univ, Law Sch, Xuzhou 221116, Jiangsu, Peoples R China
基金
中国国家自然科学基金;
关键词
Credit scoring; Selective ensemble; Random forests; Gradient boosting decision tree; Machine learning; NEURAL-NETWORK ENSEMBLE; ART CLASSIFICATION ALGORITHMS; RISK-ASSESSMENT; BANKRUPTCY PREDICTION; GENETIC ALGORITHM; REJECT INFERENCE; MODEL; CLASSIFIERS; SELECTION; DIVERSITY;
D O I
10.1016/j.eswa.2020.113615
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Ensemble models have been extensively applied to credit scoring. However, advanced tree-based classifiers have been seldom utilized as components of ensemble models. Moreover, few studies have considered dynamic ensemble selection. To fill the research gap, this paper aims to develop a novel tree-based overfitting-cautious heterogeneous ensemble model (i.e., OCHE) for credit scoring which departs from existing literature on base models and ensemble selection strategy. Regarding base models, tree-based techniques are employed to acquire a balance between predictive accuracy and computational cost. In terms of ensemble selection, the proposed method can assign weights to base models dynamically according to the overfitting measure. Validated on five public datasets, the proposed approach is compared with several popular benchmark models and selection strategies on predictive accuracy and computational cost measures. For predictive accuracy, the proposed approach outperforms the benchmark models significantly in most cases based on the non-parametric significance test. It also performs marginally better than several state-of-the-art studies. Our proposal remains robust in several scenarios. In terms of computational cost, the proposed method provides acceptable performance and benefits from GPU acceleration considerably. (C) 2020 Elsevier Ltd. All rights reserved.
引用
收藏
页数:18
相关论文
共 50 条
  • [41] Tree-Based Ensemble Methods: Predicting Asphalt Mixture Dynamic Modulus for Flexible Pavement Design
    Worthey, Hampton
    Yang, Jidong J.
    Kim, S. Sonny
    KSCE JOURNAL OF CIVIL ENGINEERING, 2021, 25 (11) : 4231 - 4239
  • [42] A novel communication-efficient heterogeneous federated positive and unlabeled learning method for credit scoring
    Qiu, Yongqin
    Chen, Yuanxing
    Fang, Kan
    Fang, Kuangnan
    COMPUTERS & OPERATIONS RESEARCH, 2025, 177
  • [43] A tree-based conservation scoring method for short linear motifs in multiple alignments of protein sequences
    Claudia Chica
    Alberto Labarga
    Cathryn M Gould
    Rodrigo López
    Toby J Gibson
    BMC Bioinformatics, 9
  • [44] A tree-based conservation scoring method for short linear motifs in multiple alignments of protein sequences
    Chica, Claudia
    Labarga, Alberto
    Gould, Cathryn M.
    Lopez, Rodrigo
    Gibson, Toby J.
    BMC BIOINFORMATICS, 2008, 9 (1)
  • [45] Using novel noise filtering techniques to ensemble classifiers for credit scoring
    Ngoc Anh Nguyen-Thi
    Quynh Pham-Nhu
    Son Luong-Ngoc
    Nam Vu-Thanh
    2019 INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI), 2019,
  • [46] A novel deep ensemble model for imbalanced credit scoring in internet finance
    Xiao, Jin
    Zhong, Yu
    Jia, Yanlin
    Wang, Yadong
    Li, Ruoyi
    Jiang, Xiaoyi
    Wang, Shouyang
    INTERNATIONAL JOURNAL OF FORECASTING, 2024, 40 (01) : 348 - 372
  • [47] Flash-flood susceptibility mapping: a novel credal decision tree-based ensemble approaches
    Dingying Yang
    Ting Zhang
    Alireza Arabameri
    M. Santosh
    Ujwal Deep Saha
    Aznarul Islam
    Earth Science Informatics, 2023, 16 (4) : 3143 - 3161
  • [48] Flash-flood susceptibility mapping: a novel credal decision tree-based ensemble approaches
    Yang, Dingying
    Zhang, Ting
    Arabameri, Alireza
    Santosh, M.
    Saha, Ujwal Deep
    Islam, Aznarul
    EARTH SCIENCE INFORMATICS, 2023, 16 (04) : 3143 - 3161
  • [49] INTELLIGENT TREE-BASED ENSEMBLE APPROACHES FOR PHISHING WEBSITE DETECTION
    Alsariera, Yazan A.
    Balogun, Abdullateef O.
    Adeyemo, Victor E.
    Tarawneh, Omar H.
    Mojeed, Hammed A.
    JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2022, 17 (01): : 563 - 582
  • [50] Tree-Based Ensemble Learning Techniques in the Analysis of Parkinsonian Syndromes
    Gorriz, J. M.
    Ramirez, J.
    Moreno-Caballero, M.
    Martinez-Murcia, F. J.
    Ortiz, A.
    Illan, I. A.
    Segovia, F.
    Salas-Gonzalez, D.
    Gomez-Rio, M.
    MEDICAL IMAGE UNDERSTANDING AND ANALYSIS (MIUA 2017), 2017, 723 : 459 - 469