Cost-aware Credit-scoring Framework Based on Resampling and Feature Selection

被引:0
|
作者
Mou, Yunhan [1 ]
Pu, Zihao [2 ]
Feng, Duanyu [3 ]
Luo, Yingting [3 ]
Lai, Yanzhao [4 ]
Huang, Jimin [5 ]
Tian, Youjing [6 ]
Xiao, Fang [6 ]
机构
[1] Yale Sch Publ Hlth, Dept Biostat, New Haven, CT USA
[2] Univ Hong Kong, Dept Stat & Actuarial Sci, Hong Kong, Peoples R China
[3] Sichuan Univ, Coll Math, Chengdu, Peoples R China
[4] Southwest Jiaotong Univ, Sch Econ & Management, Chengdu, Peoples R China
[5] Chancefocus Asset Management Shanghai Co, Shanghai, Peoples R China
[6] Sichuan Jinding Fortune Informat Technol Co Ltd, Chengdu, Peoples R China
关键词
Credit scoring; Pre-learning resampling; Financial indicators; Feature selection; CLASSIFICATION ALGORITHMS; RISK-ASSESSMENT; MODEL; MACHINE;
D O I
10.1007/s10614-024-10808-w
中图分类号
F [经济];
学科分类号
02 ;
摘要
Credit loans are fundamental to the financial industry, and effectively managing their risks is essential. Financial companies may face two challenges when performing credit scoring to control such risks. First, datasets are often imbalanced with far more non-default cases than default ones, where oversampling methods are usually applied. Few methods, however, have considered further enhancing the quality of a training dataset by addressing the critical samples that may confuse the final classifiers while maintaining the interpretability of the final model. Second, common model evaluation indicators may not accurately reflect the financial loss associated with incorrect predictions or the costs involved in collecting features. To address these challenges, we propose Cost AwarE CRediT ScorIng Framework Based on ResamplIng and FeaturESelection (CERTIFIES). In this framework, we develop a pre-learning resampling approach that employs multiple machine learning methods as assistant classifiers to detect critical data samples after oversampling. This approach further enhances the overall performance of the chief classifier, logistic regression, without compromising its interpretability. Additionally, during the model evaluation step, we design a cost-aware evaluation indicator that accounts for the actual loss due to incorrect predictions and the cost of collecting various features. This provides an approach to perform feature selection based on financial costs. To demonstrate the effectiveness of the proposed method, we apply it to our credit scoring dataset collected by local financial companies, as well as to two public datasets.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] Sample selection in credit-scoring models
    Greene, W
    JAPAN AND THE WORLD ECONOMY, 1998, 10 (03) : 299 - 316
  • [2] Cost-Aware Feature Selection for IoT Device Classification
    Chakraborty, Biswadeep
    Divakaran, Dinil Mon
    Nevat, Ido
    Peters, Gareth W.
    Gurusamy, Mohan
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (14) : 11052 - 11064
  • [3] A Cost-Aware Logical Framework
    Niu, Yue
    Sterling, Jonathan
    Grodin, Harrison
    Harper, Robert
    PROCEEDINGS OF THE ACM ON PROGRAMMING LANGUAGES-PACMPL, 2022, 6 (POPL):
  • [4] Feature selection based on SVM for credit scoring
    Yao, Ping
    PROCEEDINGS OF THE 2009 INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND NATURAL COMPUTING, VOL II, 2009, : 44 - 47
  • [5] Cost-based feature selection for Support Vector Machines: An application in credit scoring
    Maldonado, Sebastian
    Perez, Juan
    Bravo, Cristian
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2017, 261 (02) : 656 - 665
  • [6] Integrated framework for profit-based feature selection and SVM classification in credit scoring
    Maldonado, Sebastian
    Bravo, Cristian
    Lopez, Julio
    Perez, Juan
    DECISION SUPPORT SYSTEMS, 2017, 104 : 113 - 121
  • [7] Feature Selection in a Credit Scoring Model
    Laborda, Juan
    Ryoo, Seyong
    MATHEMATICS, 2021, 9 (07)
  • [8] Quantum Optimized Cost Based Feature Selection and Credit Scoring for Mobile Micro-financing
    Chen, Chi Ming
    Tso, Geoffrey Kwok Fai
    He, Kaijian
    COMPUTATIONAL ECONOMICS, 2024, 63 (02) : 919 - 950
  • [9] Quantum Optimized Cost Based Feature Selection and Credit Scoring for Mobile Micro-financing
    Chi Ming Chen
    Geoffrey Kwok Fai Tso
    Kaijian He
    Computational Economics, 2024, 63 : 919 - 950
  • [10] An uncertainty-oriented cost-sensitive credit scoring framework with multi-objective feature selection
    Wu, Yiqiong
    Huang, Wei
    Tian, Yingjie
    Zhu, Qing
    Yu, Lean
    ELECTRONIC COMMERCE RESEARCH AND APPLICATIONS, 2022, 53