Deep generative models for reject inference in credit scoring

被引:31
|
作者
Mancisidor, Rogelio A. [1 ,2 ,4 ]
Kampffmeyer, Michael [1 ,4 ]
Aas, Kjersti [3 ]
Jenssen, Robert [1 ,4 ]
机构
[1] UiT Art Univ Norway, Fac Sci & Technol, Dept Phys & Technol, Hansine Hansen Veg 18, N-9037 Tromso, Norway
[2] Santander Consumer Bank AS, Credit Risk Models, Strandveien 18, N-1325 Lysaker, Norway
[3] Norwegian Comp Ctr, Stat Anal Machine Learning & Image Anal, Gaustadalleen 23a, N-0373 Oslo, Norway
[4] UiT Machine Learning Grp, Tromso, Norway
关键词
Reject inference; Deep generative models; Credit scoring; Semi-supervised learning; SAMPLE SELECTION BIAS; AUGMENTATION; PERFORMANCE;
D O I
10.1016/j.knosys.2020.105758
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Credit scoring models based on accepted applications may be biased and their consequences can have a statistical and economic impact. Reject inference is the process of attempting to infer the creditworthiness status of the rejected applications. Inspired by the promising results of semi-supervised deep generative models, this research develops two novel Bayesian models for reject inference in credit scoring combining Gaussian mixtures and auxiliary variables in a semi-supervised framework with generative models. To the best of our knowledge this is the first study coupling these concepts together. The goal is to improve the classification accuracy in credit scoring models by adding reject applications. Further, our proposed models infer the unknown creditworthiness of the rejected applications by exact enumeration of the two possible outcomes of the loan (default or non-default). The efficient stochastic gradient optimization technique used in deep generative models makes our models suitable for large data sets. Finally, the experiments in this research show that our proposed models perform better than classical and alternative machine learning models for reject inference in credit scoring, and that model performance increases with the amount of data used for model training. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Subagging for credit scoring models
    Paleologo, Giuseppe
    Elisseeff, Andre
    Antonini, Gianluca
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2010, 201 (02) : 490 - 499
  • [22] A new approach for reject inference in credit scoring using kernel-free fuzzy quadratic surface support vector machines
    Tian, Ye
    Yong, Ziyang
    Luo, Jian
    APPLIED SOFT COMPUTING, 2018, 73 : 96 - 105
  • [23] A graph-based semi-supervised reject inference framework considering imbalanced data distribution for consumer credit scoring
    Kang, Yanzhe
    Jia, Ning
    Cui, Runbang
    Deng, Jiang
    APPLIED SOFT COMPUTING, 2021, 105
  • [24] Reject inference in credit scoring using a three-way decision and safe semi-supervised support vector machine
    Shen, Feng
    Yang, Zhiyuan
    Zhao, Xingchao
    Lan, Dao
    INFORMATION SCIENCES, 2022, 606 : 614 - 627
  • [25] Multi-view reject inference for semi-supervised credit scoring with consistency training and three-way decision
    Tang, Haoxin
    Liang, Decui
    OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 2025, 133
  • [26] FROM CREDIT SCORING TO REGULATORY SCORING: COMPARING CREDIT SCORING MODELS FROM A REGULATORY PERSPECTIVE
    Xia, Yufei
    Liao, Zijun
    Xu, Jun
    LI, Yinguo
    TECHNOLOGICAL AND ECONOMIC DEVELOPMENT OF ECONOMY, 2022, 28 (06) : 1954 - 1990
  • [27] Technology scoring model considering rejected applicants and effect of reject inference
    Kim, Y.
    Sohn, S. Y.
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2007, 58 (10) : 1341 - 1347
  • [28] Internet Financial Credit Scoring Models Based on Deep Forest and Resampling Methods
    Zhong, Yu
    Wang, Huiling
    IEEE ACCESS, 2023, 11 : 8689 - 8700
  • [29] Credit scoring using machine learning and deep Learning-Based models
    Mestiri, Sami
    DATA SCIENCE IN FINANCE AND ECONOMICS, 2024, 4 (02): : 236 - 248
  • [30] Fuzzy Type 2 Inference System for Credit Scoring
    Uyar, Kaan
    Ilhan, Umit
    PROCEEDINGS OF THE 11TH WSEAS INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL, MODELLING AND SIMULATION: CONTROLLING, MODELLING AND SIMULATION, 2009, : 518 - +