Deep generative models for reject inference in credit scoring

被引:31
|
作者
Mancisidor, Rogelio A. [1 ,2 ,4 ]
Kampffmeyer, Michael [1 ,4 ]
Aas, Kjersti [3 ]
Jenssen, Robert [1 ,4 ]
机构
[1] UiT Art Univ Norway, Fac Sci & Technol, Dept Phys & Technol, Hansine Hansen Veg 18, N-9037 Tromso, Norway
[2] Santander Consumer Bank AS, Credit Risk Models, Strandveien 18, N-1325 Lysaker, Norway
[3] Norwegian Comp Ctr, Stat Anal Machine Learning & Image Anal, Gaustadalleen 23a, N-0373 Oslo, Norway
[4] UiT Machine Learning Grp, Tromso, Norway
关键词
Reject inference; Deep generative models; Credit scoring; Semi-supervised learning; SAMPLE SELECTION BIAS; AUGMENTATION; PERFORMANCE;
D O I
10.1016/j.knosys.2020.105758
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Credit scoring models based on accepted applications may be biased and their consequences can have a statistical and economic impact. Reject inference is the process of attempting to infer the creditworthiness status of the rejected applications. Inspired by the promising results of semi-supervised deep generative models, this research develops two novel Bayesian models for reject inference in credit scoring combining Gaussian mixtures and auxiliary variables in a semi-supervised framework with generative models. To the best of our knowledge this is the first study coupling these concepts together. The goal is to improve the classification accuracy in credit scoring models by adding reject applications. Further, our proposed models infer the unknown creditworthiness of the rejected applications by exact enumeration of the two possible outcomes of the loan (default or non-default). The efficient stochastic gradient optimization technique used in deep generative models makes our models suitable for large data sets. Finally, the experiments in this research show that our proposed models perform better than classical and alternative machine learning models for reject inference in credit scoring, and that model performance increases with the amount of data used for model training. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:17
相关论文
共 50 条
  • [31] Credit scoring, augmentation and lean models
    Banasik, J
    Crook, J
    JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY, 2005, 56 (09) : 1072 - 1081
  • [32] A comparison study of credit scoring models
    Zhang, Defu
    Huang, Hongyi
    Chen, Qingshan
    Jiang, Yi
    ICNC 2007: THIRD INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, VOL 1, PROCEEDINGS, 2007, : 15 - +
  • [33] Neural network credit scoring models
    West, D
    COMPUTERS & OPERATIONS RESEARCH, 2000, 27 (11-12) : 1131 - 1152
  • [34] An application of hybrid models in credit scoring
    Bonilla, M
    Olmeda, I
    Puertas, R
    FINANCIAL MODELLING, 2000, : 69 - 78
  • [35] Generative adversarial fusion network for class imbalance credit scoring
    Kai Lei
    Yuexiang Xie
    Shangru Zhong
    Jingchao Dai
    Min Yang
    Ying Shen
    Neural Computing and Applications, 2020, 32 : 8451 - 8462
  • [36] Generative adversarial fusion network for class imbalance credit scoring
    Lei, Kai
    Xie, Yuexiang
    Zhong, Shangru
    Dai, Jingchao
    Yang, Min
    Shen, Ying
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (12): : 8451 - 8462
  • [37] Inference and Learning for Generative Capsule Models
    Nazabal, Alfredo
    Tsagkas, Nikolaos
    Williams, Christopher K. I.
    NEURAL COMPUTATION, 2023, 35 (04) : 727 - 761
  • [38] Bayesian Inference for Misspecified Generative Models
    Nott, David J.
    Drovandi, Christopher
    Frazier, David T.
    ANNUAL REVIEW OF STATISTICS AND ITS APPLICATION, 2024, 11 : 179 - 202
  • [39] Validating risk models with a focus on credit scoring models
    Dryver, Arthur L.
    Sukkasem, Jantra
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2009, 79 (02) : 181 - 193
  • [40] Three-stage reject inference learning framework for credit scoring using unsupervised transfer learning and three-way decision theory
    Shen, Feng
    Zhao, Xingchao
    Kou, Gang
    DECISION SUPPORT SYSTEMS, 2020, 137 (137)