Deep generative models for reject inference in credit scoring

被引:31
|
作者
Mancisidor, Rogelio A. [1 ,2 ,4 ]
Kampffmeyer, Michael [1 ,4 ]
Aas, Kjersti [3 ]
Jenssen, Robert [1 ,4 ]
机构
[1] UiT Art Univ Norway, Fac Sci & Technol, Dept Phys & Technol, Hansine Hansen Veg 18, N-9037 Tromso, Norway
[2] Santander Consumer Bank AS, Credit Risk Models, Strandveien 18, N-1325 Lysaker, Norway
[3] Norwegian Comp Ctr, Stat Anal Machine Learning & Image Anal, Gaustadalleen 23a, N-0373 Oslo, Norway
[4] UiT Machine Learning Grp, Tromso, Norway
关键词
Reject inference; Deep generative models; Credit scoring; Semi-supervised learning; SAMPLE SELECTION BIAS; AUGMENTATION; PERFORMANCE;
D O I
10.1016/j.knosys.2020.105758
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Credit scoring models based on accepted applications may be biased and their consequences can have a statistical and economic impact. Reject inference is the process of attempting to infer the creditworthiness status of the rejected applications. Inspired by the promising results of semi-supervised deep generative models, this research develops two novel Bayesian models for reject inference in credit scoring combining Gaussian mixtures and auxiliary variables in a semi-supervised framework with generative models. To the best of our knowledge this is the first study coupling these concepts together. The goal is to improve the classification accuracy in credit scoring models by adding reject applications. Further, our proposed models infer the unknown creditworthiness of the rejected applications by exact enumeration of the two possible outcomes of the loan (default or non-default). The efficient stochastic gradient optimization technique used in deep generative models makes our models suitable for large data sets. Finally, the experiments in this research show that our proposed models perform better than classical and alternative machine learning models for reject inference in credit scoring, and that model performance increases with the amount of data used for model training. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] A deep learning approach for credit scoring using credit default swaps
    Luo, Cuicui
    Wu, Desheng
    Wu, Dexiang
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2017, 65 : 465 - 470
  • [42] A Deep Learning Approach to Credit Scoring Using Credit History Data
    Smirnov, V. S.
    Stupnikov, S. A.
    LOBACHEVSKII JOURNAL OF MATHEMATICS, 2023, 44 (01) : 198 - 204
  • [43] Generative models for reproducible coronary calcium scoring
    van Velzen, Sanne G. M.
    de Vos, Bob D.
    Noothout, Julia M. H.
    Verkooijen, Helena M.
    Viergever, Max A.
    Isgum, Ivana
    JOURNAL OF MEDICAL IMAGING, 2022, 9 (05)
  • [44] Diversity in Deep Generative Models and Generative AI
    Turinici, Gabriel
    MACHINE LEARNING, OPTIMIZATION, AND DATA SCIENCE, LOD 2023, PT II, 2024, 14506 : 84 - 93
  • [45] Inference with deep generative priors in high dimensions
    Pandit P.
    Sahraee-Ardakan M.
    Rangan S.
    Schniter P.
    Fletcher A.K.
    IEEE. J. Sel. Area. Inf. Theory., 1 (336-347): : 336 - 347
  • [46] Visual analytics for monitoring credit scoring models
    Baldo, Daiane Rodrigues
    Regio, Murilo Santos
    Manssour, Isabel Harb
    INFORMATION VISUALIZATION, 2023, 22 (04) : 340 - 357
  • [47] Sample selection in credit-scoring models
    Greene, W
    JAPAN AND THE WORLD ECONOMY, 1998, 10 (03) : 299 - 316
  • [48] Consumer credit scoring models with limited data
    Sustersic, Maia
    Mramor, Dusan
    Zupan, Jure
    EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (03) : 4736 - 4744
  • [49] Application of credit scoring models in electricity companies
    Shen, Aihua
    Tong, Rencheng
    Li, Xingsen
    PROGRESS IN INTELLIGENCE COMPUTATION AND APPLICATIONS, PROCEEDINGS, 2007, : 618 - 621
  • [50] CREDIT-SCORING BY ENLARGED DISCRIMINANT MODELS
    FALBO, P
    OMEGA-INTERNATIONAL JOURNAL OF MANAGEMENT SCIENCE, 1991, 19 (04): : 275 - 289