Internet Financial Credit Scoring Models Based on Deep Forest and Resampling Methods

Cited by: 7
Authors
Zhong, Yu [1 ]
Wang, Huiling [2 ]
Affiliations
[1] Sichuan Univ, Business Sch, Chengdu 610065, Peoples R China
[2] Shenzhen Inst Informat Technol, Sch Management, Shenzhen 518172, Peoples R China
Keywords
Internet; Deep learning; Random forests; Regression analysis; Data models; Finance; Credit scoring; class imbalance; deep forest; resampling method; NEURAL-NETWORKS; ENSEMBLE; CLASSIFICATION; IMBALANCE; TREE; CLASSIFIERS; REGRESSION; MACHINE; SMOTE;
DOI
10.1109/ACCESS.2023.3239889
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
In recent years, deep learning credit scoring models have become a hot research topic in Internet finance. However, most existing studies are based on deep neural network models, whose structure is difficult to design. Moreover, previous research seldom considers the impact of class imbalance on credit scoring performance. To fill this gap, we propose a new deep learning credit scoring model based on deep forest (DF) and resampling methods. First, we combine DF with each of five resampling methods, namely random over-sampling (ROS), random under-sampling (RUS), the synthetic minority over-sampling technique (SMOTE), Tomek links, and SMOTE+Tomek, to build the corresponding models. We validate that the RUS-DF model has the best credit scoring performance among these models. Then, to further evaluate the advantages of the deep ensemble model RUS-DF, we compare it with four models built by combining RUS with a multilayer perceptron, a convolutional neural network, long short-term memory, and random forests, respectively. All experiments are conducted on four Internet financial credit scoring datasets. The results show that the RUS-DF model obtains better classification performance and stability than the other models and is suitable for solving the credit scoring problem with imbalanced data.
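As an illustration of the random under-sampling (RUS) step named in the abstract, the following is a minimal sketch in plain Python. It is not the authors' implementation: the function name `random_under_sample`, the toy data, and the choice to shrink every class to the size of the smallest one are assumptions made for this example. In a RUS-DF style pipeline, the balanced output would then be passed to a classifier such as a deep forest.

```python
import random
from collections import Counter


def random_under_sample(X, y, seed=0):
    """Randomly drop samples from the larger classes until every class
    has as many samples as the smallest class (hypothetical helper)."""
    rng = random.Random(seed)

    # Group row indices by class label.
    by_class = {}
    for i, label in enumerate(y):
        by_class.setdefault(label, []).append(i)

    # Target size per class: the size of the smallest class.
    n_min = min(len(indices) for indices in by_class.values())

    # Sample n_min indices without replacement from each class.
    kept = []
    for indices in by_class.values():
        kept.extend(rng.sample(indices, n_min))
    kept.sort()  # preserve the original row order

    return [X[i] for i in kept], [y[i] for i in kept]


# Toy imbalanced data (assumed): 8 non-default (0) and 2 default (1) samples.
X = [[i] for i in range(10)]
y = [0] * 8 + [1] * 2

X_bal, y_bal = random_under_sample(X, y)
print(Counter(y_bal))  # both classes now have 2 samples each
```

Because the minority class already has the target size, all of its samples are kept and only majority-class samples are discarded. This is the trade-off of RUS: the classes become balanced at the cost of throwing away some majority-class information.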
Pages: 8689-8700
Number of pages: 12
Related papers
50 in total
  • [31] A predictive intelligence system of credit scoring based on deep multiple kernel learning
    Wu, Cheng-Feng
    Huang, Shian-Chang
    Chiou, Chei-Chang
    Wang, Yu-Min
    APPLIED SOFT COMPUTING, 2021, 111 (111)
  • [32] Fine Clustering Analysis of Internet Financial Credit Investigation Based on Big Data
    Sun, Jingqi
    Li, Yu
    Li, Qiang
    Li, Yingji
    Jia, Yanshu
    Xia, Dongmei
    BIG DATA RESEARCH, 2022, 27
  • [33] Research on Credit Risk Identification of Internet Financial Enterprises Based on Big Data
    Peng, Hua
    MOBILE INFORMATION SYSTEMS, 2021, 2021
  • [34] An Automatic Deep Reinforcement Learning based Credit Scoring Model using Deep-Q Network for Classification of Customer Credit Requests
    Paul, Sudipta
    Gupta, Agam
    Kar, Arpan Kumar
    Singh, Vinay
    2023 IEEE INTERNATIONAL SYMPOSIUM ON TECHNOLOGY AND SOCIETY, ISTAS, 2023
  • [35] A Linear-dependence-based Approach to Design Proactive Credit Scoring Models
    Saia, Roberto
    Carta, Salvatore
    KDIR: PROCEEDINGS OF THE 8TH INTERNATIONAL JOINT CONFERENCE ON KNOWLEDGE DISCOVERY, KNOWLEDGE ENGINEERING AND KNOWLEDGE MANAGEMENT - VOL. 1, 2016, : 111 - 120
  • [36] Financial Credit Risk Control Strategy Based on Weighted Random Forest Algorithm
    Guo Yangyudongnanxin
    SCIENTIFIC PROGRAMMING, 2021, 2021
  • [37] Dynamic Prediction of Internet Financial Market Based on Deep Learning
    Zhang, Zixuan
    Jia, Xiaojun
    Chen, Shan
    Li, Menggang
    Wang, Fang
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2022, 2022
  • [38] Machine Learning Based on Resampling Approaches and Deep Reinforcement Learning for Credit Card Fraud Detection Systems
    Tran Khanh Dang
    Thanh Cong Tran
    Luc Minh Tuan
    Mai Viet Tiep
    APPLIED SCIENCES-BASEL, 2021, 11 (21)
  • [39] Utility based Credit Scoring for Banks and Financial Institutions: Case Study of a Major Iranian Bank
    Sadatrasoul, Seyed Mahdi
    Gholamian, Mohammad Reza
    Hajimohammadi, Zeynab
    Hosseini, Mahdi
    JOURNAL OF MATHEMATICS AND COMPUTER SCIENCE-JMCS, 2014, 13 (04): : 281 - 287