Internet Financial Credit Scoring Models Based on Deep Forest and Resampling Methods

被引:7
|
作者
Zhong, Yu [1 ]
Wang, Huiling [2 ]
机构
[1] Sichuan Univ, Business Sch, Chengdu 610065, Peoples R China
[2] Shenzhen Inst Informat Technol, Sch Management, Shenzhen 518172, Peoples R China
关键词
Internet; Deep learning; Random forests; Regression analysis; Data models; Finance; Credit scoring; class imbalance; deep forest; resampling method; NEURAL-NETWORKS; ENSEMBLE; CLASSIFICATION; IMBALANCE; TREE; CLASSIFIERS; REGRESSION; MACHINE; SMOTE;
D O I
10.1109/ACCESS.2023.3239889
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In recent years, deep learning credit scoring models have become a hot research topic in Internet finance. However, most of the existing studies are based on deep neural network models, whose structure is difficult to design. Moreover, previous research seldom considers the impact of class imbalance problems on credit scoring performance. To fill this gap, we propose a new deep learning credit scoring model based on deep forest (DF) and resampling methods. First, we combine DF with five resampling methods including random over-sampling (ROS), random under-sampling (RUS), synthetic minority over-sampling technique (SMOTE), tomek links and SMOTE+ Tomek, respectively, to build responding models. We validate that the RUS-DF model has the best credit scoring performance among the above models. Then, to further evaluate the advantages of the deep ensemble model RUS-DF, we compare it with four models building by combining RUS with multilayer perceptron, convolutional neural network, and long short-term memory and random forests, respectively. All the experiments are conducted on four Internet financial credit scoring datasets. The results show that the RUS-DF model obtains better classification performance and stability than other models and is suitable for solving the credit scoring problem with imbalanced data.
引用
收藏
页码:8689 / 8700
页数:12
相关论文
共 50 条
  • [11] Credit Scoring Models Using Soft Computing Methods: A Survey
    Lahsasna, Adel
    Ainon, Raja Noor
    Teh, Ying Wah
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2010, 7 (02) : 115 - 123
  • [12] A recent review on optimisation methods applied to credit scoring models
    Kamimura, Elias Shohei
    Pinto, Anderson Rogerio Faia
    Nagano, Marcelo Seido
    JOURNAL OF ECONOMICS FINANCE AND ADMINISTRATIVE SCIENCE, 2023, 28 (56): : 352 - 371
  • [13] Cost-aware Credit-scoring Framework Based on Resampling and Feature Selection
    Mou, Yunhan
    Pu, Zihao
    Feng, Duanyu
    Luo, Yingting
    Lai, Yanzhao
    Huang, Jimin
    Tian, Youjing
    Xiao, Fang
    COMPUTATIONAL ECONOMICS, 2024,
  • [14] A Novel Credit Scoring Model based on Optimized Random Forest
    Zhang, Xingzhi
    Yang, Yan
    Zhou, Zhurong
    2018 IEEE 8TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2018, : 60 - 65
  • [15] A Novel Enterprise Credit Scoring Method Based On Random Forest
    Wu Jing
    Dong Huailin
    Wu Qingfeng
    Wang Wei
    ICCSE 2008: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE & EDUCATION: ADVANCED COMPUTER TECHNOLOGY, NEW EDUCATION, 2008, : 188 - 192
  • [16] Internet Credit Risk Scoring Based on Simulated Annealing and Genetic Algorithm
    Hu, Ji
    Cai, Jiawen
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON APPLIED MATHEMATICS, MODELLING AND STATISTICS APPLICATION (AMMSA 2017), 2017, 141 : 373 - 377
  • [17] Credit scoring with a feature selection approach based deep learning
    Van-Sang Ha
    Ha-Nam Nguyen
    2016 7TH INTERNATIONAL CONFERENCE ON MECHANICAL, INDUSTRIAL, AND MANUFACTURING TECHNOLOGIES (MIMT 2016), 2016, 54
  • [18] Comparison of the hybrid Credit scoring models based on Various Classifiers
    Chen, Fei-Long
    Li, Feng-Chia
    INTERNATIONAL JOURNAL OF INTELLIGENT INFORMATION TECHNOLOGIES, 2010, 6 (03) : 56 - 74
  • [19] Credit Risk Scoring Analysis Based on Machine Learning Models
    Qiu, Ziyue
    Li, Yuming
    Ni, Pin
    Li, Gangmin
    2019 6TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE 2019), 2019, : 220 - 224
  • [20] Network-based models to improve credit scoring accuracy
    Misheva, Branka Hadji
    Giudici, Paolo
    Pediroda, Valentino
    2018 IEEE 5TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA), 2018, : 623 - 630