Internet Financial Credit Scoring Models Based on Deep Forest and Resampling Methods

Cited by: 7
Authors
Zhong, Yu [1]
Wang, Huiling [2]
Affiliations
[1] Sichuan Univ, Business Sch, Chengdu 610065, Peoples R China
[2] Shenzhen Inst Informat Technol, Sch Management, Shenzhen 518172, Peoples R China
Keywords
Internet; Deep learning; Random forests; Regression analysis; Data models; Finance; Credit scoring; class imbalance; deep forest; resampling method; NEURAL-NETWORKS; ENSEMBLE; CLASSIFICATION; IMBALANCE; TREE; CLASSIFIERS; REGRESSION; MACHINE; SMOTE
DOI
10.1109/ACCESS.2023.3239889
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In recent years, deep learning credit scoring models have become an active research topic in Internet finance. However, most existing studies are based on deep neural network models, whose structure is difficult to design. Moreover, previous research seldom considers the impact of class imbalance on credit scoring performance. To address these gaps, we propose a new deep learning credit scoring model based on deep forest (DF) and resampling methods. First, we combine DF with each of five resampling methods, namely random over-sampling (ROS), random under-sampling (RUS), the synthetic minority over-sampling technique (SMOTE), Tomek links, and SMOTE+Tomek, to build the corresponding models. We validate that the RUS-DF model has the best credit scoring performance among these models. Then, to further evaluate the advantages of the deep ensemble model RUS-DF, we compare it with four models built by combining RUS with a multilayer perceptron, a convolutional neural network, long short-term memory, and random forests, respectively. All experiments are conducted on four Internet financial credit scoring datasets. The results show that the RUS-DF model achieves better classification performance and stability than the other models and is suitable for solving credit scoring problems with imbalanced data.
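As a rough illustration of the resampling step named in the abstract, the sketch below shows one common form of random under-sampling (RUS): samples from the larger classes are dropped at random until every class is as small as the smallest one. This is not the authors' implementation. The function name `random_under_sample`, the toy data, and the fixed seed are assumptions made for the example, and the deep forest classifier that would then be trained on the resampled data is not shown.

```python
import random
from collections import Counter, defaultdict


def random_under_sample(X, y, seed=0):
    """Hypothetical helper: randomly drop samples from the larger classes
    until every class has as many samples as the smallest class."""
    rng = random.Random(seed)

    # Group sample indices by class label.
    by_class = defaultdict(list)
    for idx, label in enumerate(y):
        by_class[label].append(idx)

    # Size of the smallest (minority) class.
    n_min = min(len(idxs) for idxs in by_class.values())

    # Keep a random subset of n_min indices from each class.
    kept = []
    for idxs in by_class.values():
        kept.extend(rng.sample(idxs, n_min))
    kept.sort()  # preserve the original sample order

    return [X[i] for i in kept], [y[i] for i in kept]


# Toy imbalanced data (an assumption for illustration):
# 90 non-default samples (label 0) and 10 default samples (label 1).
X = [[i, i % 7] for i in range(100)]
y = [0] * 90 + [1] * 10

X_res, y_res = random_under_sample(X, y)
print(Counter(y_res))  # each class now has 10 samples
```

The resampled `X_res` and `y_res` would be used only for training; evaluation data is normally left at its original class ratio so that reported metrics reflect the real imbalance.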
Pages: 8689-8700
Number of pages: 12