Machine Learning and Deep Learning for Loan Prediction in Banking: Exploring Ensemble Methods and Data Balancing

被引:0
|
作者
Sayed, Eslam Hussein [1 ,2 ]
Alabrah, Amerah [3 ]
Rahouma, Kamel Hussein [4 ]
Zohaib, Muhammad [5 ]
Badry, Rasha M. [1 ]
机构
[1] Fayoum Univ, Fac Comp & Informat, Informat Syst Dept, Faiyum, Egypt
[2] Nahda Univ, Fac Comp Sci, Informat Syst Dept, Bani Suwayf 62764, Egypt
[3] King Saud Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh 11543, Saudi Arabia
[4] Minia Univ, Fac Engn, Elect Engn Dept, Al Minya, Egypt
[5] Lappeenranta Lahti Univ Technol, Software Engn Dept, Informat Syst Dept, Lappeenranta 53851, Finland
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Accuracy; Random forests; Predictive models; Classification algorithms; Prediction algorithms; Machine learning algorithms; Logistic regression; Support vector machines; Ensemble learning; Deep learning; Customer loan prediction; artificial intelligence; data preprocessing; model optimization; machine learning; deep learning; classification models; CLASSIFICATION;
D O I
10.1109/ACCESS.2024.3509774
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The prediction of loan defaults is crucial for banks and financial institutions due to its impact on earnings, and it also plays a significant role in shaping credit scores. This task is a challenging one, and as the demand for loans increases, so does the number of applications. Traditional methods of checking eligibility are time-consuming and laborious, and they may not always accurately identify suitable loan recipients. As a result, some applicants may default on their loans, causing financial losses for banks. Artificial Intelligence, using Machine Learning and Deep Learning techniques, can provide a more efficient solution. These techniques can use various classification algorithms to predict which applicants will likely be eligible for loans. This study uses five Machine Learning classification algorithms (Gaussian Naive Bayes, AdaBoost, Gradient Boosting, K Neighbors Classifier, Decision Trees, Random Forest, and Logistic Regression) and eight Deep Learning algorithms (MLP, CNN, LSTM, Transformer, GRU, Autoencoder, ResNet, and DenseNet). The use of Ensemble Methods and SMOTE with SMOTE-TOMEK Techniques also has a positive impact on the results. Four metrics are used to evaluate the effectiveness of these algorithms: accuracy, precision, recall, and F1-measure. The study found that DenseNet and ResNet were the most accurate predictive models. These findings highlight the potential of predictive modeling in identifying credit disapproval among vulnerable consumers in a sea of loan applications.
引用
收藏
页码:193997 / 194019
页数:23
相关论文
共 50 条
  • [31] Prediction of Loan Rate for Mortgage Data: Deep Learning Versus Robust Regression
    Wang, Donglin
    Hong, Don
    Wu, Qiang
    COMPUTATIONAL ECONOMICS, 2023, 61 (03) : 1137 - 1150
  • [32] An Ensemble Machine Learning and Data Mining Approach to Enhance Stroke Prediction
    Wijaya, Richard
    Saeed, Faisal
    Samimi, Parnia
    Albarrak, Abdullah M.
    Qasem, Sultan Noman
    BIOENGINEERING-BASEL, 2024, 11 (07):
  • [33] Machine Learning and Deep Learning Methods for Cybersecurity
    Xin, Yang
    Kong, Lingshuang
    Liu, Zhi
    Chen, Yuling
    Li, Yanmiao
    Zhu, Hongliang
    Gao, Mingcheng
    Hou, Haixia
    Wang, Chunhua
    IEEE ACCESS, 2018, 6 : 35365 - 35381
  • [34] Predicting loan approval of bank direct marketing data using ensemble machine learning algorithms
    Meshref H.
    International Journal of Circuits, Systems and Signal Processing, 2020, 14 : 914 - 922
  • [35] Deep Learning Versus Traditional Machine Learning Methods for Aggregated Energy Demand Prediction
    Paterakis, Nikolaos G.
    Mocanu, Elena
    Gibescu, Madeleine
    Stappers, Bart
    van Alst, Walter
    2017 IEEE PES INNOVATIVE SMART GRID TECHNOLOGIES CONFERENCE EUROPE (ISGT-EUROPE), 2017,
  • [36] Wheat Yield Prediction for Turkey Using Statistical Machine Learning and Deep Learning Methods
    Ozden, Cevher
    Karadogan, Nurguel
    PAKISTAN JOURNAL OF AGRICULTURAL SCIENCES, 2024, 61 (02): : 429 - 435
  • [37] Prediction of severe thunderstorm events with ensemble deep learning and radar data
    Sabrina Guastavino
    Michele Piana
    Marco Tizzi
    Federico Cassola
    Antonio Iengo
    Davide Sacchetti
    Enrico Solazzo
    Federico Benvenuto
    Scientific Reports, 12
  • [38] Application of machine learning and deep learning methods for hydrated electron rate constant prediction
    Zheng, Shanshan
    Guo, Wanqian
    Li, Chao
    Sun, Yongbin
    Zhao, Qi
    Lu, Hao
    Si, Qishi
    Wang, Huazhe
    ENVIRONMENTAL RESEARCH, 2023, 231
  • [39] In silico prediction of chemical-induced hematotoxicity with machine learning and deep learning methods
    Yuqing Hua
    Yinping Shi
    Xueyan Cui
    Xiao Li
    Molecular Diversity, 2021, 25 : 1585 - 1596
  • [40] Comparing Machine Learning and Deep Learning Methods for Real-Time Crash Prediction
    Theofilatos, Athanasios
    Chen, Cong
    Antoniou, Constantinos
    TRANSPORTATION RESEARCH RECORD, 2019, 2673 (08) : 169 - 178