Machine Learning and Deep Learning for Loan Prediction in Banking: Exploring Ensemble Methods and Data Balancing

被引:0
|
作者
Sayed, Eslam Hussein [1 ,2 ]
Alabrah, Amerah [3 ]
Rahouma, Kamel Hussein [4 ]
Zohaib, Muhammad [5 ]
Badry, Rasha M. [1 ]
机构
[1] Fayoum Univ, Fac Comp & Informat, Informat Syst Dept, Faiyum, Egypt
[2] Nahda Univ, Fac Comp Sci, Informat Syst Dept, Bani Suwayf 62764, Egypt
[3] King Saud Univ, Coll Comp & Informat Sci, Dept Informat Syst, Riyadh 11543, Saudi Arabia
[4] Minia Univ, Fac Engn, Elect Engn Dept, Al Minya, Egypt
[5] Lappeenranta Lahti Univ Technol, Software Engn Dept, Informat Syst Dept, Lappeenranta 53851, Finland
来源
IEEE ACCESS | 2024年 / 12卷
关键词
Accuracy; Random forests; Predictive models; Classification algorithms; Prediction algorithms; Machine learning algorithms; Logistic regression; Support vector machines; Ensemble learning; Deep learning; Customer loan prediction; artificial intelligence; data preprocessing; model optimization; machine learning; deep learning; classification models; CLASSIFICATION;
D O I
10.1109/ACCESS.2024.3509774
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The prediction of loan defaults is crucial for banks and financial institutions due to its impact on earnings, and it also plays a significant role in shaping credit scores. This task is a challenging one, and as the demand for loans increases, so does the number of applications. Traditional methods of checking eligibility are time-consuming and laborious, and they may not always accurately identify suitable loan recipients. As a result, some applicants may default on their loans, causing financial losses for banks. Artificial Intelligence, using Machine Learning and Deep Learning techniques, can provide a more efficient solution. These techniques can use various classification algorithms to predict which applicants will likely be eligible for loans. This study uses five Machine Learning classification algorithms (Gaussian Naive Bayes, AdaBoost, Gradient Boosting, K Neighbors Classifier, Decision Trees, Random Forest, and Logistic Regression) and eight Deep Learning algorithms (MLP, CNN, LSTM, Transformer, GRU, Autoencoder, ResNet, and DenseNet). The use of Ensemble Methods and SMOTE with SMOTE-TOMEK Techniques also has a positive impact on the results. Four metrics are used to evaluate the effectiveness of these algorithms: accuracy, precision, recall, and F1-measure. The study found that DenseNet and ResNet were the most accurate predictive models. These findings highlight the potential of predictive modeling in identifying credit disapproval among vulnerable consumers in a sea of loan applications.
引用
收藏
页码:193997 / 194019
页数:23
相关论文
共 50 条
  • [11] Prediction of Concrete Properties Using Ensemble Machine Learning Methods
    Prayogo, D.
    Santoso, D., I
    Wijaya, D.
    Gunawan, T.
    Widjaja, J. A.
    2ND INTERNATIONAL CONFERENCE ON SUSTAINABLE INFRASTRUCTURE, 2020, 1625
  • [12] Prediction of Cesarean Childbirth using Ensemble Machine Learning Methods
    Khan, Nafiz Imtiaz
    Mahmud, Tahasin
    Islam, Muhammad Nazrul
    Mustafina, Sumaiya Nuha
    22ND INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES (IIWAS2020), 2020, : 331 - 339
  • [13] A Comparative Study of non-deep Learning, Deep Learning, and Ensemble Learning Methods for Sunspot Number Prediction
    Dang, Yuchen
    Chen, Ziqi
    Li, Heng
    Shu, Hai
    APPLIED ARTIFICIAL INTELLIGENCE, 2022, 36 (01)
  • [14] An Ensemble of Random Forest Gradient Boosting Machine and Deep Learning Methods for Stock Price Prediction
    Shrivastav, Lokesh Kumar
    Kumar, Ravinder
    JOURNAL OF INFORMATION TECHNOLOGY RESEARCH, 2022, 15 (01)
  • [15] Exploring and Improving Deep Learning Methods for Genomic Prediction
    Kelly, C.
    Gaffney, A.
    McLaughlin, R.
    HUMAN HEREDITY, 2020, 84 (4-5) : 213 - 213
  • [16] Genomic prediction using machine learning: a comparison of the performance of regularized regression, ensemble, instance-based and deep learning methods on synthetic and empirical data
    Lourenco, Vanda M.
    Ogutu, Joseph O.
    Rodrigues, Rui A. P.
    Posekany, Alexandra
    Piepho, Hans-Peter
    BMC GENOMICS, 2024, 25 (01)
  • [17] Genomic prediction using machine learning: a comparison of the performance of regularized regression, ensemble, instance-based and deep learning methods on synthetic and empirical data
    Vanda M. Lourenço
    Joseph O. Ogutu
    Rui A.P. Rodrigues
    Alexandra Posekany
    Hans-Peter Piepho
    BMC Genomics, 25
  • [18] ENSEMBLE MACHINE LEARNING METHODS TO PREDICT THE BALANCING OF AYURVEDIC CONSTITUENTS IN THE HUMAN BODY
    Rajasekar, Vani
    Krishnamoorthi, Sathya
    Saracevic, Muzafer
    Pepic, Dzenis
    Zajmovic, Mahir
    Zogic, Haris
    COMPUTER SCIENCE-AGH, 2022, 23 (01): : 117 - 132
  • [19] Wave data prediction with optimized machine learning and deep learning techniques
    Domala, Vamshikrishna
    Lee, Wonhee
    Kim, Tae-wan
    JOURNAL OF COMPUTATIONAL DESIGN AND ENGINEERING, 2022, 9 (03) : 1107 - 1122
  • [20] News-based Machine Learning and Deep Learning Methods for Stock Prediction
    Guo, Junjie
    Tuckfield, Bradford
    4TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE APPLICATIONS AND TECHNOLOGIES (AIAAT 2020), 2020, 1642