In-Depth Analysis of Phishing Email Detection: Evaluating the Performance of Machine Learning and Deep Learning Models Across Multiple Datasets

Cited by: 0
Authors
Alhuzali, Abeer [1]
Alloqmani, Ahad [1]
Aljabri, Manar [1]
Alharbi, Fatemah [2]
Affiliations
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Dept Comp Sci, Jeddah 21589, Saudi Arabia
[2] Taibah Univ, Coll Comp Sci & Engn, Comp Sci Dept, Yanbu 46522, Saudi Arabia
Source
APPLIED SCIENCES-BASEL | 2025 / Vol. 15 / Issue 06
Keywords
phishing email detection; cybersecurity; artificial intelligence (AI); deep learning (DL); machine learning (ML); spam filtering; threat detection; transformer models
DOI
10.3390/app15063396
Chinese Library Classification number
O6 [Chemistry]
Subject classification code
0703
Abstract
Phishing emails remain a primary vector for cyberattacks, necessitating advanced detection mechanisms. Existing studies often focus on limited datasets or a small number of models, lacking a comprehensive evaluation approach. This study develops a novel framework for implementing and testing phishing email detection models to address this gap. A total of fourteen machine learning (ML) and deep learning (DL) models are evaluated across ten datasets, including nine publicly available datasets and a merged dataset created for this study. The evaluation is conducted using multiple performance metrics to ensure a comprehensive comparison. Experimental results demonstrate that DL models consistently outperform their ML counterparts in both accuracy and robustness. Notably, transformer-based models BERT and RoBERTa achieve the highest detection accuracies of 98.99% and 99.08%, respectively, on the balanced merged dataset, outperforming traditional ML approaches by an average margin of 4.7%. These findings highlight the superiority of DL in phishing detection and emphasize the potential of AI-driven solutions in strengthening email security systems. This study provides a benchmark for future research and sets the stage for advancements in cybersecurity innovation.
Pages: 30
Related papers
50 items in total
  • [41] A robust approach to authorship verification using siamese deep learning: application in phishing email detection
    Remmide M.A.
    Boumahdi F.
    Ammar Aouchiche I.R.
    Guendouz A.
    Boustia N.
    International Journal of Speech Technology, 2024, 27 (02) : 405 - 412
  • [42] Enhancing Phishing Website Detection Using Ensemble Machine Learning Models
    Baliyan, Himanshu
    Prasath, A. Rama
    2024 OPJU International Technology Conference on Smart Computing for Innovation and Advancement in Industry 4.0, OTCON 2024, 2024
  • [43] Evaluating machine learning models for building risk prediction models in complex datasets
    Cook, James P.
    Goulermas, Yannis
    Morris, Andrew P.
    GENETIC EPIDEMIOLOGY, 2020, 44 (05) : 477 - 477
  • [44] Enhancing classification performance in imbalanced datasets: A comparative analysis of machine learning models
    Dube, Lindani
    Verster, Tanja
    DATA SCIENCE IN FINANCE AND ECONOMICS, 2023, 3 (04) : 354 - 379
  • [45] Dissecting the infodemic: An in-depth analysis of COVID-19 misinformation detection on X (formerly Twitter) utilizing machine learning and deep learning techniques
    Ul Hussna, Asma
    Alam, Md Golam Rabiul
    Islam, Risul
    Alkhamees, Bader Fahad
    Hassan, Mohammad Mehedi
    Uddin, Md Zia
    HELIYON, 2024, 10 (18)
  • [46] Benchmarking of deep learning irradiance forecasting models from sky images - An in-depth analysis
    Paletta, Quentin
    Arbod, Guillaume
    Lasenby, Joan
    SOLAR ENERGY, 2021, 224 : 855 - 867
  • [47] Comparative analysis of machine learning algorithms in detection of phishing websites
    Kosan, Muhammed Ali
    Yildiz, Oktay
    Karacan, Hacer
    PAMUKKALE UNIVERSITY JOURNAL OF ENGINEERING SCIENCES-PAMUKKALE UNIVERSITESI MUHENDISLIK BILIMLERI DERGISI, 2018, 24 (02) : 276 - 282
  • [48] Performance Assessment of Multiple Machine Learning Classifiers for Detecting the Phishing URLs
    Rahman, Sheikh Shah Mohammad Motiur
    Rafiq, Fatama Binta
    Toma, Tapushe Rabaya
    Hossain, Syeda Sumbul
    Biplob, Khalid Been Badruzzaman
    DATA ENGINEERING AND COMMUNICATION TECHNOLOGY, ICDECT-2K19, 2020, 1079 : 285 - 296
  • [49] Securing Networks: An In-Depth Analysis of Intrusion Detection using Machine Learning and Model Explanations
    Hoang-Tu Vo
    Nhon Nguyen Thien
    Kheo Chau Mui
    Phuc Pham Tien
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (05) : 1436 - 1444
  • [50] Explainable end-to-end deep learning for diabetic retinopathy detection across multiple datasets
    Chetoui, Mohamed
    Akhloufi, Moulay A.
    JOURNAL OF MEDICAL IMAGING, 2020, 7 (04)