Next-Generation Spam Filtering: Comparative Fine-Tuning of LLMs, NLPs, and CNN Models for Email Spam Classification

被引:4
|
作者
Roumeliotis, Konstantinos I. [1 ]
Tselikas, Nikolaos D. [1 ]
Nasiopoulos, Dimitrios K. [2 ]
机构
[1] Univ Peloponnese, Dept Informat & Telecommun, Akadimaikou GK Vlachou St, Tripoli 22131, Greece
[2] Agr Univ Athens, Sch Appl Econ & Social Sci, Dept Agribusiness & Supply Chain Management, Athens 11855, Greece
关键词
spam filtering; spam classification; spam detection; spam detection systems; spam email; phishing email; phishing detection; phishing attacks; LLM fine-tuning; LLM classification; PHISHING EMAILS;
D O I
10.3390/electronics13112034
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Spam emails and phishing attacks continue to pose significant challenges to email users worldwide, necessitating advanced techniques for their efficient detection and classification. In this paper, we address the persistent challenges of spam emails and phishing attacks by introducing a cutting-edge approach to email filtering. Our methodology revolves around harnessing the capabilities of advanced language models, particularly the state-of-the-art GPT-4 Large Language Model (LLM), along with BERT and RoBERTa Natural Language Processing (NLP) models. Through meticulous fine-tuning tailored for spam classification tasks, we aim to surpass the limitations of traditional spam detection systems, such as Convolutional Neural Networks (CNNs). Through an extensive literature review, experimentation, and evaluation, we demonstrate the effectiveness of our approach in accurately identifying spam and phishing emails while minimizing false positives. Our methodology showcases the potential of fine-tuning LLMs for specialized tasks like spam classification, offering enhanced protection against evolving spam and phishing attacks. This research contributes to the advancement of spam filtering techniques and lays the groundwork for robust email security systems in the face of increasingly sophisticated threats.
引用
收藏
页数:24
相关论文
共 7 条
  • [1] An Application of Transfer Learning: Fine-Tuning BERT for Spam Email Classification
    Bhopale, Amol P.
    Tiwari, Ashish
    MACHINE LEARNING AND BIG DATA ANALYTICS (PROCEEDINGS OF INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND BIG DATA ANALYTICS (ICMLBDA) 2021), 2022, 256 : 67 - 77
  • [2] An Improved Dandelion Optimizer Algorithm for Spam Detection: Next-Generation Email Filtering System
    Tubishat, Mohammad
    Al-Obeidat, Feras
    Sadiq, Ali Safaa
    Mirjalili, Seyedali
    COMPUTERS, 2023, 12 (10)
  • [3] Fine-Tuning Next-Generation Genome Editing Tools
    Kanchiswamy, Chidananda Nagamangala
    Maffei, Massimo
    Malnoy, Mickael
    Velasco, Riccardo
    Kim, Jin-Soo
    TRENDS IN BIOTECHNOLOGY, 2016, 34 (07) : 562 - 574
  • [4] Ham or Spam? A comparative study for some Content-based Classification Algorithms for Email Filtering
    Saab, Salwa Adriana
    Mitri, Nicholas
    Awad, Mariette
    2014 17TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE (MELECON), 2014, : 439 - 443
  • [5] Overview of fine-tuning CNN-Based Models for X-ray Image Classification
    Ngoc Ha Pham
    Giang Son Tran
    PROCEEDINGS OF THE 2024 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION TECHNOLOGY, ICIIT 2024, 2024, : 186 - 196
  • [6] A Comparative Analysis of Instruction Fine-Tuning Large Language Models for Financial Text Classification
    Fatemi, Sorouralsadat
    Hu, Yuheng
    Mousavi, Maryam
    ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2025, 16 (01)
  • [7] Unveiling the Power of Large Language Models: A Comparative Study of Retrieval-Augmented Generation, Fine-Tuning, and Their Synergistic Fusion for Enhanced Performance
    Budakoglu, Gulsum
    Emekci, Hakan
    IEEE ACCESS, 2025, 13 : 30936 - 30951