Next-Generation Spam Filtering: Comparative Fine-Tuning of LLMs, NLPs, and CNN Models for Email Spam Classification

被引：4

作者：

Roumeliotis, Konstantinos I. ^{[1
]}

Tselikas, Nikolaos D. ^{[1
]}

Nasiopoulos, Dimitrios K. ^{[2
]}

机构：

[1] Univ Peloponnese, Dept Informat & Telecommun, Akadimaikou GK Vlachou St, Tripoli 22131, Greece

[2] Agr Univ Athens, Sch Appl Econ & Social Sci, Dept Agribusiness & Supply Chain Management, Athens 11855, Greece

来源：

ELECTRONICS | 2024年 / 13卷 / 11期

关键词：

spam filtering; spam classification; spam detection; spam detection systems; spam email; phishing email; phishing detection; phishing attacks; LLM fine-tuning; LLM classification; PHISHING EMAILS;

D O I：

10.3390/electronics13112034

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Spam emails and phishing attacks continue to pose significant challenges to email users worldwide, necessitating advanced techniques for their efficient detection and classification. In this paper, we address the persistent challenges of spam emails and phishing attacks by introducing a cutting-edge approach to email filtering. Our methodology revolves around harnessing the capabilities of advanced language models, particularly the state-of-the-art GPT-4 Large Language Model (LLM), along with BERT and RoBERTa Natural Language Processing (NLP) models. Through meticulous fine-tuning tailored for spam classification tasks, we aim to surpass the limitations of traditional spam detection systems, such as Convolutional Neural Networks (CNNs). Through an extensive literature review, experimentation, and evaluation, we demonstrate the effectiveness of our approach in accurately identifying spam and phishing emails while minimizing false positives. Our methodology showcases the potential of fine-tuning LLMs for specialized tasks like spam classification, offering enhanced protection against evolving spam and phishing attacks. This research contributes to the advancement of spam filtering techniques and lays the groundwork for robust email security systems in the face of increasingly sophisticated threats.

引用

页数：24

共 7 条

[1] An Application of Transfer Learning: Fine-Tuning BERT for Spam Email Classification
Bhopale, Amol P.
Tiwari, Ashish
MACHINE LEARNING AND BIG DATA ANALYTICS (PROCEEDINGS OF INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND BIG DATA ANALYTICS (ICMLBDA) 2021), 2022, 256 : 67 - 77
[2] An Improved Dandelion Optimizer Algorithm for Spam Detection: Next-Generation Email Filtering System
Tubishat, Mohammad
Al-Obeidat, Feras
Sadiq, Ali Safaa
Mirjalili, Seyedali
COMPUTERS, 2023, 12 (10)
[3] Fine-Tuning Next-Generation Genome Editing Tools
Kanchiswamy, Chidananda Nagamangala
Maffei, Massimo
Malnoy, Mickael
Velasco, Riccardo
Kim, Jin-Soo
TRENDS IN BIOTECHNOLOGY, 2016, 34 (07) : 562 - 574
[4] Ham or Spam? A comparative study for some Content-based Classification Algorithms for Email Filtering
Saab, Salwa Adriana
Mitri, Nicholas
Awad, Mariette
2014 17TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE (MELECON), 2014, : 439 - 443
[5] Overview of fine-tuning CNN-Based Models for X-ray Image Classification
Ngoc Ha Pham
Giang Son Tran
PROCEEDINGS OF THE 2024 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT INFORMATION TECHNOLOGY, ICIIT 2024, 2024, : 186 - 196
[6] A Comparative Analysis of Instruction Fine-Tuning Large Language Models for Financial Text Classification
Fatemi, Sorouralsadat
Hu, Yuheng
Mousavi, Maryam
ACM TRANSACTIONS ON MANAGEMENT INFORMATION SYSTEMS, 2025, 16 (01)
[7] Unveiling the Power of Large Language Models: A Comparative Study of Retrieval-Augmented Generation, Fine-Tuning, and Their Synergistic Fusion for Enhanced Performance
Budakoglu, Gulsum
Emekci, Hakan
IEEE ACCESS, 2025, 13 : 30936 - 30951

← 1 →