Evaluating spam filters and Stylometric Detection of AI-generated phishing emails

被引:0
|
作者
Opara, Chidimma [1 ]
Modesti, Paolo [1 ]
Golightly, Lewis [1 ]
机构
[1] Teesside Univ, Dept Comp & Games, Middlesbrough TS1 3BX, England
关键词
AI-generated phishing email; Phishing detection; Stylometric analysis; Large Language Models (LLMs); Machine learning; Cybersecurity;
D O I
10.1016/j.eswa.2025.127044
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The advanced architecture of Large Language Models (LLMs) has revolutionised natural language processing, enabling the creation of text that convincingly mimics legitimate human communication, including phishing emails. As AI-generated phishing emails become increasingly sophisticated, a critical question arises: How effectively can current email systems and detection mechanisms identify these threats? This study addresses this issue by analysing 63 AI-generated phishing emails created using GPT-4o. It evaluates the effectiveness of major email services, Gmail, Outlook, and Yahoo, in filtering these malicious communications. The findings reveal that Gmail and Outlook allowed more AI-generated phishing emails to bypass their filters compared to Yahoo, highlighting vulnerabilities in existing email filtering systems. To mitigate these challenges, we applied 60 stylometric features across four machine learning models: Logistic Regression, Support Vector Machine, Random Forest, and XGBoost. Among these, XGBoost demonstrated superior performance, achieving 96% accuracy and an AUC score of 99%. Key features such as imperative verb count, clause density, and first- person pronoun usage were instrumental to the model's success. The dataset of AI-generated phishing emails is publicly available on Kaggle to foster further research.
引用
收藏
页数:19
相关论文
共 50 条
  • [1] Detection of AI-Generated Emails - A Case Study
    Gryka, Pawel
    Gradon, Kacper
    Kozlowski, Marek
    Kutyla, Milosz
    Janicki, Artur
    19TH INTERNATIONAL CONFERENCE ON AVAILABILITY, RELIABILITY, AND SECURITY, ARES 2024, 2024,
  • [2] StyloAI: Distinguishing AI-Generated Content with Stylometric Analysis
    Opara, Chidimma
    ARTIFICIAL INTELLIGENCE IN EDUCATION: POSTERS AND LATE BREAKING RESULTS, WORKSHOPS AND TUTORIALS, INDUSTRY AND INNOVATION TRACKS, PRACTITIONERS, DOCTORAL CONSORTIUM AND BLUE SKY, AIED 2024, 2024, 2151 : 105 - 114
  • [3] AI-Generated Spam Review Detection Framework with Deep Learning Algorithms and Natural Language Processing
    Wani, Mudasir Ahmad
    Elaffendi, Mohammed
    Shakil, Kashish Ara
    COMPUTERS, 2024, 13 (10)
  • [4] Online Detection of AI-Generated Images
    Epstein, David C.
    Jain, Ishan
    Wang, Oliver
    Zhang, Richard
    2023 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS, ICCVW, 2023, : 382 - 392
  • [5] Evaluating the efficacy of AI content detection tools in differentiating between human and AI-generated text
    Ahmed M. Elkhatat
    Khaled Elsaid
    Saeed Almeer
    International Journal for Educational Integrity, 19
  • [6] Evaluating the efficacy of AI content detection tools in differentiating between human and AI-generated text
    Elkhatat, Ahmed M.
    Elsaid, Khaled
    Almeer, Saeed
    INTERNATIONAL JOURNAL FOR EDUCATIONAL INTEGRITY, 2023, 19 (01)
  • [7] Towards Detection of AI-Generated Texts and Misinformation
    Najee-Ullah, Ahmad
    Landeros, Luis
    Balytskyi, Yaroslav
    Chang, Sang-Yoon
    SOCIO-TECHNICAL ASPECTS IN SECURITY, STAST 2021, 2022, 13176 : 194 - 205
  • [8] Testing of detection tools for AI-generated text
    Weber-Wulff, Debora
    Anohina-Naumeca, Alla
    Bjelobaba, Sonja
    Foltynek, Tomas
    Guerrero-Dib, Jean
    Popoola, Olumide
    Sigut, Petr
    Waddington, Lorna
    INTERNATIONAL JOURNAL FOR EDUCATIONAL INTEGRITY, 2023, 19 (01)
  • [9] Testing of detection tools for AI-generated text
    Debora Weber-Wulff
    Alla Anohina-Naumeca
    Sonja Bjelobaba
    Tomáš Foltýnek
    Jean Guerrero-Dib
    Olumide Popoola
    Petr Šigut
    Lorna Waddington
    International Journal for Educational Integrity, 19
  • [10] A Genre, Scoring, and Authorship Analysis of AI-Generated and Human-Written Refusal Emails
    Wilson, Winny
    Rose, Heath
    BUSINESS AND PROFESSIONAL COMMUNICATION QUARTERLY, 2025,