Evaluating spam filters and Stylometric Detection of AI-generated phishing emails

被引:0
|
作者
Opara, Chidimma [1 ]
Modesti, Paolo [1 ]
Golightly, Lewis [1 ]
机构
[1] Teesside Univ, Dept Comp & Games, Middlesbrough TS1 3BX, England
关键词
AI-generated phishing email; Phishing detection; Stylometric analysis; Large Language Models (LLMs); Machine learning; Cybersecurity;
D O I
10.1016/j.eswa.2025.127044
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The advanced architecture of Large Language Models (LLMs) has revolutionised natural language processing, enabling the creation of text that convincingly mimics legitimate human communication, including phishing emails. As AI-generated phishing emails become increasingly sophisticated, a critical question arises: How effectively can current email systems and detection mechanisms identify these threats? This study addresses this issue by analysing 63 AI-generated phishing emails created using GPT-4o. It evaluates the effectiveness of major email services, Gmail, Outlook, and Yahoo, in filtering these malicious communications. The findings reveal that Gmail and Outlook allowed more AI-generated phishing emails to bypass their filters compared to Yahoo, highlighting vulnerabilities in existing email filtering systems. To mitigate these challenges, we applied 60 stylometric features across four machine learning models: Logistic Regression, Support Vector Machine, Random Forest, and XGBoost. Among these, XGBoost demonstrated superior performance, achieving 96% accuracy and an AUC score of 99%. Key features such as imperative verb count, clause density, and first- person pronoun usage were instrumental to the model's success. The dataset of AI-generated phishing emails is publicly available on Kaggle to foster further research.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Advanced Detection of AI-Generated Images Through Vision Transformers
    Lamichhane, Darshan
    IEEE ACCESS, 2025, 13 : 3644 - 3652
  • [22] Intrinsic Dimension Estimation for Robust Detection of AI-Generated Texts
    Tulchinskii, Eduard
    Kuznetsov, Kristian
    Kushnareva, Laida
    Cherniavskii, Daniil
    Nikolenko, Sergey
    Burnaev, Evgeny
    Barannikov, Serguei
    Piontkovskaya, Irina
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [23] Do teachers spot AI? Evaluating the detectability of AI-generated texts among student essays
    Fleckenstein, Johanna
    Meyer, Jennifer
    Jansen, Thorben
    Keller, Stefan D.
    Köller, Olaf
    Möller, Jens
    Computers and Education: Artificial Intelligence, 2024, 6
  • [24] Evaluating Accuracy of AI-Generated Travel Vaccine Recommendations: GPTs in Public Health
    Marin-Rodriguez, J. A.
    Rodriguez, M.
    Leyva, L.
    Torralba, C.
    Agustin, F.
    Enriquez, F.
    EUROPEAN JOURNAL OF PUBLIC HEALTH, 2024, 34
  • [25] Synthetic Lies: Understanding AI-Generated Misinformation and Evaluating Algorithmic and Human Solutions
    Zhou, Jiawei
    Zhang, Yixuan
    Luo, Qianni
    Parker, Andrea G.
    De Choudhury, Munmun
    PROCEEDINGS OF THE 2023 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS (CHI 2023), 2023,
  • [26] Evaluating the Authenticity and Readability of AI-Generated Abstracts: An Intriguing Survey Among Ophthalmologists
    Vora, Paras
    Benningfield, Max
    Abou-Jaoude, Michelle
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2024, 65 (07)
  • [27] Evaluating the Efficacy of Hybrid Deep Learning Models in Distinguishing AI-Generated Text
    Oketunji, Abiodun Finbarrs
    arXiv, 2023,
  • [28] Evaluating Descriptive Quality of AI-Generated Audio Using Image-Schemas
    Kamath, Purnima
    Li, Zhuoyao
    Gupta, Chitralekha
    Jaidka, Kokil
    Nanayakkara, Suranga
    Wyse, Lonce
    PROCEEDINGS OF 2023 28TH ANNUAL CONFERENCE ON INTELLIGENT USER INTERFACES, IUI 2023, 2023, : 621 - 632
  • [29] Evaluating AI-Generated Language as Models for Strategic Competence in English Language Teaching
    Nguyen, Phuong-Anh
    IAFOR JOURNAL OF EDUCATION, 2024, 12 (03)
  • [30] Hidding the Ghostwriters: An Adversarial Evaluation of AI-Generated Student Essay Detection
    Peng, Xinlin
    Zhou, Ying
    He, Ben
    Sun, Le
    Sun, Yingfei
    2023 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP 2023), 2023, : 10406 - 10419