Evaluating spam filters and Stylometric Detection of AI-generated phishing emails

被引:0
|
作者
Opara, Chidimma [1 ]
Modesti, Paolo [1 ]
Golightly, Lewis [1 ]
机构
[1] Teesside Univ, Dept Comp & Games, Middlesbrough TS1 3BX, England
关键词
AI-generated phishing email; Phishing detection; Stylometric analysis; Large Language Models (LLMs); Machine learning; Cybersecurity;
D O I
10.1016/j.eswa.2025.127044
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The advanced architecture of Large Language Models (LLMs) has revolutionised natural language processing, enabling the creation of text that convincingly mimics legitimate human communication, including phishing emails. As AI-generated phishing emails become increasingly sophisticated, a critical question arises: How effectively can current email systems and detection mechanisms identify these threats? This study addresses this issue by analysing 63 AI-generated phishing emails created using GPT-4o. It evaluates the effectiveness of major email services, Gmail, Outlook, and Yahoo, in filtering these malicious communications. The findings reveal that Gmail and Outlook allowed more AI-generated phishing emails to bypass their filters compared to Yahoo, highlighting vulnerabilities in existing email filtering systems. To mitigate these challenges, we applied 60 stylometric features across four machine learning models: Logistic Regression, Support Vector Machine, Random Forest, and XGBoost. Among these, XGBoost demonstrated superior performance, achieving 96% accuracy and an AUC score of 99%. Key features such as imperative verb count, clause density, and first- person pronoun usage were instrumental to the model's success. The dataset of AI-generated phishing emails is publicly available on Kaggle to foster further research.
引用
收藏
页数:19
相关论文
共 50 条
  • [31] Learning Embodied Sound-Motion Mappings: Evaluating AI-Generated Dance Improvisation
    Wallace, Benedikte
    Martin, Charles P.
    Torresen, Jim
    Nymoen, Kristian
    C&C'21: PROCEEDINGS OF THE 13TH CONFERENCE ON CREATIVITY AND COGNITION, 2021,
  • [32] Evaluating the fidelity of AI-generated information on long-acting reversible contraceptive methods
    Riley, Grace
    Wang, Elizabeth
    Flynn, Camille
    Lopez, Ashley
    Sridhar, Aparna
    EUROPEAN JOURNAL OF CONTRACEPTION AND REPRODUCTIVE HEALTH CARE, 2025,
  • [33] The Promise and Pitfalls of AI-Generated Anatomical Images: Evaluating Midjourney for Aesthetic Surgery Applications
    Giovanni Buzzaccarini
    Rebecca Susanna Degliuomini
    Marco Borin
    Anastasia Fidanza
    Noemi Salmeri
    Luigi Schiraldi
    Pietro Giovanni Di Summa
    Franco Vercesi
    Valeria Stella Vanni
    Massimo Candiani
    Luca Pagliardini
    Aesthetic Plastic Surgery, 2024, 48 : 1874 - 1883
  • [34] The Promise and Pitfalls of AI-Generated Anatomical Images: Evaluating Midjourney for Aesthetic Surgery Applications
    Buzzaccarini, Giovanni
    Degliuomini, Rebecca Susanna
    Borin, Marco
    Fidanza, Anastasia
    Salmeri, Noemi
    Schiraldi, Luigi
    Di Summa, Pietro Giovanni
    Vercesi, Franco
    Vanni, Valeria Stella
    Candiani, Massimo
    Pagliardini, Luca
    AESTHETIC PLASTIC SURGERY, 2024, 48 (09) : 1874 - 1883
  • [35] Evaluating diagnostic content of AI-generated radiology reports of chest X-rays
    Babar, Zaheer
    van Laarhoven, Twan
    Zanzotto, Fabio Massimo
    Marchiori, Elena
    ARTIFICIAL INTELLIGENCE IN MEDICINE, 2021, 116 (116)
  • [36] Human vs. Machine: A Comparative Study on the Detection of AI-Generated Content
    Tadjine, Amal bou
    Harrag, Fouzi
    Shaalan, Khaled
    ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2025, 24 (02)
  • [37] Toward Robust Arabic AI-Generated Text Detection: Tackling Diacritics Challenges
    Alshammari, Hamed
    Elleithy, Khaled
    INFORMATION, 2024, 15 (07)
  • [38] AI-Generated Video Detection via Spatial-Temporal Anomaly Learning
    Bai, Jianfa
    Lin, Man
    Cao, Gang
    Lou, Zijie
    PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT X, 2025, 15040 : 460 - 470
  • [39] Artifact feature purification for cross-domain detection of AI-generated images
    Meng, Zheling
    Peng, Bo
    Dong, Jing
    Tan, Tieniu
    Cheng, Haonan
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, 247
  • [40] Evaluating the Coherence and Diversity in AI-Generated and Paraphrased Scientific Abstracts: A Fuzzy Topic Modeling Approach
    Onan, Aytug
    Celikten, Tugba
    INTELLIGENT AND FUZZY SYSTEMS, INFUS 2024 CONFERENCE, VOL 1, 2024, 1088 : 149 - 157