On persuasion in spam email: A multi-granularity text analysis

被引:0
|
作者
Janez-Martino, Francisco [1 ]
Barron-Cedeno, Alberto [2 ]
Alaiz-Rodriguez, Rocio [1 ]
Gonzalez-Castro, Victor [1 ]
Muti, Arianna [2 ]
机构
[1] Univ Leon, Dept Elect Engn Syst & Automat, Leon, Spain
[2] Univ Bologna, DIT, Forli, Italy
关键词
Social engineering; Persuasion detection; Spam email; Cybersecurity; PHISHING EMAILS;
D O I
10.1016/j.eswa.2024.125767
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Electronic mail (email) is one of the most popular communication media for direct and private communication. Being typically a free service and anonymity-friendly, massive spam email campaigns are common. Nowadays, spam email encompasses scam, phishing, malware distribution, and various other cybersecurity threats. Within these emails, recipients frequently encounter social engineering techniques aimed at persuading them to take an action, such as clicking on a hyperlink, opening an attachment or responding. In this paper, we conduct a study on supervised models to identify persuasion (binary classification) and to identify the specific persuasion techniques that are commonly used in spam email (multilabel classification). To achieve this, we develop systems capable of spotting persuasion in spam emails based on natural language processing techniques. We approach this challenging task at different levels of granularity: full email, sentences and specific text snippets (i.e. text fragments composed by one or more words, typically shorter than a sentence). We replicate and adapt two methodologies used to detect propaganda in news articles. Additionally, we build a custom spam email dataset, and fine-tune pre-trained RoBERTa-based transformer models to tackle the sentence level detection. This allows us to determine how extensively spam emails rely on persuasion to achieve their goals and, if so, to identify those techniques that would be employed for user protection and cybersecurity improvements.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Text Sentiment Analysis Based on Multi-Granularity Joint Solution
    Fang, Xianghui
    Wang, Guoyin
    Liu, Qun
    2018 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA), 2018, : 315 - 321
  • [2] Multi-granularity Prediction for Scene Text Recognition
    Wang, Peng
    Da, Cheng
    Yao, Cong
    COMPUTER VISION - ECCV 2022, PT XXVIII, 2022, 13688 : 339 - 355
  • [3] Multi-Granularity Chinese Text Sentiment Analysis Driven by Knowledge and Data
    Liu, Zhongbao
    Wang, Yufei
    Computer Engineering and Applications, 2023, 59 (15) : 177 - 186
  • [4] A Multi-Granularity Heterogeneous Graph for Extractive Text Summarization
    Zhao, Henghui
    Zhang, Wensheng
    Huang, Mengxing
    Feng, Siling
    Wu, Yuanyuan
    ELECTRONICS, 2023, 12 (10)
  • [5] A Multi-Granularity Semantic Extraction Method for Text Classification
    Li, Min
    Liu, Zeyu
    Li, Gang
    Han, Delong
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XIII, ICIC 2024, 2024, 14874 : 224 - 236
  • [6] MAGIC: Multi-granularity domain adaptation for text recognition
    Zhang, Jia-Ying
    Liu, Xiao-Qian
    Xue, Zhi-Yuan
    Luo, Xin
    Xu, Xin-Shun
    PATTERN RECOGNITION, 2025, 161
  • [7] Research on Text Classification by Fusing Multi-Granularity Information
    Xin, Miaomiao
    Ma, Li
    Hu, Bofa
    Computer Engineering and Applications, 2023, 59 (09) : 104 - 111
  • [8] Tracing content requirements in financial documents using multi-granularity text analysis
    Li, Xiaochen
    Bianculli, Domenico
    Briand, Lionel
    REQUIREMENTS ENGINEERING, 2025, : 109 - 132
  • [9] Text tendency analysis based on multi-granularity emotional chunks and integrated learning
    Haichao Sun
    Guoyin Wang
    Shuyin Xia
    Neural Computing and Applications, 2021, 33 : 8119 - 8129
  • [10] Text tendency analysis based on multi-granularity emotional chunks and integrated learning
    Sun, Haichao
    Wang, Guoyin
    Xia, Shuyin
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (14): : 8119 - 8129