On persuasion in spam email: A multi-granularity text analysis

Authors
Janez-Martino, Francisco [1]
Barron-Cedeno, Alberto [2]
Alaiz-Rodriguez, Rocio [1]
Gonzalez-Castro, Victor [1]
Muti, Arianna [2]
Affiliations
[1] Univ Leon, Dept Elect Engn Syst & Automat, Leon, Spain
[2] Univ Bologna, DIT, Forli, Italy
Keywords
Social engineering; Persuasion detection; Spam email; Cybersecurity; Phishing emails
DOI
10.1016/j.eswa.2024.125767
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Electronic mail (email) is one of the most popular media for direct and private communication. Because email is typically free and anonymity-friendly, massive spam campaigns are common. Nowadays, spam email encompasses scams, phishing, malware distribution, and various other cybersecurity threats. Within these emails, recipients frequently encounter social engineering techniques aimed at persuading them to take an action, such as clicking on a hyperlink, opening an attachment, or responding. In this paper, we study supervised models to identify persuasion (binary classification) and to identify the specific persuasion techniques commonly used in spam email (multilabel classification). To this end, we develop systems based on natural language processing techniques that spot persuasion in spam emails. We approach this challenging task at three levels of granularity: full emails, sentences, and specific text snippets (i.e., text fragments composed of one or more words, typically shorter than a sentence). We replicate and adapt two methodologies used to detect propaganda in news articles. Additionally, we build a custom spam email dataset and fine-tune pre-trained RoBERTa-based transformer models to tackle sentence-level detection. This allows us to determine how extensively spam emails rely on persuasion to achieve their goals and to identify the techniques employed, knowledge that could be used to improve user protection and cybersecurity.
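The relationship between the three granularities and the two classification tasks can be illustrated with a minimal sketch. It is not the authors' pipeline: the technique names in `TECHNIQUES`, the `Snippet` type, the naive sentence splitter, and the helper names are all assumptions made for illustration. It only shows how snippet-level annotations could be projected onto sentence-level and email-level multilabel vectors, and how a binary "persuasion present" label follows from them.

```python
import re
from dataclasses import dataclass

# Hypothetical label inventory, for illustration only; the record above
# does not list the persuasion techniques used in the paper.
TECHNIQUES = ["authority", "scarcity", "reciprocity"]


@dataclass
class Snippet:
    start: int      # character offset in the email, inclusive
    end: int        # character offset in the email, exclusive
    technique: str  # one of TECHNIQUES


def split_sentences(email: str) -> list[tuple[int, int]]:
    """Naive sentence spans: cut after '.', '!' or '?' followed by whitespace."""
    spans = []
    start = 0
    for match in re.finditer(r"[.!?]+\s+", email):
        spans.append((start, match.end()))
        start = match.end()
    if start < len(email):
        spans.append((start, len(email)))
    return spans


def multilabel_vector(snippets: list[Snippet], start: int, end: int) -> list[int]:
    """Multi-hot vector: 1 for each technique with a snippet overlapping [start, end)."""
    present = {s.technique for s in snippets if s.start < end and s.end > start}
    return [int(t in present) for t in TECHNIQUES]


def binary_label(vector: list[int]) -> int:
    """Binary task: 1 if any technique is present, else 0."""
    return int(any(vector))


email = "Your account is locked. Act now, only 2 hours left! Regards, the bank."
phrase = "only 2 hours left"
offset = email.index(phrase)
snippets = [Snippet(offset, offset + len(phrase), "scarcity")]

sentence_spans = split_sentences(email)
sentence_vectors = [multilabel_vector(snippets, s, e) for s, e in sentence_spans]
email_vector = multilabel_vector(snippets, 0, len(email))

for (s, e), vec in zip(sentence_spans, sentence_vectors):
    print(repr(email[s:e]), vec, binary_label(vec))
print("email-level:", email_vector, binary_label(email_vector))
```

In this sketch the snippet level is the source annotation, and the coarser levels are derived by overlap; a trained classifier would replace the hand-written `snippets` list with model predictions.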
Pages: 10