A comprehensive dual-layer architecture for phishing and spam email detection

被引:9
|
作者
Doshi, Jay [1 ]
Parmar, Kunal [1 ]
Sanghavi, Raj [1 ]
Shekokar, Narendra [1 ]
机构
[1] Dwarkadas J Sanghvi Coll Engineenng, Dept Comp Engineenng, Mumbai, India
关键词
Email phishing; Email spamming; Machine learning; Dual-layer architecture; Deep learning;
D O I
10.1016/j.cose.2023.103378
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The widespread use of email communication for sharing personal, professional, and financial data has rendered it vulnerable to cyber-attacks. Detecting untrustworthy emails with minimal errors is crucial to counter these threats. This research paper focuses on classifying spam and phishing emails, commonly employed to steal confidential information by impersonating legitimate sources. The scale of such attacks is alarmingly high, causing substantial financial losses across sectors like banking, healthcare, technology, and other businesses. This research aims to classify both spam and phishing emails, addressing the lim-itations of existing studies that only focus on one type and consider either the email body or content for feature selection. This research incorporates features from both the email body and content during model training. The authors propose a novel approach that ensures highly accurate classification while effectively handling the common issue of data imbalance in email phishing and spam classification. The approach utilizes a dual-layer architecture, with each layer containing a trained or pre-trained model that classifies data instances into their respective classes. Layer 1 classifies the phishing class, while Layer 2 classifies the spam class. Building upon the proven effectiveness of deep learning techniques for text classification and analysis, the proposed architecture employs models like Artificial Neural Net-works (ANN), Recurrent Neural Networks (RNN), and Convolutional Neural Networks (CNN). Experimen-tal evaluation demonstrates the approach's remarkable accuracy, recall, precision, and F1-score, achieving 99.51%, 99.68%, 99.5%, and 99.52%, respectively. This signifies its high efficacy in detecting and classifying malicious emails with minimal errors, thus holding great promise in enhancing system security against cyber-attacks in email communication.& COPY; 2023 Elsevier Ltd. All rights reserved.
引用
收藏
页数:20
相关论文
共 50 条
  • [1] A Comprehensive Study of Email Spam Botnet Detection
    Khan, Wazir Zada
    Khan, Muhammad Khurram
    Bin Muhaya, Fahad T.
    Aalsalem, Mohammed Y.
    Chao, Han-Chieh
    IEEE COMMUNICATIONS SURVEYS AND TUTORIALS, 2015, 17 (04): : 2271 - 2295
  • [2] A Comprehensive Survey for Intelligent Spam Email Detection
    Karim, Asif
    Azam, Sami
    Shanmugam, Bharanidharan
    Kannoorpatti, Krishnan
    Alazab, Mamoun
    IEEE ACCESS, 2019, 7 : 168261 - 168295
  • [3] A Comprehensive Survey of Phishing Email Detection and Protection Techniques
    Kumar Birthriya, Santosh
    Jain, Ankit Kumar
    INFORMATION SECURITY JOURNAL, 2022, 31 (04): : 411 - 440
  • [4] QUKU: A Dual-Layer Reconfigurable Architecture
    Bergmann, Neil W.
    Shukla, Sunil K.
    Becker, Juergen
    ACM TRANSACTIONS ON EMBEDDED COMPUTING SYSTEMS, 2013, 12
  • [5] Overconfidence in Phishing Email Detection
    Wang, Jingguo
    Li, Yuan
    Rao, H. Raghav
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SYSTEMS, 2016, 17 (11): : 759 - 783
  • [6] Email Embeddings for Phishing Detection
    Gutierrez, Luis Felipe
    Abri, Faranak
    Armstrong, Miriam
    Namin, Akbar Siami
    Jones, Keith S.
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 2087 - 2092
  • [7] Cue Utilization, Phishing Feature and Phishing Email Detection
    Bayl-Smith, Piers
    Sturman, Daniel
    Wiggins, Mark
    FINANCIAL CRYPTOGRAPHY AND DATA SECURITY, FC 2020, 2020, 12063 : 56 - 70
  • [8] Applicability of machine learning in spam and phishing email filtering: review and approaches
    Tushaar Gangavarapu
    C. D. Jaidhar
    Bhabesh Chanduka
    Artificial Intelligence Review, 2020, 53 : 5019 - 5081
  • [9] Applicability of machine learning in spam and phishing email filtering: review and approaches
    Gangavarapu, Tushaar
    Jaidhar, C. D.
    Chanduka, Bhabesh
    ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (07) : 5019 - 5081
  • [10] D-Fence: A Flexible, Efficient, and Comprehensive Phishing Email Detection System
    Lee, Jehyun
    Tang, Farren
    Ye, Pingxiao
    Abbasi, Fahim
    Hay, Phil
    Divakaran, Dinil Mon
    2021 IEEE EUROPEAN SYMPOSIUM ON SECURITY AND PRIVACY (EUROS&P 2021), 2021, : 578 - 597