Spam e-mail classification for the Internet of Things environment using semantic similarity approach

Authors
S. Venkatraman
B. Surendiran
P. Arun Raj Kumar
Affiliations
[1] National Institute of Technology, Department of Computer Science and Engineering
[2] National Institute of Technology, Department of Computer Science and Engineering
Keywords
Conceptual similarity; E-mail spam detection; Knowledge engineering; Machine learning technique; Wikipedia link structure; Semantic similarity
Abstract
Unauthorized service or product advertising messages sent via electronic mail are called spam e-mails. Detecting spam e-mail remains a challenging task. Existing countermeasures based on statistical keyword, conceptual, and IP address-based blacklists are inefficient because they have difficulty finding new attack patterns generated by Internet of Things botnet devices. Other spam detection approaches rely on a hybrid of conceptual knowledge engineering and machine learning techniques. However, modern spammers evade these hybrid techniques by exploiting word polysemy and word ambiguity, which arise from the context-sensitive nature of words. In this paper, the integration of Naïve Bayesian classification with conceptual and semantic similarity techniques is proposed to combat the ambiguity raised through polysemy in spam detection. To analyse the effectiveness of the approach, experiments were conducted on benchmark data sets such as Spambase, PU1, the Enron corpus, and Ling-spam. The experimental results show that the proposed system achieves an accuracy of 98.89%, higher than that of the existing approaches.
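As a rough illustration of the general idea in the abstract, the following minimal sketch runs a multinomial Naïve Bayes classifier over concept tokens instead of raw words. It is not the authors' method: the `CONCEPTS` map, the toy training messages, and the function names are all hypothetical, and a fixed word-to-concept lookup only stands in for the paper's conceptual and semantic similarity step (it merges related words and does not model polysemy or the Wikipedia link structure).

```python
import math
from collections import Counter

# Hypothetical word-to-concept map; a stand-in for a real semantic similarity step.
CONCEPTS = {
    "cheap": "low_price", "discount": "low_price", "bargain": "low_price",
    "meeting": "work", "agenda": "work", "report": "work",
}

def to_concepts(text):
    """Lowercase, split on whitespace, and replace known words by their concept."""
    return [CONCEPTS.get(word, word) for word in text.lower().split()]

def train(samples):
    """Count concept tokens per class and documents per class."""
    counts = {"spam": Counter(), "ham": Counter()}
    docs = Counter()
    for text, label in samples:
        docs[label] += 1
        counts[label].update(to_concepts(text))
    vocab = set(counts["spam"]) | set(counts["ham"])
    return counts, docs, vocab

def classify(text, model):
    """Return the class with the highest log-posterior under Laplace smoothing."""
    counts, docs, vocab = model
    total_docs = sum(docs.values())
    scores = {}
    for label in counts:
        total_tokens = sum(counts[label].values())
        score = math.log(docs[label] / total_docs)  # class prior
        for token in to_concepts(text):
            # Add-one smoothing keeps unseen tokens from zeroing the likelihood.
            score += math.log((counts[label][token] + 1) / (total_tokens + len(vocab)))
        scores[label] = score
    return max(scores, key=scores.get)

# Toy training data, invented for illustration only.
TRAIN = [
    ("cheap pills discount offer", "spam"),
    ("bargain offer click now", "spam"),
    ("meeting agenda attached", "ham"),
    ("quarterly report for the meeting", "ham"),
]
MODEL = train(TRAIN)
```

With this toy model, `classify("discount pills", MODEL)` and `classify("agenda report", MODEL)` can be compared to see how words that share a concept reinforce each other even when the exact word is rare in the training data.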
Pages: 756 - 776
Page count: 20
Published in
Journal of Supercomputing, 2020, 76 (02): 756 - 776