Spam e-mail classification for the Internet of Things environment using semantic similarity approach

被引：0

作者：

S. Venkatraman

B. Surendiran

P. Arun Raj Kumar

机构：

[1] National Institute of Technology,Department of Computer Science and Engineering

[2] National Institute of Technology,Department of Computer Science and Engineering

来源：

The Journal of Supercomputing | 2020年 / 76卷

关键词：

Conceptual similarity; E-mail spam detection; Knowledge engineering; Machine learning technique; Wikipedia link structure; Semantic similarity;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Unauthorized service or product advertising messages sent via electronic mails are called as spam e-mails. Detecting spam e-mail remains a challenging task. Existing countermeasures based on the statistical keyword, conceptual and IP address-based blacklists are not efficient due to difficulty in finding new attack patterns generated by the Internet of Things botnet devices. The other spam detection approaches rely on a hybrid of conceptual knowledge engineering with machine learning techniques. But, modern spammers evade the hybrid techniques through word polysemy and word ambiguity due to the context-sensitive nature of words. In this paper, the integration of Naïve Bayesian classification with conceptual and semantic similarity technique is proposed to combat the ambiguity raised through polysemy in spam detection. To analyse the effectiveness of our approach, the experiments were conducted on benchmark data sets such as Spambase, PU1, Enron corpus, and Ling-spam. From the experimental results, it is evident that our proposed system achieves high accuracy of 98.89% than the existing approaches.

引用

页码：756 / 776

页数：20

共 50 条

[1] Spam e-mail classification for the Internet of Things environment using semantic similarity approach
Venkatraman, S.
Surendiran, B.
Kumar, P. Arun Raj
JOURNAL OF SUPERCOMPUTING, 2020, 76 (02): : 756 - 776
[2] SpamED: A Spam E-Mail Detection Approach Based on Phrase Similarity
Pera, Maria Soledad
Ng, Yiu-Kai
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2009, 60 (02): : 393 - 409
[3] Using E-mail Authentication and Disposable E-mail Addressing for Filtering Spam
Luo, Jia-Ning
Yang, Ming Hour
2009 10TH INTERNATIONAL SYMPOSIUM ON PERVASIVE SYSTEMS, ALGORITHMS, AND NETWORKS (ISPAN 2009), 2009, : 356 - +
[4] Classification of Textual E-Mail Spam Using Data Mining Techniques
Alguliev, Rasim M.
Aliguliyev, Ramiz M.
Nazirova, Saadat A.
APPLIED COMPUTATIONAL INTELLIGENCE AND SOFT COMPUTING, 2011, 2011
[5] Spam Classification Based on E-Mail Path Analysis
Palla, Srikanth
Dantu, Ram
Cangussu, Joao W.
INTERNATIONAL JOURNAL OF INFORMATION SECURITY AND PRIVACY, 2008, 2 (02) : 46 - 69
[6] Spam E-Mail Classification Based on the IFWB Algorithm
Jou, Chichang
INTELLIGENT INFORMATION AND DATABASE SYSTEMS (ACIIDS 2013), PT I,, 2013, 7802 : 314 - 324
[7] Addressing Spam E-Mail Using Hashcast
Curran, Kevin
Honan, John Stephen
INTERNATIONAL JOURNAL OF BUSINESS DATA COMMUNICATIONS AND NETWORKING, 2005, 1 (02) : 41 - 65
[8] E-mail, hold the spam
Hoyle, J
JOURNAL OF THE AMERICAN DENTAL ASSOCIATION, 2000, 131 (10): : 1426 - 1426
[9] Semantic Graph Based Convolutional Neural Network for Spam e-mail Classification in Cybercrime Applications
Nisha, S. Rahmath
Muthurajkumar, S.
INTERNATIONAL JOURNAL OF COMPUTERS COMMUNICATIONS & CONTROL, 2023, 18 (01)
[10] E-mail Spam Classification Using Grasshopper Optimization Algorithm and Neural Networks
Ghaleb, Sanaa A. A.
Mohamad, Mumtazimah
Fadzli, Syed Abdullah
Ghanem, Waheed A. H. M.
CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 71 (03): : 4749 - 4766

← 1 2 3 4 5 →