Spam e-mail classification for the Internet of Things environment using semantic similarity approach

被引:0
|
作者
S. Venkatraman
B. Surendiran
P. Arun Raj Kumar
机构
[1] National Institute of Technology,Department of Computer Science and Engineering
[2] National Institute of Technology,Department of Computer Science and Engineering
来源
关键词
Conceptual similarity; E-mail spam detection; Knowledge engineering; Machine learning technique; Wikipedia link structure; Semantic similarity;
D O I
暂无
中图分类号
学科分类号
摘要
Unauthorized service or product advertising messages sent via electronic mails are called as spam e-mails. Detecting spam e-mail remains a challenging task. Existing countermeasures based on the statistical keyword, conceptual and IP address-based blacklists are not efficient due to difficulty in finding new attack patterns generated by the Internet of Things botnet devices. The other spam detection approaches rely on a hybrid of conceptual knowledge engineering with machine learning techniques. But, modern spammers evade the hybrid techniques through word polysemy and word ambiguity due to the context-sensitive nature of words. In this paper, the integration of Naïve Bayesian classification with conceptual and semantic similarity technique is proposed to combat the ambiguity raised through polysemy in spam detection. To analyse the effectiveness of our approach, the experiments were conducted on benchmark data sets such as Spambase, PU1, Enron corpus, and Ling-spam. From the experimental results, it is evident that our proposed system achieves high accuracy of 98.89% than the existing approaches.
引用
收藏
页码:756 / 776
页数:20
相关论文
共 50 条
  • [41] A new way to can e-mail Spam
    Creswell, J
    FORTUNE, 2003, 147 (08) : 34 - 34
  • [42] E-mail and internet abuse
    不详
    IRISH VETERINARY JOURNAL, 2003, 56 (11) : 578 - 578
  • [43] An E-mail starter kit: Internet E-mail with minimum hassle
    Jung, W
    ELECTRONIC DESIGN, 1997, 45 (03) : 167 - 169
  • [44] Spam/Ham E-Mail Classification using Machine Learning Methods based on Bag of Words Technique
    Sahin, Esra
    Aydos, Murat
    Orhan, Fatih
    2018 26TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2018,
  • [45] Experience in management of e-mail delivery delay problems associated with spam e-mail filtering in a university
    Hisanaga, Yutaka
    Sugii, Manabu
    Wang, Yue
    Osa, Atsushi
    Miike, Hidetoshi
    ELECTRONICS AND COMMUNICATIONS IN JAPAN, 2012, 95 (01) : 8 - 19
  • [46] USING E-MAIL AND THE INTERNET TO TEACH USERS AT THEIR DESKTOPS
    JENSEN, A
    SIH, J
    ONLINE, 1995, 19 (05): : 82 - 86
  • [47] Targeting spam control on middleboxes: Spam detection based on layer-3 e-mail content classification
    Marsono, Muhammad N.
    El-Kharashi, M. Watheq
    Gebali, Fayez
    COMPUTER NETWORKS, 2009, 53 (06) : 835 - 848
  • [48] An Optimistic Certified E-mail Protocol for the Current Internet E-mail Architecture
    Draper-Gil, Gerard
    Ferrer-Gomila, Josep L.
    Hinarejos, M. Francisca
    Tauber, Arne
    2014 IEEE CONFERENCE ON COMMUNICATIONS AND NETWORK SECURITY (CNS), 2014, : 382 - 390
  • [49] An ensemble design approach based on bagging technique for filtering e-mail spam
    Roy S.S.
    Viswanatham V.M.
    Krishna P.V.
    Roy, Sanjiban Sekhar (s.roy@vit.ac.in), 1600, Inderscience Enterprises Ltd., 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland (10): : 247 - 260
  • [50] Rule-Based Spam E-mail Annotation
    Fiumara, Giacomo
    Marchi, Massimo
    Pagano, Rosamaria
    Provetti, Alessandro
    WEB REASONING AND RULE SYSTEMS, 2010, 6333 : 231 - 234