Spam e-mail classification for the Internet of Things environment using semantic similarity approach

被引:0
|
作者
S. Venkatraman
B. Surendiran
P. Arun Raj Kumar
机构
[1] National Institute of Technology,Department of Computer Science and Engineering
[2] National Institute of Technology,Department of Computer Science and Engineering
来源
关键词
Conceptual similarity; E-mail spam detection; Knowledge engineering; Machine learning technique; Wikipedia link structure; Semantic similarity;
D O I
暂无
中图分类号
学科分类号
摘要
Unauthorized service or product advertising messages sent via electronic mails are called as spam e-mails. Detecting spam e-mail remains a challenging task. Existing countermeasures based on the statistical keyword, conceptual and IP address-based blacklists are not efficient due to difficulty in finding new attack patterns generated by the Internet of Things botnet devices. The other spam detection approaches rely on a hybrid of conceptual knowledge engineering with machine learning techniques. But, modern spammers evade the hybrid techniques through word polysemy and word ambiguity due to the context-sensitive nature of words. In this paper, the integration of Naïve Bayesian classification with conceptual and semantic similarity technique is proposed to combat the ambiguity raised through polysemy in spam detection. To analyse the effectiveness of our approach, the experiments were conducted on benchmark data sets such as Spambase, PU1, Enron corpus, and Ling-spam. From the experimental results, it is evident that our proposed system achieves high accuracy of 98.89% than the existing approaches.
引用
收藏
页码:756 / 776
页数:20
相关论文
共 50 条
  • [21] Performance Analysis of E-Mail Spam Classification using different Machine Learning Techniques
    Vinitha, V. Sri
    Renuka, D. Karthika
    PROCEEDINGS OF THE 2019 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING & COMMUNICATION ENGINEERING (ICACCE-2019), 2019,
  • [22] Reverse of E-mail Spam Filtering Algorithms to Maintain E-mail Deliverability
    AlRashid, Hussah
    AlZahrani, Rasheed
    ElQawasmeh, Eyas
    2014 FOURTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION AND COMMUNICATION TECHNOLOGY AND IT'S APPLICATIONS (DICTAP), 2014, : 297 - 300
  • [23] Distributed layer-3 e-mail classification for SPAM control
    Marsono, Muhammad N.
    El-Khaxashi, M. Watheq
    Gebali, Fayez
    Ganti, Sudhakar
    2006 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, VOLS 1-5, 2006, : 957 - +
  • [24] Internet and E-mail
    The Internet Business Journal, 1993, 1 (02):
  • [25] Using E-Mail SPAM DNS Blacklists for qualifying the SPAM-over-Internet-Telephony Probability of a SIP Call
    Hirschbichler, M.
    Egger, C.
    Pasteka, O.
    Berger, A.
    THIRD INTERNATIONAL CONFERENCE ON DIGITAL SOCIETY: ICDS 2009, PROCEEDINGS, 2009, : 254 - 259
  • [26] E-mail marketing at the crossroads - A stakeholder analysis of unsolicited commercial e-mail (spam)
    Moustakas, E
    Ranganathan, C
    Duquenoy, P
    INTERNET RESEARCH, 2006, 16 (01) : 38 - 52
  • [27] SPAM E-MAIL FILTERING USING POLYNOMIAL NEURAL NETWORKS
    Al-Tahrawi, Mayy M.
    Abualhaj, Mosleh M.
    Shambour, Qusai Y.
    JOURNAL OF ENGINEERING SCIENCE AND TECHNOLOGY, 2020, 15 (03): : 2090 - 2109
  • [28] Cloud e-mail security: An accurate e-mail spam classification based on enhanced binary differential evolution (BDE) algorithm
    Hamed, Nadir O.
    Samak, Ahmed H.
    Ahmad, Mostafa A.
    Journal of Intelligent and Fuzzy Systems, 2021, 41 (06): : 5943 - 5955
  • [29] Cloud e-mail security: An accurate e-mail spam classification based on enhanced binary differential evolution (BDE) algorithm
    Hamed, Nadir O.
    Samak, Ahmed H.
    Ahmad, Mostafa A.
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2021, 41 (06) : 5943 - 5955
  • [30] Impact of spam advertisement through e-mail: A study to assess the influence of the anti-spam on the e-mail marketing
    Raad, Mostafa
    Yeassen, Norizan Mohd
    Alam, Gazi Mahabubul
    Zaidan, B. B.
    Zaidan, A. A.
    AFRICAN JOURNAL OF BUSINESS MANAGEMENT, 2010, 4 (11): : 2362 - 2367