Content-based concept drift detection for Email spam filtering

被引:9
|
作者
Zi Hayat M. [1 ]
Basiri J. [1 ]
Seyedhossein L. [1 ]
Shakery A. [1 ]
机构
[1] School of Electrical and Computer Engineering, University of Tehran, Tehran
关键词
Concept drift; KL divergence; Language model; Spam filtering;
D O I
10.1109/ISTEL.2010.5734082
中图分类号
学科分类号
摘要
The continued growth of Email usage, which is naturally followed by an increase in unsolicited emails so called spams, motivates research in spam filtering area. In the context of spam filtering systems, addressing the evolving nature of spams, which leads to obsolete the related models, has been always a challenge. In this paper an adaptive spam filtering system based on language model is proposed which can detect concept drift based on computing the deviation in email contents distribution. The proposed method can be used along with any existing classifier; particularly in this paper we use Naïve Bayes method as classifier. The proposed method has been evaluated with Enron data set. The results indicate the efficiency of the method in detecting concept drift and its superiority over Naïve Bayes classifier in terms of accuracy. © 2010 IEEE.
引用
收藏
页码:531 / 536
页数:5
相关论文
共 50 条
  • [41] Feature Selection and Similarity Coefficient Based Method for Email Spam Filtering
    Abdelrahim, Ali Ahmed A.
    Elhadi, Ammar Ahmed E.
    Ibrahim, Hamza
    Elmisbah, Naser
    2013 INTERNATIONAL CONFERENCE ON COMPUTING, ELECTRICAL AND ELECTRONICS ENGINEERING (ICCEEE), 2013, : 630 - 633
  • [42] Detection of Zombie PCs Based on Email Spam Analysis
    Jeong, HyunCheol
    Kim, Huy Kang
    Lee, Sangjin
    Kim, Eunjin
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2012, 6 (05): : 1445 - 1462
  • [43] Content-Based Spam Filtering Using Hybrid Generative Discriminative Learning of Both Textual and Visual Features
    Amayri, Ola
    Bouguila, Nizar
    2012 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 2012), 2012, : 862 - 865
  • [44] Content-based analysis to detect Arabic web spam
    Al-Kabi, Mohammed
    Wahsheh, Heider
    Alsmadi, Izzat
    Al-Shawakfa, Emad
    Wahbeh, Abdullah
    Al-Hmoud, Ahmed
    JOURNAL OF INFORMATION SCIENCE, 2012, 38 (03) : 284 - 296
  • [45] New approaches for content-based analysis towards Online Social Network spam detection
    Ezpeleta, Enaitz
    PROCESAMIENTO DEL LENGUAJE NATURAL, 2018, (60): : 71 - 74
  • [47] Breaking and Fixing Content-Based Filtering
    Dhiman, Mayank
    Jakobsson, Markus
    Yen, Ting-Fang
    PROCEEDINGS OF THE 2017 APWG SYMPOSIUM ON ELECTRONIC CRIME RESEARCH (ECRIME), 2017, : 52 - 56
  • [48] Content-based image filtering for recommendation
    Jung, Kyung-Yong
    FOUNDATIONS OF INTELLIGENT SYSTEMS, PROCEEDINGS, 2006, 4203 : 312 - 321
  • [49] Multi-field Learning for Email Spam Filtering
    Liu, Wuying
    Wang, Ting
    SIGIR 2010: PROCEEDINGS OF THE 33RD ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH DEVELOPMENT IN INFORMATION RETRIEVAL, 2010, : 745 - 746
  • [50] ADAPTIVE PRIVACY POLICY PREDICTION FOR EMAIL SPAM FILTERING
    Rajendran, P.
    Hemalatha, S. M.
    Janaki, M.
    Durkananthini, B.
    2016 WORLD CONFERENCE ON FUTURISTIC TRENDS IN RESEARCH AND INNOVATION FOR SOCIAL WELFARE (STARTUP CONCLAVE), 2016,