Content-based concept drift detection for Email spam filtering

Cited by: 9
Authors
Zi Hayat M. [1]
Basiri J. [1]
Seyedhossein L. [1]
Shakery A. [1]
Affiliation
[1] School of Electrical and Computer Engineering, University of Tehran, Tehran
Keywords
Concept drift; KL divergence; Language model; Spam filtering
DOI
10.1109/ISTEL.2010.5734082
Abstract
The continued growth of email usage, naturally followed by an increase in unsolicited emails (spam), motivates research in spam filtering. A persistent challenge for spam filtering systems is the evolving nature of spam, which renders trained models obsolete. This paper proposes an adaptive spam filtering system based on language models that detects concept drift by computing the deviation in the distribution of email content. The proposed method can be used with any existing classifier; in this paper, Naïve Bayes is used as the classifier. The method is evaluated on the Enron data set. The results indicate that the method detects concept drift effectively and achieves higher accuracy than the Naïve Bayes classifier alone. © 2010 IEEE.
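The abstract and keywords mention language models and KL divergence but give no formulas, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes Laplace-smoothed unigram models over whitespace tokens, compares a reference window of emails with a recent window, and flags drift when the divergence exceeds a threshold. The function names, the smoothing constant `alpha`, and the `threshold` value are all hypothetical choices made for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization; the paper may use another scheme.
    return text.lower().split()


def unigram_model(emails, vocab, alpha=1.0):
    """Laplace-smoothed unigram language model over a fixed vocabulary."""
    counts = Counter(tok for text in emails for tok in tokenize(text))
    total = sum(counts[w] for w in vocab) + alpha * len(vocab)
    return {w: (counts[w] + alpha) / total for w in vocab}


def drift_score(reference, recent, alpha=1.0):
    """KL(recent || reference) in nats between the two window models."""
    vocab = {tok for text in reference + recent for tok in tokenize(text)}
    p = unigram_model(recent, vocab, alpha)
    q = unigram_model(reference, vocab, alpha)
    # Smoothing keeps every probability positive, so the log is always defined.
    return sum(p[w] * math.log(p[w] / q[w]) for w in vocab)


def drift_detected(reference, recent, threshold=0.1):
    # Hypothetical threshold; in practice it would be tuned on held-out data.
    return drift_score(reference, recent) > threshold


reference_window = ["cheap pills buy now", "win money now"]
recent_window = ["crypto wallet invest bitcoin", "bitcoin profit invest"]
print(drift_detected(reference_window, reference_window))
print(drift_detected(reference_window, recent_window))
```

When drift is flagged, a system of this kind would typically retrain or update the underlying classifier (Naïve Bayes in the paper) on the recent window; that step is omitted here.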
Pages: 531-536
Number of pages: 5
Related papers
50 in total
  • [1] Content-Based Spam Filtering
    Almeida, Tiago A.
    Yamakami, Akebo
    2010 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS IJCNN 2010, 2010,
  • [2] Ham or Spam? A comparative study for some Content-based Classification Algorithms for Email Filtering
    Saab, Salwa Adriana
    Mitri, Nicholas
    Awad, Mariette
    2014 17TH IEEE MEDITERRANEAN ELECTROTECHNICAL CONFERENCE (MELECON), 2014, : 439 - 443
  • [3] An Overview of Content-Based Spam Filtering Techniques
    Khorsi, Ahmed
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2007, 31 (03): : 269 - 277
  • [4] Symbiotic filtering for spam email detection
    Lopes, Clotilde
    Cortez, Paulo
    Sousa, Pedro
    Rocha, Miguel
    Rio, Miguel
    EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (08) : 9365 - 9372
  • [5] Content-based Approach for Vietnamese Spam SMS Filtering
    Pham, Thai-Hoang
    Le-Hong, Phuong
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2016, : 41 - 44
  • [6] A Content-Based Phishing Email Detection Method
    Che, Hongming
    Liu, Qinyun
    Zou, Lin
    Yang, Hongji
    Zhou, Dongdai
    Yu, Feng
    2017 IEEE INTERNATIONAL CONFERENCE ON SOFTWARE QUALITY, RELIABILITY AND SECURITY COMPANION (QRS-C), 2017, : 415 - 422
  • [7] Email Spam Filtering
    Puertas Sanz, Enrique
    Gomez Hidalgo, Jose Maria
    Cortizo Perez, Jose Carlos
    ADVANCES IN COMPUTERS, VOL 74: SOFTWARE DEVELOPMENT, 2008, 74 : 45 - 114
  • [8] Content Based Spam Detection in Email using Bayesian Classifier
    Rathod, Sunil B.
    Pattewar, Tareek M.
    2015 INTERNATIONAL CONFERENCE ON COMMUNICATIONS AND SIGNAL PROCESSING (ICCSP), 2015, : 1257 - 1261
  • [9] Lazy associative classification for content-based spam detection
    Veloso, Adriano
    Meira, Wagner, Jr.
    LA-WEB 06: FOURTH LATIN AMERICAN WEB CONGRESS, PROCEEDINGS, 2006, : 154 - +
  • [10] Email Spam Filtering Based on the MNMF Algorithm
    Liu, Zun-xiong
    Tian, Shan-shan
    Huang, Zhi-qiang
    Liu, Jiang-wei
    INTERNATIONAL JOURNAL OF SECURITY AND ITS APPLICATIONS, 2016, 10 (01): : 31 - 44