Collective Classification for Spam Filtering

被引:0
|
作者
Laorden, Carlos [1 ]
Sanz, Borja [1 ]
Santos, Igor [1 ]
Galan-Garcia, Patxi [1 ]
Bringas, Pablo G. [1 ]
机构
[1] Univ Deusto, DeustoTech Comp S3Lab, Bilbao 48007, Spain
关键词
Spam filtering; collective classification; semi-supervised learning; MODEL;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Spam has become a major issue in computer security because it is a channel for threats such as computer viruses, worms and phishing. Many solutions feature machine-learning algorithms trained using statistical representations of the terms that usually appear in the e-mails. Still, these methods require a training step with labelled data. Dealing with the situation where the availability of labelled training instances is limited slows down the progress of filtering systems and offers advantages to spammers. Currently, many approaches direct their efforts into Semi-Supervised Learning (SSL). SSL is a halfway method between supervised and unsupervised learning, which, in addition to unlabelled data, receives some supervision information such as the association of the targets with some of the examples. Collective Classification for Text Classification poses as an interesting method for optimising the classification of partially-labelled data. In this way, we propose here, for the first time, Collective Classification algorithms for spam filtering to overcome the amount of unclassified e-mails that are sent every day.
引用
收藏
页码:1 / 8
页数:8
相关论文
共 50 条
  • [21] Adaptive filtering of SPAM
    Pelletier, L
    Almhana, J
    Choulakian, V
    SECOND ANNUAL CONFERENCE ON COMMUNICATION NETWORKS AND SERVICES RESEARCH, PROCEEDINGS, 2004, : 218 - 224
  • [22] Spam filtering scheme
    Wang, Jing (wngjing@hotmail.com), 1600, Northeast University (35):
  • [23] Email Spam Filtering
    Puertas Sanz, Enrique
    Gomez Hidalgo, Jose Maria
    Cortizo Perez, Jose Carlos
    ADVANCES IN COMPUTERS, VOL 74: SOFTWARE DEVELOPMENT, 2008, 74 : 45 - 114
  • [24] Spam Filtering Email Classification (SFECM) using Gain and Graph Mining Algorithm
    Chae, M. K.
    Alsadoon, Abeer
    Prasad, P. W. C.
    Sreedharan, Sasikumaran
    2017 2ND INTERNATIONAL CONFERENCE ON ANTI-CYBER CRIMES (ICACC), 2017, : 217 - 222
  • [25] Spam Filtering Email Classification (SFECM) using Gain and Graph Mining Algorithm
    Chae, M. K.
    Alsadoon, Abeer
    Prasad, P. W. C.
    Elchouemi, A.
    2017 IEEE 7TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE IEEE CCWC-2017, 2017,
  • [26] Discovering Classification Rules for Email Spam Filtering with an Ant Colony Optimization Algorithm
    El-Alfy, El-Sayed M.
    2009 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION, VOLS 1-5, 2009, : 1778 - 1783
  • [27] Ending Spam-Bayesian Content Filtering and the Art of Statistical Language Classification
    Webster, Craig S.
    PROMETHEUS, 2006, 24 (01) : 121 - 124
  • [28] Layout Based Spam Filtering
    Musat, Claudiu N.
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 12, 2006, 12 : 161 - 164
  • [29] A scalable spam filtering architecture
    Ferreira, Nuno
    Carvalho, Gracinda
    Pereira, Paulo Rogério
    IFIP Advances in Information and Communication Technology, 2013, 394 : 107 - 114
  • [30] Filtering Spam with Behavioral Blacklisting
    Ramachandran, Anirudh
    Feamster, Nick
    Vempala, Santosh
    CCS'07: PROCEEDINGS OF THE 14TH ACM CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2007, : 342 - 351