An Improved Selective Ensemble Method for Spam Filtering

Authors
Cai, Jinye [1,2]
Xu, Pingping [1,2]
Tang, Huiyu [3]
Sun, Lin [1,2]
Affiliations
[1] Southeast Univ, Natl Mobile Commun Res Lab, Nanjing 210096, Jiangsu, Peoples R China
[2] Southeast Univ, Jiangsu Prov Key Lab Sensor Network Technol, Wuxi 214135, Peoples R China
[3] Waseda Univ, Grad Sch IPS, Kitakyushu, Fukuoka 8080135, Japan
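Keywords
Text mining; Classification; Spam filtering; SVM; Clustering; Selective ensemble
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract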
This paper presents an improved selective ensemble method for filtering spam messages. To solve the selection problem, the design clusters the sub-classifiers according to the diversity between them. To improve accuracy and stability, a confidence weight is proposed to evaluate the reliability of the selected sub-classifiers. The training model is built from small datasets, as in real-world conditions. For practical use, the method uses only 150 samples from the user's files and performs bootstrapping on them between 50 and 70 times. Experiments validate the effectiveness of the method for spam filtering.
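
The abstract names the main steps (bootstrapping, clustering sub-classifiers by diversity, confidence weighting) but does not give their details. The following is a minimal sketch of how such a pipeline could be wired together, not the authors' implementation. Everything specific in it is an assumption made for illustration: pairwise disagreement as the diversity measure, a greedy leader clustering with a hypothetical `threshold`, validation accuracy standing in for the confidence weight, and all function names. Sub-classifier training (SVM in the paper's keywords) is left out; the sketch starts from each sub-classifier's predictions on a held-out set.

```python
import random


def bootstrap_indices(n_samples, n_rounds, seed=0):
    """Draw n_rounds bootstrap resamples (with replacement), each of size n_samples."""
    rng = random.Random(seed)
    return [[rng.randrange(n_samples) for _ in range(n_samples)]
            for _ in range(n_rounds)]


def disagreement(p, q):
    """Assumed diversity measure: fraction of samples on which two classifiers differ."""
    return sum(a != b for a, b in zip(p, q)) / len(p)


def accuracy(pred, labels):
    """Fraction of predictions that match the true labels."""
    return sum(a == b for a, b in zip(pred, labels)) / len(labels)


def cluster_by_diversity(preds, threshold):
    """Greedy leader clustering (an assumption, not the paper's algorithm).

    A classifier joins the first cluster whose leader it disagrees with on
    at most `threshold` of the samples; otherwise it starts a new cluster.
    """
    clusters = []  # each cluster is a list of classifier indices, leader first
    for i, p in enumerate(preds):
        for cluster in clusters:
            if disagreement(p, preds[cluster[0]]) <= threshold:
                cluster.append(i)
                break
        else:
            clusters.append([i])
    return clusters


def select_and_weight(preds, labels, threshold=0.2):
    """Keep the most accurate member of each cluster.

    The returned weight is the validation accuracy, used here as a
    stand-in for the paper's confidence weight.
    """
    selected = {}
    for cluster in cluster_by_diversity(preds, threshold):
        best = max(cluster, key=lambda i: accuracy(preds[i], labels))
        selected[best] = accuracy(preds[best], labels)
    return selected


def weighted_vote(votes, weights):
    """Combine votes (+1 spam, -1 legitimate) from the selected classifiers."""
    score = sum(weights[i] * votes[i] for i in weights)
    return 1 if score >= 0 else -1


# Toy data: 4 sub-classifiers evaluated on 6 held-out messages.
labels = [1, 1, -1, -1, 1, -1]
preds = [
    [1, 1, -1, -1, 1, -1],
    [1, 1, -1, -1, 1, 1],
    [-1, -1, 1, -1, 1, -1],
    [-1, -1, 1, -1, 1, 1],
]
resamples = bootstrap_indices(n_samples=150, n_rounds=60)  # 60 lies in the 50-70 range
weights = select_and_weight(preds, labels)
decision = weighted_vote({i: preds[i][0] for i in weights}, weights)
```

In this toy run the four sub-classifiers fall into two clusters, one representative is kept from each, and the more accurate representative dominates the vote on the first message. A real system would train one sub-classifier per bootstrap resample and estimate the weights on data not used for training.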
Pages: 743-747
Page count: 5