An Improved Selective Ensemble Method for Spam Filtering

被引:0
|
作者
Cai, Jinye [1 ,2 ]
Xu, Pingping [1 ,2 ]
Tang, Huiyu [3 ]
Sun, Lin [1 ,2 ]
机构
[1] Southeast Univ, Natl Mobile Commun Res Lab, Nanjing 210096, Jiangsu, Peoples R China
[2] Southeast Univ, Jiangsu Prov Key Lab Sensor Network Technol, Wuxi 214135, Peoples R China
[3] Waseda Univ, Grad Sch IPS, Kitakyushu, Fukuoka 8080135, Japan
关键词
Text mining; Classification; Spam filtering; SVM; Clustering; Selective ensemble;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents an improved method of selective ensemble to filter the spam messages. The design adopts clustering based on the diversity between sub-classifiers to solve the problem of selection. To improve accuracy and stability, a conception of confidence weight is proposed to evaluate the reliability of selected sub-classifiers. The training model is created with small datasets as in the real situation. For practical usage, this method only uses 150 samples of user's file and executes bootstrapping between 50 and 70 times on them. Experiments validate the effectiveness of this method in handling the spam filtering problem.
引用
收藏
页码:743 / 747
页数:5
相关论文
共 50 条
  • [41] A Novel Method for Filtering Group Sending Short Message Spam
    He, Peizhou
    Wen, Xiangming
    Zheng, Wei
    ICHIT 2008: INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, PROCEEDINGS, 2008, : 60 - 65
  • [42] An Improved Selective Ensemble Learning Method for Highway Traffic Flow State Identification
    Wang, Zhanzhong
    Chu, Ruijuan
    Zhang, Minghang
    Wang, Xiaochao
    Luan, Siliang
    IEEE ACCESS, 2020, 8 : 212623 - 212634
  • [43] Improved Online Support Vector Machines Spam Filtering Using String Kernels
    Amayri, Ola
    Bouguila, Nizar
    PROGRESS IN PATTERN RECOGNITION, IMAGE ANALYSIS, COMPUTER VISION, AND APPLICATIONS, PROCEEDINGS, 2009, 5856 : 621 - 628
  • [44] Layout Based Spam Filtering
    Musat, Claudiu N.
    PROCEEDINGS OF WORLD ACADEMY OF SCIENCE, ENGINEERING AND TECHNOLOGY, VOL 12, 2006, 12 : 161 - 164
  • [45] Collective classification for spam filtering
    Laorden, Carlos
    Sanz, Borja
    Santos, Igor
    Galan-Garcia, Patxi
    Bringas, Pablo G.
    LOGIC JOURNAL OF THE IGPL, 2013, 21 (04) : 540 - 548
  • [46] A scalable spam filtering architecture
    Ferreira, Nuno
    Carvalho, Gracinda
    Pereira, Paulo Rogério
    IFIP Advances in Information and Communication Technology, 2013, 394 : 107 - 114
  • [47] Filtering Spam with Behavioral Blacklisting
    Ramachandran, Anirudh
    Feamster, Nick
    Vempala, Santosh
    CCS'07: PROCEEDINGS OF THE 14TH ACM CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2007, : 342 - 351
  • [48] Towards Proactive Spam Filtering
    Goebel, Jan
    Holz, Thorsten
    Trinius, Philipp
    DETECTION OF INTRUSIONS AND MALWARE, AND VULNERABILITY ASSESSMENT, PROCEEDINGS, 2009, 5587 : 38 - 47
  • [49] Term frequency combined hybrid feature selection method for spam filtering
    Liu, Yuanning
    Wang, Youwei
    Feng, Lizhou
    Zhu, Xiaodong
    PATTERN ANALYSIS AND APPLICATIONS, 2016, 19 (02) : 369 - 383
  • [50] A Scalable Spam Filtering Architecture
    Ferreira, Nuno
    Carvalho, Gracinda
    Pereira, Paulo Rogerio
    TECHNOLOGICAL INNOVATION FOR THE INTERNET OF THINGS, 2013, 394 : 107 - 114