An Improved Selective Ensemble Method for Spam Filtering

Cited by: 0
Authors
Cai, Jinye [1, 2]
Xu, Pingping [1, 2]
Tang, Huiyu [3]
Sun, Lin [1, 2]
Affiliations
[1] Southeast Univ, Natl Mobile Commun Res Lab, Nanjing 210096, Jiangsu, Peoples R China
[2] Southeast Univ, Jiangsu Prov Key Lab Sensor Network Technol, Wuxi 214135, Peoples R China
[3] Waseda Univ, Grad Sch IPS, Kitakyushu, Fukuoka 8080135, Japan
Keywords
Text mining; Classification; Spam filtering; SVM; Clustering; Selective ensemble
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper presents an improved selective ensemble method for filtering spam messages. The design clusters the sub-classifiers according to the diversity between them in order to solve the selection problem. To improve accuracy and stability, a confidence weight is proposed to evaluate the reliability of the selected sub-classifiers. The training model is built from small datasets, as in real-world use. For practicality, the method uses only 150 samples from a user's files and bootstraps them between 50 and 70 times. Experiments validate the effectiveness of the method for the spam filtering problem.
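The pipeline outlined in the abstract (bootstrap replicates, diversity-based clustering of sub-classifiers, selection, confidence-weighted voting) can be illustrated with a minimal Python sketch. The abstract does not state the diversity measure, the clustering algorithm, or the confidence weight formula, so everything below is an assumption made for illustration: diversity is taken as the pairwise disagreement rate on a validation set, clustering is a simple greedy leader scheme with a hypothetical `threshold`, one sub-classifier is kept per cluster, and the confidence weight is set to validation accuracy. Training of the sub-classifiers themselves (SVM, per the keywords) is left out; the sketch starts from their 0/1 predictions.

```python
import random


def bootstrap_indices(n_samples, rng):
    """Draw one bootstrap replicate: n_samples indices sampled with replacement."""
    return [rng.randrange(n_samples) for _ in range(n_samples)]


def disagreement(preds_a, preds_b):
    """Assumed diversity measure: fraction of messages where two sub-classifiers differ."""
    return sum(a != b for a, b in zip(preds_a, preds_b)) / len(preds_a)


def accuracy(preds, labels):
    """Fraction of validation messages classified correctly."""
    return sum(p == y for p, y in zip(preds, labels)) / len(labels)


def cluster_by_diversity(all_preds, threshold):
    """Greedy leader clustering (an assumption, not the paper's algorithm).

    A sub-classifier joins the first cluster whose leader (first member)
    disagrees with it on less than `threshold` of the messages; otherwise
    it starts a new cluster.
    """
    clusters = []
    for i, preds in enumerate(all_preds):
        for cluster in clusters:
            if disagreement(preds, all_preds[cluster[0]]) < threshold:
                cluster.append(i)
                break
        else:
            clusters.append([i])
    return clusters


def select_and_weight(all_preds, labels, threshold=0.2):
    """Keep the most accurate member of each cluster.

    Returns {sub-classifier index: confidence weight}, where the weight is
    assumed here to be the validation accuracy.
    """
    selected = {}
    for cluster in cluster_by_diversity(all_preds, threshold):
        best = max(cluster, key=lambda i: accuracy(all_preds[i], labels))
        selected[best] = accuracy(all_preds[best], labels)
    return selected


def weighted_vote(votes, weights):
    """Return 1 (spam) if the weighted share of spam votes exceeds one half."""
    total = sum(weights.values())
    spam = sum(w for i, w in weights.items() if votes[i] == 1)
    return 1 if spam > total / 2 else 0


# Toy validation data: 1 = spam, 0 = legitimate (made up for illustration).
labels = [1, 1, 0, 0, 1, 0]
all_preds = [
    [1, 1, 0, 0, 1, 0],  # agrees with every label
    [1, 1, 0, 0, 1, 1],  # near-duplicate of the first
    [0, 1, 1, 0, 1, 0],  # makes different mistakes
    [0, 0, 1, 1, 0, 1],  # wrong on every message
]
replicate = bootstrap_indices(150, random.Random(0))  # one of the 50 to 70 replicates
weights = select_and_weight(all_preds, labels)
```

In this toy run the first two sub-classifiers fall into one cluster, so only the more accurate of the pair is kept, and the sub-classifier that is wrong on every message receives a weight of zero and has no influence on the vote. A real implementation would need the paper's own definitions of diversity and confidence weight in place of these stand-ins.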
Pages: 743-747
Page count: 5
Related papers
50 in total
  • [1] Clustering Ensemble for Spam Filtering
    Porras, Santiago
    Baruque, Bruno
    Vaquerizo, Belen
    Corchado, Emilio
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS, PART II, 2011, 6679 : 363 - +
  • [2] Improved retry patterns method for the mail spam filtering
    Yun-wei, Li
    Hong-xue, Yang
    Lei, Ma
    Ying, Chang
    Jing, Wang
    International Journal of Applied Mathematics and Statistics, 2013, 46 (16): 454-460
  • [3] Spam filtering based on classifiers ensemble
    Yang, Zhen
    Fan, Ke-Feng
    Lei, Jian-Jun
    Lai, Ying-Xu
    Tongxin Xuebao/Journal on Communication, 2008, 29 (SUPPL.): 7-11
  • [4] Spam Filtering Based on Improved CHI Feature Selection Method
    Lu, Zhimao
    Yu, Hongxia
    Fan, Dongmei
    Yuan, Chaoyue
    PROCEEDINGS OF THE 2009 CHINESE CONFERENCE ON PATTERN RECOGNITION AND THE FIRST CJK JOINT WORKSHOP ON PATTERN RECOGNITION, VOLS 1 AND 2, 2009: 771-773
  • [5] An Improved Multiple Features Fusion Method for Image Spam Filtering
    Yuan, Saijie
    Zhang, Chongyang
    2016 3RD INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND CONTROL ENGINEERING (ICISCE), 2016: 200-203
  • [6] Structured ensemble learning for email spam filtering
    Liu, W. (wyliu@nudt.edu.cn), 2012, Science Press (49):
  • [7] Study on Ensemble Classification Methods towards Spam Filtering
    Wang, Jinlong
    Gao, Ke
    Jiao, Yana
    Li, Gang
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2009, 5678 : 314 - +
  • [8] The Improved Logistic Regression Models for Spam Filtering
    Han, Yong
    Yang, Muyun
    Qi, Haoliang
    He, Xiaoning
    Li, Sheng
    2009 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, 2009: 314-317
  • [9] A Novel Method for Image Spam Filtering
    Huang, Hailing
    Guo, Weiqiang
    Zhang, Yu
    PROCEEDINGS OF THE 9TH INTERNATIONAL CONFERENCE FOR YOUNG COMPUTER SCIENTISTS, VOLS 1-5, 2008: 826-830
  • [10] An imbalanced spam mail filtering method
    Ma, Zhiqiang
    Yan, Rui
    Yuan, Dongliong
    Liu, Limin
    International Journal of Multimedia and Ubiquitous Engineering, 2015, 10 (03): 119-126