An Improved Selective Ensemble Method for Spam Filtering

被引:0
|
作者
Cai, Jinye [1 ,2 ]
Xu, Pingping [1 ,2 ]
Tang, Huiyu [3 ]
Sun, Lin [1 ,2 ]
机构
[1] Southeast Univ, Natl Mobile Commun Res Lab, Nanjing 210096, Jiangsu, Peoples R China
[2] Southeast Univ, Jiangsu Prov Key Lab Sensor Network Technol, Wuxi 214135, Peoples R China
[3] Waseda Univ, Grad Sch IPS, Kitakyushu, Fukuoka 8080135, Japan
关键词
Text mining; Classification; Spam filtering; SVM; Clustering; Selective ensemble;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents an improved method of selective ensemble to filter the spam messages. The design adopts clustering based on the diversity between sub-classifiers to solve the problem of selection. To improve accuracy and stability, a conception of confidence weight is proposed to evaluate the reliability of selected sub-classifiers. The training model is created with small datasets as in the real situation. For practical usage, this method only uses 150 samples of user's file and executes bootstrapping between 50 and 70 times on them. Experiments validate the effectiveness of this method in handling the spam filtering problem.
引用
收藏
页码:743 / 747
页数:5
相关论文
共 50 条
  • [31] An ensemble design approach based on bagging technique for filtering e-mail spam
    Roy S.S.
    Viswanatham V.M.
    Krishna P.V.
    Roy, Sanjiban Sekhar (s.roy@vit.ac.in), 1600, Inderscience Enterprises Ltd., 29, route de Pre-Bois, Case Postale 856, CH-1215 Geneva 15, CH-1215, Switzerland (10): : 247 - 260
  • [32] Adaptive filtering of SPAM
    Pelletier, L
    Almhana, J
    Choulakian, V
    SECOND ANNUAL CONFERENCE ON COMMUNICATION NETWORKS AND SERVICES RESEARCH, PROCEEDINGS, 2004, : 218 - 224
  • [33] Spam filtering scheme
    Wang, Jing (wngjing@hotmail.com), 1600, Northeast University (35):
  • [34] Spamcooling: A parallel heterogeneous ensemble spam filtering system based on active learning techniques
    Wang, Jinlong
    Gao, Ke
    Vu, Huy Quan
    Journal of Convergence Information Technology, 2010, 5 (04)
  • [35] Email Spam Filtering
    Puertas Sanz, Enrique
    Gomez Hidalgo, Jose Maria
    Cortizo Perez, Jose Carlos
    ADVANCES IN COMPUTERS, VOL 74: SOFTWARE DEVELOPMENT, 2008, 74 : 45 - 114
  • [36] Research in Anti-Spam Method Based on Bayesian Filtering
    Wu, Jiansheng
    Deng, Tao
    PACIIA: 2008 PACIFIC-ASIA WORKSHOP ON COMPUTATIONAL INTELLIGENCE AND INDUSTRIAL APPLICATION, VOLS 1-3, PROCEEDINGS, 2008, : 1838 - 1842
  • [37] A Method of Spam Filtering Based on Weighted Support Vector Machines
    Chen Xiao-li
    Liu Pei-yu
    Zhu Zhen-fang
    Qiu Ye
    2009 IEEE INTERNATIONAL SYMPOSIUM ON IT IN MEDICINE & EDUCATION, VOLS 1 AND 2, PROCEEDINGS, 2009, : 947 - 950
  • [38] Word Embedding Method of SMS Messages for Spam Message Filtering
    Lee, Hyun-Young
    Kang, Seung-Shik
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2019, : 652 - 655
  • [39] A Spam Filtering Method Based on Multi-Modal Fusion
    Yang, Hong
    Liu, Qihe
    Zhou, Shijie
    Luo, Yang
    APPLIED SCIENCES-BASEL, 2019, 9 (06):
  • [40] A spam filtering method learning from Web browsing behavior
    Takashita, Taiki
    Itokawa, Tsuyoshi
    Kitasuka, Teruaki
    Aritsugi, Masayoshi
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 2, PROCEEDINGS, 2008, 5178 : 774 - 781