Cost-sensitive three-way email spam filtering

被引:0
|
作者
Bing Zhou
Yiyu Yao
Jigang Luo
机构
[1] Sam Houston State University,Department of Computer Science
[2] University of Regina,Department of Computer Science
关键词
Email spam filtering; Cost-sensitive learning; Ternary classification; Three-way decision; Naive Bayes classifier;
D O I
暂无
中图分类号
学科分类号
摘要
Email spam filtering is typically treated as a binary classification problem that can be solved by machine learning algorithms. We argue that a three-way decision approach provides a more meaningful way to users for precautionary handling their incoming emails. Three email folders instead of two are produced in a three-way spam filtering system, a suspected folder is added to allow users make further examinations of suspicious emails, thereby reducing the chances of misclassification. Different from existing ternary email spam filtering systems, we focus on two issues that are less studied, that is, the computation of required thresholds to define the three email categories, and the interpretation of the cost-sensitive characteristics of spam filtering. Instead of supplying the thresholds based on intuitive understandings of the levels of tolerance for errors, we systematically calculate the thresholds based on decision-theoretic rough set model. A loss function is interpreted as the costs of making classification decisions. A decision is made for which the overall cost is minimum. Experimental results show that the new approach reduces the error rate of misclassifying a legitimate email to spam and demonstrates a better performance for the cost-sensitivity aspect.
引用
收藏
页码:19 / 45
页数:26
相关论文
共 50 条
  • [1] Cost-sensitive three-way email spam filtering
    Zhou, Bing
    Yao, Yiyu
    Luo, Jigang
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2014, 42 (01) : 19 - 45
  • [2] A Three-Way Decision Approach to Email Spam Filtering
    Zhou, Bing
    Yao, Yiyu
    Luo, Jigang
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2010, 6085 : 28 - 39
  • [3] Multistage Email Spam Filtering Based on Three-Way Decisions
    Li, Jianlin
    Deng, Xiaofei
    Yao, Yiyu
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY: 8TH INTERNATIONAL CONFERENCE, 2013, 8171 : 313 - 324
  • [4] Three-way Email Spam Filtering with Game-theoretic Rough Sets
    Zhang, Yan
    Liu, PengFei
    Yao, JingTao
    2019 INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKING AND COMMUNICATIONS (ICNC), 2019, : 552 - 556
  • [5] Cost-Sensitive Three-Way Decision: A Sequential Strategy
    Li, Huaxiong
    Zhou, Xianzhong
    Huang, Bing
    Liu, Dun
    ROUGH SETS AND KNOWLEDGE TECHNOLOGY: 8TH INTERNATIONAL CONFERENCE, 2013, 8171 : 325 - 337
  • [6] Cost-Sensitive Sequential Three-Way Decision for Face Recognition
    Zhang, Libo
    Li, Huaxiong
    Zhou, Xianzhong
    Huang, Bing
    Shang, Lin
    ROUGH SETS AND INTELLIGENT SYSTEMS PARADIGMS, RSEISP 2014, 2014, 8537 : 375 - 383
  • [7] Cost-Sensitive Three-Way Decisions Model Based on CCA
    Zhang, Yanping
    Zou, Huijin
    Chen, Xi
    Wang, Xiangyang
    Tang, Xuqing
    Zhao, Shu
    ROUGH SETS AND CURRENT TRENDS IN SOFT COMPUTING, RSCTC 2014, 2014, 8536 : 172 - 180
  • [8] Cost-sensitive approximate attribute reduction with three-way decisions
    Fang, Yu
    Min, Fan
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2019, 104 : 148 - 165
  • [9] Design and implement cost-sensitive email filtering algorithms
    Li, WB
    Liu, CN
    Chen, YY
    ARTIFICIAL INTELLIGENCE APPLICATIONS AND INNOVATIONS II, 2005, 187 : 325 - 334
  • [10] Cost-sensitive three-way class-specific attribute reduction
    Ma, Xi-Ao
    Zhao, Xue Rong
    INTERNATIONAL JOURNAL OF APPROXIMATE REASONING, 2019, 105 : 153 - 174