Less naive Bayes spam detection

Cited by: 0
Authors
Yang, Hongming [1, 2]
Stassen, Maurice [3]
Tjalkens, Tjalling [1]
Affiliations
[1] Eindhoven Univ Technol, Dept EE, Rm PT 3-27,POB 513, NL-5600 MB Eindhoven, Netherlands
[2] CoSiNe Connectiv Syst & Networks, Eindhoven, Netherlands
[3] NXP Semicond, NL-5656 AE Eindhoven, Netherlands
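Keywords
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract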
We consider a binary classification problem with a feature vector of high dimensionality. Spam mail filters are a popular example hereof. A naive Bayes filter assumes conditional independence of the feature vector components. We use the context tree weighting method as an application of the minimum description length principle to allow for dependencies between the feature vector components. It turns out that, due to the limited amount of training data, we must assume conditional independence between groups of vector components. We consider several ad-hoc algorithms to find good groupings and good conditional models.
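The grouping idea in the abstract can be illustrated with a minimal sketch. This is not the authors' method: it replaces context tree weighting with plain additive smoothing over the joint configurations of each group, and the class name `GroupedNaiveBayes`, the `groups` argument, the `alpha` constant, and the toy data are all hypothetical. Features are assumed binary. Components inside a group are modelled jointly; groups are treated as conditionally independent given the class, so singleton groups reduce to an ordinary naive Bayes filter.

```python
from collections import Counter
from math import log


class GroupedNaiveBayes:
    """Binary classifier that assumes independence only between feature groups."""

    def __init__(self, groups, alpha=1.0):
        self.groups = [tuple(g) for g in groups]  # hypothetical grouping of feature indices
        self.alpha = alpha  # additive smoothing constant (an assumption, not from the paper)
        self.class_counts = Counter()
        # counts[label][group_index] maps a joint configuration to its frequency
        self.counts = {label: [Counter() for _ in self.groups] for label in (0, 1)}

    def fit(self, X, y):
        for x, label in zip(X, y):
            self.class_counts[label] += 1
            for gi, g in enumerate(self.groups):
                self.counts[label][gi][tuple(x[i] for i in g)] += 1
        return self

    def log_joint(self, x, label):
        """Smoothed log P(label) + sum over groups of log P(group config | label)."""
        n = self.class_counts[label]
        total = sum(self.class_counts.values())
        score = log((n + self.alpha) / (total + 2 * self.alpha))
        for gi, g in enumerate(self.groups):
            key = tuple(x[i] for i in g)
            n_configs = 2 ** len(g)  # number of joint values of a binary group
            score += log((self.counts[label][gi][key] + self.alpha)
                         / (n + self.alpha * n_configs))
        return score

    def predict(self, x):
        return 1 if self.log_joint(x, 1) > self.log_joint(x, 0) else 0


# Toy data with a dependency between the two components: label = x0 XOR x1.
X = [(0, 0), (0, 1), (1, 0), (1, 1)] * 5
y = [0, 1, 1, 0] * 5

naive = GroupedNaiveBayes(groups=[(0,), (1,)]).fit(X, y)  # ordinary naive Bayes
grouped = GroupedNaiveBayes(groups=[(0, 1)]).fit(X, y)    # one joint group

naive_preds = [naive.predict(x) for x in X[:4]]
grouped_preds = [grouped.predict(x) for x in X[:4]]
```

On this toy data the per-component marginals are identical for both classes, so the singleton grouping cannot separate them, while the single joint group can. The cost is that a group of k binary components has 2^k configurations to estimate, which is why limited training data forces small groups.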
Pages: 386 / +
Page count: 2
Related papers
50 in total
  • [21] A Support Vector Machine based Naive Bayes Algorithm for Spam Filtering
    Feng, Weimiao
    Sun, Jianguo
    Zhang, Liguo
    Cao, Cuiling
    Yang, Qing
    2016 IEEE 35TH INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE (IPCCC), 2016,
  • [22] REVISED NAIVE BAYES CLASSIFIER FOR COMBATING THE FOCUS ATTACK IN SPAM FILTERING
    Peng, Junyan
    Chan, Patrick P. K.
    PROCEEDINGS OF 2013 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS (ICMLC), VOLS 1-4, 2013, : 610 - 614
  • [23] Email Spam Classification using Neighbor Probability based Naive Bayes Algorithm
    Anitha, P. U.
    Rao, C. V. Guru
    Babu, Suresh
    2017 7TH INTERNATIONAL CONFERENCE ON COMMUNICATION SYSTEMS AND NETWORK TECHNOLOGIES (CSNT), 2017, : 350 - 355
  • [24] Combining Naive Bayes and Tri-gram Language Model for Spam Filtering
    Ma, Xi
    Shen, Yao
    Chen, Junbo
    Xue, Guirong
    KNOWLEDGE ENGINEERING AND MANAGEMENT, 2011, 123 : 509 - +
  • [25] Spam Filtering Using Hybrid Local-Global Naive Bayes Classifier
    Solanki, Rohit Kumar
    Verma, Karun
    Kumar, Ravinder
    2015 INTERNATIONAL CONFERENCE ON ADVANCES IN COMPUTING, COMMUNICATIONS AND INFORMATICS (ICACCI), 2015, : 829 - 833
  • [26] Analysis of Naive Bayes Algorithm for Email Spam Filtering across Multiple Datasets
    Rusland, Nurul Fitriah
    Wahid, Norfaradilla
    Kasim, Shahreen
    Hafit, Hanayanti
    INTERNATIONAL RESEARCH AND INNOVATION SUMMIT (IRIS2017), 2017, 226
  • [27] An evaluation of Naive Bayes variants in content-based learning for spam filtering
    Seewald, Alexander K.
    INTELLIGENT DATA ANALYSIS, 2007, 11 (05) : 497 - 524
  • [28] A comparative impact study of attribute selection techniques on Naive Bayes spam filters
    Mendez, J. R.
    Cid, I.
    Glez-Pena, D.
    Rocha, M.
    Fdez-Riverola, F.
    ADVANCES IN DATA MINING, PROCEEDINGS: MEDICAL APPLICATIONS, E-COMMERCE, MARKETING, AND THEORETICAL ASPECTS, 2008, 5077 : 213 - +
  • [29] Machine Learning-based spam detection using Naive Bayes Classifier in comparison with Logistic Regression for improving accuracy
    Kumar, K. Varun
    Ramamoorthy, M.
    JOURNAL OF PHARMACEUTICAL NEGATIVE RESULTS, 2022, 13 : 548 - 554
  • [30] Classification Spam Email with Elimination of Unsuitable Features with Hybrid of GA-Naive Bayes
    Ebadati, O. M. E.
    Ahmadzadeh, F.
    JOURNAL OF INFORMATION & KNOWLEDGE MANAGEMENT, 2019, 18 (01)