Email Filtering based on Supervised Learning and Mutual Information Feature Selection

被引:0
|
作者
Gad, Walaa [1 ]
Rady, Sherine [1 ]
机构
[1] Ain Shams Univ, Dept Informat Syst, Fac Comp & Informat Sci, Cairo, Egypt
关键词
email filtering; supervised learning; classification; mutual information; feature selection;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Electronic mail is one of today's most important ways to communicate and transfer information. Because of fast delivery and easy to access, it is used almost in every aspect of communication in work and life. However, the increase in email users has resulted in a dramatic increase in spam emails during the past few years. In this paper, we propose an email-filtering approach that is based on supervised classifier and mutual information. The proposed model has the advantage of combining machine supervised learning with feature selection. Term frequency (TF) is presented to assign relevance weights to words of each email class. We conduct experiments to compare between six different classifiers. Results show that the proposed approach has high performance in terms of precision, recall and accuracy performance measures.
引用
收藏
页码:147 / 152
页数:6
相关论文
共 50 条
  • [41] Feature Selection based on Mutual Information for Machine learning prediction of Petroleum reservoir properties
    Sulaiman, Muhammad Aliyu
    Labadin, Jane
    2015 9TH INTERNATIONAL CONFERENCE ON IT IN ASIA (CITA), 2015,
  • [42] Dynamic mutual information-based feature selection for multi-label learning
    Kim, Kyung-Jun
    Jun, Chi-Hyuck
    INTELLIGENT DATA ANALYSIS, 2023, 27 (04) : 891 - 909
  • [43] Supervised feature extraction for tensor objects based on maximization of mutual information
    Jukic, Ante
    Filipovic, Marko
    PATTERN RECOGNITION LETTERS, 2013, 34 (13) : 1476 - 1484
  • [44] A Gaussian mixture based maximization of mutual information for supervised feature extraction
    Leiva-Murillo, JM
    Artés-Rodríguez, A
    INDEPENDENT COMPONENT ANALYSIS AND BLIND SIGNAL SEPARATION, 2004, 3195 : 271 - 278
  • [45] High-dimensional supervised feature selection via optimized kernel mutual information
    Bi, Ning
    Tan, Jun
    Lai, Jian-Huang
    Suen, Ching Y.
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 108 : 81 - 95
  • [46] Weighted Mutual Information for Feature Selection
    Schaffernicht, Erik
    Gross, Horst-Michael
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2011, PT II, 2011, 6792 : 181 - 188
  • [47] Quadratic Mutual Information Feature Selection
    Sluga, Davor
    Lotric, Uros
    ENTROPY, 2017, 19 (04)
  • [48] Mutual Information Criteria for Feature Selection
    Zhang, Zhihong
    Hancock, Edwin R.
    SIMILARITY-BASED PATTERN RECOGNITION: FIRST INTERNATIONAL WORKSHOP, SIMBAD 2011, 2011, 7005 : 235 - 249
  • [49] Normalized Mutual Information Feature Selection
    Estevez, Pablo. A.
    Tesmer, Michel
    Perez, Claudio A.
    Zurada, Jacek A.
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2009, 20 (02): : 189 - 201
  • [50] Mutual Information Criteria for Feature Selection
    Zhang, Zhihong
    Hancock, Edwin R.
    SIMILARITY-BASED PATTERN RECOGNITION, 2011, 7005 : 235 - 249