Email Filtering based on Supervised Learning and Mutual Information Feature Selection

被引:0
|
作者
Gad, Walaa [1 ]
Rady, Sherine [1 ]
机构
[1] Ain Shams Univ, Dept Informat Syst, Fac Comp & Informat Sci, Cairo, Egypt
关键词
email filtering; supervised learning; classification; mutual information; feature selection;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Electronic mail is one of today's most important ways to communicate and transfer information. Because of fast delivery and easy to access, it is used almost in every aspect of communication in work and life. However, the increase in email users has resulted in a dramatic increase in spam emails during the past few years. In this paper, we propose an email-filtering approach that is based on supervised classifier and mutual information. The proposed model has the advantage of combining machine supervised learning with feature selection. Term frequency (TF) is presented to assign relevance weights to words of each email class. We conduct experiments to compare between six different classifiers. Results show that the proposed approach has high performance in terms of precision, recall and accuracy performance measures.
引用
收藏
页码:147 / 152
页数:6
相关论文
共 50 条
  • [21] A review of feature selection methods based on mutual information
    Jorge R. Vergara
    Pablo A. Estévez
    Neural Computing and Applications, 2014, 24 : 175 - 186
  • [22] An Improved Feature Selection for Categorization Based on Mutual Information
    Liu, Haifeng
    Su, Zhan
    Yao, Zeqing
    Liu, Shousheng
    WEB INFORMATION SYSTEMS AND MINING, PROCEEDINGS, 2009, 5854 : 80 - 87
  • [23] A Powerful Feature Selection approach based on Mutual Information
    El Akadi, Ali
    El Ouardighi, Abdeljalil
    Aboutajdine, Driss
    INTERNATIONAL JOURNAL OF COMPUTER SCIENCE AND NETWORK SECURITY, 2008, 8 (04): : 116 - 121
  • [24] Feature selection using a mutual information based measure
    Al-Ani, A
    Deriche, M
    16TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITON, VOL IV, PROCEEDINGS, 2002, : 82 - 85
  • [25] Feature selection based on mutual information with correlation coefficient
    Hongfang Zhou
    Xiqian Wang
    Rourou Zhu
    Applied Intelligence, 2022, 52 : 5457 - 5474
  • [26] FEATURE SELECTION BASED ON STATISTICAL ESTIMATION OF MUTUAL INFORMATION
    Kozhevin, A. A.
    SIBERIAN ELECTRONIC MATHEMATICAL REPORTS-SIBIRSKIE ELEKTRONNYE MATEMATICHESKIE IZVESTIYA, 2021, 18 : 720 - 728
  • [27] Mutual information-based feature selection for radiomics
    Oubel, Estanislao
    Beaumont, Hubert
    Iannessi, Antoine
    MEDICAL IMAGING 2016: PACS AND IMAGING INFORMATICS: NEXT GENERATION AND INNOVATIONS, 2016, 9789
  • [28] A review of feature selection methods based on mutual information
    Vergara, Jorge R.
    Estevez, Pablo A.
    NEURAL COMPUTING & APPLICATIONS, 2014, 24 (01): : 175 - 186
  • [29] Speeding up feature subset selection through mutual information relevance filtering
    Van Dijck, Gert
    Van Hulle, Marc M.
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2007, PROCEEDINGS, 2007, 4702 : 277 - +
  • [30] Early Stopping for Mutual Information Based Feature Selection
    Beinrucker, Andre
    Dogan, Ueruen
    Blanchard, Gilles
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 975 - 978