A Fuzzy Clustering Approach to Filter Spam E-Mail

被引:0
|
作者
Mohammad, N. T. [1 ]
机构
[1] Univ Jordan, Dept Comp Informat Syst, Amman, Jordan
关键词
Spam filtering; Fuzzy clustering; Fuzzy C-Means;
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Spam email, is the practice of frequently sending unwanted email messages, usually with commercial content, in large quantities to a set of indiscriminate email accounts. However, since spammers continuously improve their techniques in order to compromise the spam filters, building a spam filter that can be incrementally learned and adapted became an active research field. Researches employed machine learning techniques which have been widely used in solving similar problems like document classification and pattern recognition, such as Naive Bayesian, and Support Vector Machine. In this Paper, we examine the use of the fuzzy clustering algorithm (Fuzzy C-Means) to build a spam filter. The proposed use of the Fuzzy has been tested on different data set sizes collected from Spam assassin corpora by real user's emails. After testing Fuzzy C-Means using Heterogeneous Value Difference Metric with variable percentages of spam and using a standard model of assessment for the spam problem, we demonstrate the potential value of our approach.
引用
收藏
页码:1839 / 1844
页数:6
相关论文
共 50 条
  • [31] Overview of e-mail SPAM Elimination and its Efficiency
    Sochor, Tomas
    2014 IEEE EIGHTH INTERNATIONAL CONFERENCE ON RESEARCH CHALLENGES IN INFORMATION SCIENCE (RCIS), 2014,
  • [32] An improved Bayes algorithm for filtering spam e-mail
    Wang, Meizhen
    Li, Zhitang
    Wu, Hantao
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2009, 37 (08): : 27 - 30
  • [33] A Study on E-mail Image Spam Filtering Techniques
    Dhanaraj, S.
    Karthikeyani, V.
    2013 INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, INFORMATICS AND MEDICAL ENGINEERING (PRIME), 2013,
  • [34] A Novel Spam Classification System for E-Mail Using a Gradient Fuzzy Guideline-Based Spam Classifier (GFGSC)
    Subramaniam, Vinoth Narayanan Arumugam
    Annamalai, Rajesh
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2023, 20 (03) : 398 - 406
  • [35] Spam e-mail classification for the Internet of Things environment using semantic similarity approach
    S. Venkatraman
    B. Surendiran
    P. Arun Raj Kumar
    The Journal of Supercomputing, 2020, 76 : 756 - 776
  • [36] Spam e-mail classification for the Internet of Things environment using semantic similarity approach
    Venkatraman, S.
    Surendiran, B.
    Kumar, P. Arun Raj
    JOURNAL OF SUPERCOMPUTING, 2020, 76 (02): : 756 - 776
  • [37] E-mail about e-mail?
    Recine, L
    DATAMATION, 1996, 42 (13): : 7 - 8
  • [39] E-Mail Spam Detection Based on Part of Speech Tagging
    Parsaei, Mohammad Reza
    Salehi, Mohammad
    2015 2ND INTERNATIONAL CONFERENCE ON KNOWLEDGE-BASED ENGINEERING AND INNOVATION (KBEI), 2015, : 1010 - 1013
  • [40] Weight Problems and Spam E-mail for Weight Loss Products
    Fogel, Joshua
    Shlivko, Sam
    SOUTHERN MEDICAL JOURNAL, 2010, 103 (01) : 31 - 36