Spam Detection Using Clustering-Based SVM

Cited by: 0
Author
Pandya, Darshit [1]
Affiliation
[1] Indus Univ, Dept Comp Engn, Ahmadabad 382115, Gujarat, India
Keywords
Text Classification; SVM; Clustering
DOI
10.1145/3366750.3366754
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Spam detection matters far more now than it once did because messaging and mailing services are used much more widely. Classifying such a wide variety of messages efficiently is comparatively difficult. Many machine learning algorithms are used for spam detection, one of which is the Support Vector Machine (SVM). Although SVM is widely used to classify text documents, its performance on spam classification is not the best because the density of the training data is uneven. To improve the efficiency of SVM, I introduce a clustering-based SVM method: the training data is first pre-processed with clustering algorithms, and the SVM classifier is then trained on the processed dataset. The method is intended to improve performance by overcoming the uneven distribution of the training data. The experimental results show that performance improves compared with standard SVM.
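The abstract does not say which clustering algorithm is used or how the clustered data is fed to the SVM. As a rough illustration only, the sketch below assumes one possible reading: each class is summarised by k-means centroids, so the dense and sparse classes contribute the same number of training points, and a linear SVM is then fitted to those centroids with a Pegasos-style subgradient method. The two numeric features, the toy data, and all function names are hypothetical and are not taken from the paper.

```python
import random

def kmeans(points, k, iters=25, seed=0):
    """Lloyd's k-means on lists of floats; returns k centroids."""
    rng = random.Random(seed)
    centroids = [list(p) for p in rng.sample(points, k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for p in points:
            dists = [sum((a - b) ** 2 for a, b in zip(p, c)) for c in centroids]
            groups[dists.index(min(dists))].append(p)
        for j, members in enumerate(groups):
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return centroids

def reduce_by_clustering(X, y, k):
    """Assumed pre-processing: replace each class by k centroids."""
    X_red, y_red = [], []
    for label in (-1, 1):
        members = [x for x, t in zip(X, y) if t == label]
        for c in kmeans(members, k):
            X_red.append(c)
            y_red.append(label)
    return X_red, y_red

def train_linear_svm(X, y, lam=0.01, epochs=500):
    """Pegasos-style subgradient descent on the L2-regularised hinge loss."""
    data = [x + [1.0] for x in X]  # constant feature acts as a (regularised) bias
    w = [0.0] * len(data[0])
    t = 0
    for _ in range(epochs):
        for x, label in zip(data, y):
            t += 1
            eta = 1.0 / (lam * t)  # decaying step size
            margin = label * sum(wi * xi for wi, xi in zip(w, x))
            w = [(1.0 - eta * lam) * wi for wi in w]  # shrink (regulariser)
            if margin < 1.0:  # hinge loss is active for this point
                w = [wi + eta * label * xi for wi, xi in zip(w, x)]
    return w

def predict(w, x):
    """Return +1 (spam) or -1 (ham) for a feature list x."""
    score = sum(wi * xi for wi, xi in zip(w, x + [1.0]))
    return 1 if score >= 0 else -1

# Hypothetical toy data: [links, trigger_words]; ham is dense, spam is sparse.
ham = [[0.0, 1.0], [1.0, 0.0], [0.0, 0.0], [1.0, 1.0]] * 3
spam = [[5.0, 6.0], [6.0, 5.0], [7.0, 7.0]]
X = ham + spam
y = [-1] * len(ham) + [1] * len(spam)

X_red, y_red = reduce_by_clustering(X, y, k=2)  # 12 + 3 points become 2 + 2
w = train_linear_svm(X_red, y_red)
print(predict(w, [6.0, 6.0]), predict(w, [0.5, 0.5]))
```

In this sketch, training on centroids rather than raw messages is what evens out the class densities. A real pipeline would start from text features such as TF-IDF vectors, and the paper's actual procedure may differ.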
Pages: 12 - 15
Number of pages: 4
Related papers
50 in total
  • [21] Clustering-based Anomaly Detection for Smartphone Applications
    El Attar, Ali
    Khatoun, Rida
    Lemercier, Marc
    2014 IEEE NETWORK OPERATIONS AND MANAGEMENT SYMPOSIUM (NOMS), 2014
  • [22] Clustering-Based Network Intrusion Detection System
    Fan, Chun-I
    Lai, Yen-Lin
    Shie, Cheng-Han
    2022 5TH IEEE CONFERENCE ON DEPENDABLE AND SECURE COMPUTING (IEEE DSC 2022), 2022
  • [23] Clustering-Based Discriminant Analysis for Eye Detection
    Chen, Shuo
    Liu, Chengjun
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2014, 23 (04) : 1629 - 1638
  • [24] Data clustering-based fault detection in WSNs
    Yang, Yang
    Liu, Qian
    Gao, Zhipeng
    Qiu, Xuesong
    Rui, Lanlan
    2015 SEVENTH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTATIONAL INTELLIGENCE (ICACI), 2015, : 334 - 339
  • [25] The Adaptive SPAM Mail Detection System using Clustering based on Text Mining
    Hong, Sung-Sam
    Kong, Jong-Hwan
    Han, Myung-Mook
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2014, 8 (06) : 2186 - 2196
  • [26] Dynamic classifier selection using clustering for spam detection
    Saeedian, Mehrnoush Famil
    Beigy, Hamid
    2009 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, 2009, : 84 - 88
  • [27] Spectral clustering-based community detection using graph distance and node attributes
    Tang, Fengqin
    Wang, Chunning
    Su, Jinxia
    Wang, Yuanyuan
    COMPUTATIONAL STATISTICS, 2020, 35 (01) : 69 - 94
  • [28] Detection of Random Body Movements Using Clustering-Based Methods in Bioradar Systems
    Rouco, Andre
    Silva, Filipe
    Soares, Beatriz
    Albuquerque, Daniel
    Gouveia, Carolina
    Bras, Susana
    Pinho, Pedro
    INFORMATION, 2024, 15 (10)
  • [29] Spectral clustering-based community detection using graph distance and node attributes
    Fengqin Tang
    Chunning Wang
    Jinxia Su
    Yuanyuan Wang
    Computational Statistics, 2020, 35 : 69 - 94
  • [30] Header Based Email Spam Detection Framework Using Support Vector Machine (SVM) Technique
    Khamis, Siti Aqilah
    Foozy, Cik Feresa Mohd
    Aziz, Mohd Firdaus Ab
    Rahim, Nordiana
    RECENT ADVANCES ON SOFT COMPUTING AND DATA MINING (SCDM 2020), 2020, 978 : 57 - 65