Spam Detection Using Clustering-Based SVM

被引:0
|
作者
Pandya, Darshit [1 ]
机构
[1] Indus Univ, Dept Comp Engn, Ahmadabad 382115, Gujarat, India
关键词
Text Classification; SVM; Clustering;
D O I
10.1145/3366750.3366754
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spam detection task is of much more importance than earlier due to the increase in the use of messaging and mailing services. Efficient classification in such a variety of messages is a comparatively onerous task. There are a variety of machine learning algorithms used for spam detection, one of which is Support Vector Machine, also known as SVM. SVM is widely used to classify text-based documents. Though SVM is a widely used technique in document classification, its performance in the spam classification is not the best due to the uneven density of the training data. In order to improve the efficiency of SVM, I introduce a clustering-based SVM method. The training data is pre-processed using clustering algorithms and then the SVM classifier is implemented on the processed dataset. This method would increase the performance by overcoming the problem of uneven distribution of training data. The experimental results show that the performance is improved compared to that of SVM.
引用
收藏
页码:12 / 15
页数:4
相关论文
共 50 条
  • [31] Efficient SVM-based Hotspot Detection using Spectral Clustering
    Yang, Fan
    Chiang, Charles C.
    Zeng, Xuan
    Zhou, Dian
    2017 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS), 2017, : 702 - 705
  • [32] SVM classifier incorporating feature selection using GA for spam detection
    Wang, HB
    Yu, Y
    Liu, Z
    EMBEDDED AND UBIQUITOUS COMPUTING - EUC 2005, 2005, 3824 : 1147 - 1154
  • [33] Clustering-Based Detection of Debye-Scherrer Rings
    Sirhindi, Rabia
    Khan, Nazar
    JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2023, 23 (04)
  • [35] Clustering-based attack detection for adversarial reinforcement learning
    Rubén Majadas
    Javier García
    Fernando Fernández
    Applied Intelligence, 2024, 54 : 2631 - 2647
  • [36] Clustering-Based Recommendation System for Preliminary Disease Detection
    Jain, Gourav
    Mahara, Tripti
    Sharma, S. C.
    Verma, Om Prakash
    Sharma, Tarun
    INTERNATIONAL JOURNAL OF E-HEALTH AND MEDICAL COMMUNICATIONS, 2022, 13 (04)
  • [37] An improved unsupervised clustering-based intrusion detection method
    Hai, YJ
    Wu, Y
    Wang, GY
    Data Mining, Intrusion Detection, Information Assurance, and Data Networks Security 2005, 2005, 5812 : 52 - 60
  • [38] A Mixed Unsupervised Clustering-based Intrusion Detection Model
    Zhang, Cuixiao
    Zhang, Guobing
    Sun, Shanshan
    THIRD INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING, 2009, : 426 - 428
  • [39] Clustering-based Novelty Detection to Uncover Electricity Theft
    Viegas, Joaquim L.
    Vieira, Susana M.
    2017 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2017,
  • [40] A Clustering-Based Unsupervised Approach to Anomaly Intrusion Detection
    Nikolova, Evgeniya
    Jecheva, Veselina
    PROCEEDINGS OF THE 2ND INTERNATIONAL SYMPOSIUM ON COMPUTER, COMMUNICATION, CONTROL AND AUTOMATION, 2013, 68 : 202 - 205