A Bi-Level Text Classification Approach for SMS Spam Filtering and Identifying Priority Messages

被引:0
|
作者
Nagwani, Naresh Kumar [1 ]
机构
[1] Natl Inst Technol Raipur, Dept Comp Sci & Engn, Raipur, Chhattisgarh, India
关键词
SMS spam; priority sms; important sms; sms spam filtering; bi-level binary classification;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Short Message Service (SMS) traffic is increasing day by day and trillions of sms are sent and received by billions of users every day. Spam messages are also increasing in same proportionate. Numbers of recent advancements are taking place. in the field of sms spam detection and filtering. The objective of this work is twofold, first is to identify and classify spam messages from the collection of sms messages and second is to identify the priority or important sms messages from the filtered non-spam messages. The objective of the work is to categorize the sms messages for effective management and handling of sms messages. the work is planned in two level of binary classification wherein at the first level of classification the sms messages are categorized into the two classes spam and non-spam using popular binary classifiers, and then at the second level of classification non-spam sms messages are further categorized into the priority and normal sms messages. four state of the art popular text classification techniques namely, Naive Bayes (NB), Support Vector Machine (SVM), Latent Dirichlet Allocation (LDA) and Non-negative Matrix Factorization (NMF) are used to categorize the sms text message at different levels of classification. The proposed bi-level classification model is also evaluated using the performance measures accuracy and f measure. Combinations of classifiers at both levels are compared and it is shown from the experiments that SVM algorithm performs better for filtering the spam messages and categorizing the priority messages.
引用
收藏
页码:473 / 480
页数:8
相关论文
共 50 条
  • [1] SMS spam filtering and thread identification using bi-level text classification and clustering techniques
    Nagwani, Naresh Kumar
    Sharaff, Aakanksha
    JOURNAL OF INFORMATION SCIENCE, 2017, 43 (01) : 75 - 87
  • [2] SMS Spam Filtering based on Text Classification and Expert System
    Bozan, Yavuz Selim
    Coban, Onder
    Ozyer, Gulsah Tumuklu
    Ozyer, Baris
    2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 2345 - 2348
  • [3] Index-based Online Text Classification for SMS Spam Filtering
    Liu, Wuying
    Wang, Ting
    JOURNAL OF COMPUTERS, 2010, 5 (06) : 844 - 851
  • [4] Word Embedding Method of SMS Messages for Spam Message Filtering
    Lee, Hyun-Young
    Kang, Seung-Shik
    2019 IEEE INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2019, : 652 - 655
  • [5] Identifying Gender From SMS Text Messages
    Silessi, Shannon
    Varol, Cihan
    Karabatak, Murat
    2016 15TH IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND APPLICATIONS (ICMLA 2016), 2016, : 488 - 491
  • [6] A novel approach to filtering SMS messages
    McClement, KN
    Kumar, V
    5th World Wireless Congress, Proceedings, 2004, : 67 - 71
  • [7] An Effective and Economic Bi-level Approach to Ranking and Rating Spam Detection
    Xie, Sihong
    Hu, Qingbo
    Zhang, Jingyuan
    Yu, Philip S.
    PROCEEDINGS OF THE 2015 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (IEEE DSAA 2015), 2015, : 123 - 132
  • [8] Towards Filtering of SMS Spam Messages Using Machine Learning Based Technique
    Choudhary, Neelam
    Jain, Ankit Kumar
    ADVANCED INFORMATICS FOR COMPUTING RESEARCH, ICAICR 2017, 2017, 712 : 18 - 30
  • [9] Filtering Spam Text Messages by Using Twitter-LDA Algorithm
    Gunawan, Dani
    Rahmat, Romi Fadillah
    Putra, Arsandi
    Pasha, Muhammad Fermi
    2018 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION, NETWORKS AND SATELLITE (COMNETSAT), 2018, : 1 - 6
  • [10] Content-based Approach for Vietnamese Spam SMS Filtering
    Pham, Thai-Hoang
    Le-Hong, Phuong
    PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2016, : 41 - 44