A Comparative Approach to Email Classification Using Naive Bayes Classifier and Hidden Markov Model

被引:0
|
作者
Gomes, Sebastian Romv [1 ]
Saroar, Sk Golam [1 ]
Telot, Md Mosfaiul Alam [1 ]
Khan, Behroz Newaz [1 ]
Chakrabarty, Amitabha [1 ]
Mostakim, Moin [1 ]
机构
[1] BRAC Univ, Dept Comp Sci & Engn, 66 Bir Uttam AK Khandakar Rd, Dhaka 1212, Bangladesh
关键词
Email Classification; Hidden Markov Model; Naive Bayes; Natural Language Processing; NLTK; Supervised Learning;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This research investigates a comparison between two different approaches for classifying emails based on their categories. Naive Bayes and Hidden Markov Model (HMM), two different machine learning algorithms, both have been used for detecting whether an email is important or spam. Naive Bayes Classifier is based on conditional probabilities. It is fast and works great with small dataset. It considers independent words as a feature. HMM is a generative, probabilistic model that provides us with distribution over the sequences of observations. HMMs can handle inputs of variable length and help programs come to the most likely decision, based on both previous decisions and current data. Various combinations of NLP techniques-stopwords removing, stemming, lemmatizing have been tried on both the algorithms to inspect the differences in accuracy as well as to find the best method among them.
引用
收藏
页码:482 / 487
页数:6
相关论文
共 50 条
  • [31] A hidden Markov model fingerprint classifier
    Senior, A
    THIRTY-FIRST ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, VOLS 1 AND 2, 1998, : 306 - 310
  • [32] A Collaborative Filtering Approach Based on Naive Bayes Classifier
    Valdiviezo-Diaz, Priscila
    Ortega, Fernando
    Cobos, Eduardo
    Lara-Cabrera, Raul
    IEEE ACCESS, 2019, 7 : 108581 - 108592
  • [33] Modifled Naive Bayes Classifier for e-catalog classification
    Kim, Young-Gon
    Lee, Taehee
    Chun, Jonghoon
    Lee, Sang-goo
    DATA ENGINEERING ISSUES IN E-COMMERCE AND SERVICES, PROCEEDINGS, 2006, 4055 : 246 - 257
  • [34] Integrated Hidden Markov Model and Bayes Packet classifier for effective mitigation of application DDOS attacks
    Prabha, S.
    Anitha, R.
    International Journal of Computer Science Issues, 2011, 8 (4 4-2): : 587 - 597
  • [35] Laplace Naive Bayes classifier in the classification of text in machine learning
    Kalcheva, Neli
    Nikolov, Nedyalko
    PROCEEDINGS OF THE 2020 INTERNATIONAL CONFERENCE ON BIOMEDICAL INNOVATIONS AND APPLICATIONS (BIA 2020), 2020, : 18 - 20
  • [36] An Approach to Improving Quality of Crawlers Using Naive Bayes for Classifier and Hyperlink Filter
    Huu-Thien-Tan Nguyen
    Duy-Khanh Le
    COMPUTATIONAL COLLECTIVE INTELLIGENCE - TECHNOLOGIES AND APPLICATIONS, PT I, 2012, 7653 : 525 - 535
  • [37] Non stationary Signal Analysis and Classification using FTT Transform and Naive Bayes classifier
    Khan, Md Rizwan
    Padhi, S. K.
    Sahu, B. N.
    Behera, S.
    2015 IEEE POWER, COMMUNICATION AND INFORMATION TECHNOLOGY CONFERENCE (PCITC-2015), 2015, : 967 - 972
  • [38] International Reputable Journal Classification Using Inter-correlated Naive Bayes Classifier
    Adiperkasa, Risky Perdana
    Wibawa, Aji Prasetya
    Zaeni, Ilham Ari Elbaith
    Widiyaningtyas, Triyanna
    2019 2ND INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATICS ENGINEERING (IC2IE 2019): ARTIFICIAL INTELLIGENCE ROLES IN INDUSTRIAL REVOLUTION 4.0, 2019, : 49 - 52
  • [39] Classification and Optimization Scheme for Text Data using Machine Learning Naive Bayes Classifier
    Venkatesh
    Ranjitha, K., V
    PROCEEDINGS OF 2018 IEEE WORLD SYMPOSIUM ON COMMUNICATION ENGINEERING (WSCE), 2018, : 33 - 36
  • [40] Optimized classification approach for SAR images using hidden Markov chains model
    Yuan, L. H.
    Song, J. S.
    Xue, W. T.
    Zheng, Y. A.
    DYNAMICS OF CONTINUOUS DISCRETE AND IMPULSIVE SYSTEMS-SERIES B-APPLICATIONS & ALGORITHMS, 2006, 13E : 1899 - 1903