A Comparative Approach to Email Classification Using Naive Bayes Classifier and Hidden Markov Model

被引:0
|
作者
Gomes, Sebastian Romv [1 ]
Saroar, Sk Golam [1 ]
Telot, Md Mosfaiul Alam [1 ]
Khan, Behroz Newaz [1 ]
Chakrabarty, Amitabha [1 ]
Mostakim, Moin [1 ]
机构
[1] BRAC Univ, Dept Comp Sci & Engn, 66 Bir Uttam AK Khandakar Rd, Dhaka 1212, Bangladesh
关键词
Email Classification; Hidden Markov Model; Naive Bayes; Natural Language Processing; NLTK; Supervised Learning;
D O I
暂无
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
This research investigates a comparison between two different approaches for classifying emails based on their categories. Naive Bayes and Hidden Markov Model (HMM), two different machine learning algorithms, both have been used for detecting whether an email is important or spam. Naive Bayes Classifier is based on conditional probabilities. It is fast and works great with small dataset. It considers independent words as a feature. HMM is a generative, probabilistic model that provides us with distribution over the sequences of observations. HMMs can handle inputs of variable length and help programs come to the most likely decision, based on both previous decisions and current data. Various combinations of NLP techniques-stopwords removing, stemming, lemmatizing have been tried on both the algorithms to inspect the differences in accuracy as well as to find the best method among them.
引用
收藏
页码:482 / 487
页数:6
相关论文
共 50 条
  • [41] Classification of cellular phone mobility using Naive Bayes model
    Puntumapon, K.
    Pattara-atikom, W.
    2008 IEEE 67TH VEHICULAR TECHNOLOGY CONFERENCE-SPRING, VOLS 1-7, 2008, : 3021 - 3025
  • [42] Predictive model for admission uncertainty in high education using Naive Bayes classifier
    Rawal, Atul
    Lal, Bechoo
    JOURNAL OF INDIAN BUSINESS RESEARCH, 2023, 15 (02) : 262 - 277
  • [43] Sentiment Analysis using Naive Bayes and Complement Naive Bayes Classifier Algorithms on Hadoop Framework
    Seref, Berna
    Bostanci, Erkan
    2018 2ND INTERNATIONAL SYMPOSIUM ON MULTIDISCIPLINARY STUDIES AND INNOVATIVE TECHNOLOGIES (ISMSIT), 2018, : 555 - 561
  • [44] EEG-Based Fatigue Classification by Using Parallel Hidden Markov Model and Pattern Classifier Combination
    Sun, Hui
    Lu, Bao-Liang
    NEURAL INFORMATION PROCESSING, ICONIP 2012, PT IV, 2012, 7666 : 484 - 491
  • [45] Bug Fix-Time Prediction Model Using Naive Bayes Classifier
    Abdelmoez, W.
    Kholief, Mohamed
    Elsalmy, Fayrouz M.
    2012 22ND INTERNATIONAL CONFERENCE ON COMPUTER THEORY AND APPLICATIONS (ICCTA), 2012, : 167 - 172
  • [46] A network intrusion detection system based on a Hidden Naive Bayes multiclass classifier
    Koc, Levent
    Mazzuchi, Thomas A.
    Sarkani, Shahram
    EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (18) : 13492 - 13500
  • [47] The empirical study of the naive Bayes classifier in the case of Markov chain recognition task
    Zolnierek, A
    Rubacha, B
    COMPUTER RECOGNITION SYSTEMS, PROCEEDINGS, 2005, : 329 - 336
  • [48] Network Disruption Prediction Using Naive Bayes Classifier
    Oktaviana, Shinta
    Ermis, Iklima
    Anasanti, Mila Desi
    Hammad, Jehad
    2019 2ND INTERNATIONAL CONFERENCE OF COMPUTER AND INFORMATICS ENGINEERING (IC2IE 2019): ARTIFICIAL INTELLIGENCE ROLES IN INDUSTRIAL REVOLUTION 4.0, 2019, : 159 - 163
  • [49] Repairing Broken Links Using Naive Bayes Classifier
    Khan, Faheem Nawaz
    Ali, Adnan
    Hussain, Imtiaz
    Sarwar, Nadeem
    Rafique, Hamaad
    INTELLIGENT TECHNOLOGIES AND APPLICATIONS, INTAP 2018, 2019, 932 : 461 - 472
  • [50] Prediction of Slope Stability using Naive Bayes Classifier
    Feng, Xianda
    Li, Shuchen
    Yuan, Chao
    Zeng, Peng
    Sun, Yang
    KSCE JOURNAL OF CIVIL ENGINEERING, 2018, 22 (03) : 941 - 950