Optimizing Mail Sorting with Naive Bayes Classifier and Enhanced Feature Extraction Method

被引:0
|
作者
C. Pavithra [1 ]
M. Saradha [1 ]
B. Antline Nisha [2 ]
机构
[1] REVA University,Department of Mathematics
[2] St. Joseph’s Institute of Technology,Department of Mathematics
关键词
Naive Bayes; Maximum entropy; Stop words; Ensemble BOW model; Natural programing language(NPL);
D O I
10.1007/s42979-024-03178-5
中图分类号
学科分类号
摘要
Email sorting refers to the process of organizing and categorizing incoming emails in order to efficiently manage and prioritize them. By implementing various sorting techniques, users can quickly identify important messages, reduce clutter, and enhance overall productivity. The Naive Bayes Classifier is used in the research to classify emails as spam or not spam using the conditional probability distribution idea. The Objective of the research is to implement Naive Bayes Classifier to classify emails as spam or not spam using the conditional probability distribution idea. In this method, the bag of phrases is frequently used along with the maximum entropy method for text classification. Stop words are used to reduce redundant terms, and each word’s frequency is a key factor in the classifier’s training. Further the feature sets are being classified to positive and negative data using the binary values 0 and 1, and the probabilities of the same are calculated using Naive Bayes classifier. P(y = True/sentence) = 0.0073 and P(y = False/sentence) = 0.0123. The significance of the research is to measure the performance of the classifier is then assessed after normalizing these values. We have obtained Normalized (P)y = True/Sentence = 0.848 and Normalized (P)y = False/ Sentence = 0.1511.
引用
收藏
相关论文
共 50 条
  • [1] Naive bayes face/nonface classifier: A study of preprocessing and feature extraction techniques
    Phung, SL
    Bouzerdoum, A
    Chai, D
    Watson, A
    ICIP: 2004 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOLS 1- 5, 2004, : 1385 - 1388
  • [2] Optimizing MapReduce Partitioner Using Naive Bayes Classifier
    Chen, Lei
    Lu, Wei
    Wang, Liqiang
    Bao, Ergude
    Xing, Weiwei
    Yang, Yong
    Yuan, Victor
    2017 IEEE 15TH INTL CONF ON DEPENDABLE, AUTONOMIC AND SECURE COMPUTING, 15TH INTL CONF ON PERVASIVE INTELLIGENCE AND COMPUTING, 3RD INTL CONF ON BIG DATA INTELLIGENCE AND COMPUTING AND CYBER SCIENCE AND TECHNOLOGY CONGRESS(DASC/PICOM/DATACOM/CYBERSCI, 2017, : 812 - 819
  • [3] Feature Selection for Chemical Compound Extraction using Wrapper Approach with Naive Bayes Classifier
    Alshaikhdeeb, Basel
    Ahmad, Kamsuriah
    PROCEEDINGS OF THE 2017 6TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING AND INFORMATICS (ICEEI'17), 2017,
  • [4] Feature selection for optimizing the Naive Bayes algorithm
    Winarti, Titin
    Vydia, Vensy
    ENGINEERING, INFORMATION AND AGRICULTURAL TECHNOLOGY IN THE GLOBAL DIGITAL REVOLUTION, 2020, : 47 - 51
  • [5] The impact of feature extraction on the performance of a classifier: kNN, Naive Bayes and C4.5
    Pechenizkiy, M
    ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2005, 3501 : 268 - 279
  • [6] A Dependent Feature Weighting Filter for Naive Bayes Classifier
    Chatip, Gieliz
    Yilmaz, Ferkan
    2022 30TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU, 2022,
  • [7] Effectiveness of Feature Extraction by PCA-Based Detection and Naive Bayes Classifier for Glaucoma Images
    Christobel, J. Shiny
    Vimala, D.
    Athanesious, J. Joshan
    Singh, S. Christopher Ezhil
    Murugan, Sivaraj
    INTERNATIONAL JOURNAL OF DIGITAL MULTIMEDIA BROADCASTING, 2022, 2022
  • [8] Feature extraction and classification of proteomics data using stationary wavelet transform and naive Bayes classifier
    Liu Dan
    Huang Yuan-yuan
    Ma Chen-xiang
    2010 4TH INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICAL ENGINEERING (ICBBE 2010), 2010,
  • [9] Effectiveness of Feature Extraction by PCA-Based Detection and Naive Bayes Classifier for Glaucoma Images
    Shiny Christobel, J.
    Vimala, D.
    Joshan Athanesious, J.
    Christopher Ezhil Singh, S.
    Murugan, Sivaraj
    International Journal of Digital Multimedia Broadcasting, 2022, 2022
  • [10] Class dependent feature scaling method using naive Bayes classifier for text datamining
    Youn, Eunseog
    Jeong, Myong K.
    PATTERN RECOGNITION LETTERS, 2009, 30 (05) : 477 - 485