Optimizing Mail Sorting with Naive Bayes Classifier and Enhanced Feature Extraction Method

被引:0
|
作者
C. Pavithra [1 ]
M. Saradha [1 ]
B. Antline Nisha [2 ]
机构
[1] REVA University,Department of Mathematics
[2] St. Joseph’s Institute of Technology,Department of Mathematics
关键词
Naive Bayes; Maximum entropy; Stop words; Ensemble BOW model; Natural programing language(NPL);
D O I
10.1007/s42979-024-03178-5
中图分类号
学科分类号
摘要
Email sorting refers to the process of organizing and categorizing incoming emails in order to efficiently manage and prioritize them. By implementing various sorting techniques, users can quickly identify important messages, reduce clutter, and enhance overall productivity. The Naive Bayes Classifier is used in the research to classify emails as spam or not spam using the conditional probability distribution idea. The Objective of the research is to implement Naive Bayes Classifier to classify emails as spam or not spam using the conditional probability distribution idea. In this method, the bag of phrases is frequently used along with the maximum entropy method for text classification. Stop words are used to reduce redundant terms, and each word’s frequency is a key factor in the classifier’s training. Further the feature sets are being classified to positive and negative data using the binary values 0 and 1, and the probabilities of the same are calculated using Naive Bayes classifier. P(y = True/sentence) = 0.0073 and P(y = False/sentence) = 0.0123. The significance of the research is to measure the performance of the classifier is then assessed after normalizing these values. We have obtained Normalized (P)y = True/Sentence = 0.848 and Normalized (P)y = False/ Sentence = 0.1511.
引用
收藏
相关论文
共 50 条
  • [41] Sentiment Analysis Of Practice Service's Questionnaire Using Naive Bayes Classifier Method
    Yunita, Selly
    Amaliah, Yusni
    Suprianto
    Indriani, Aida
    Fadlan, Muhammad
    Muhammad
    3RD INTERNATIONAL CONFERENCE ON CYBERNETICS AND INTELLIGENT SYSTEMS (ICORIS 2021), 2021, : 117 - 122
  • [42] Jakarta Congestion Mapping And Classification From Twitter Data Extraction Using Tokenization And Naive Bayes Classifier
    Septianto, Gigih Rezki
    Mukti, Firman Fakhri
    Nasrun, Muhammad
    Gozali, Alfian Akbar
    2015 ASIA PACIFIC CONFERENCE ON MULTIMEDIA AND BROADCASTING, 2015, : 14 - 19
  • [43] Concept relation extraction using Naive Bayes classifier for ontology-based question answering systems
    Kumar, G. Suresh
    Zayaraz, G.
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2015, 27 (01) : 13 - 24
  • [45] Multi-Feature Analysis for Automated Brain Stroke Classification Using Weighted Gaussian Naive Bayes Classifier
    Jayachitra, S.
    Prasanth, A.
    JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2021, 30 (10)
  • [46] Distributed denial of service attack detection using Naive Bayes Classifier through Info Gain Feature Selection
    Singh, Ningombam Anandshree
    Singh, Khundrakpam Johnson
    De, Tanmay
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATICS AND ANALYTICS (ICIA' 16), 2016,
  • [47] Ok-NB: An Enhanced OPTICS and k-Naive Bayes Classifier for Imbalance Classification With Overlapping
    Ahmed, Zahid
    Issac, Biju
    Das, Sufal
    IEEE ACCESS, 2024, 12 : 57458 - 57477
  • [48] Improving the Naive Bayes Classifier via a Quick Variable Selection Method Using Maximum of Entropy
    Abellan, Joaquin
    Castellano, Javier G.
    ENTROPY, 2017, 19 (06)
  • [49] Development of novel in silico model for developmental toxicity assessment by using naive Bayes classifier method
    Zhang, Hui
    Ren, Ji-Xia
    Kang, Yan-Li
    Bo, Peng
    Liang, Jun-Yu
    Ding, Lan
    Kong, Wei-Bao
    Zhang, Ji
    REPRODUCTIVE TOXICOLOGY, 2017, 71 : 8 - 15
  • [50] Sentiment analysis system for movie review in Bahasa Indonesia using naive bayes classifier method
    Nurdiansyah, Yanuar
    Bukhori, Saiful
    Hidayat, Rahmad
    1ST INTERNATIONAL CONFERENCE OF COMBINATORICS, GRAPH THEORY, AND NETWORK TOPOLOGY, 2018, 1008