Optimizing Mail Sorting with Naive Bayes Classifier and Enhanced Feature Extraction Method

Authors
C. Pavithra [1]
M. Saradha [1]
B. Antline Nisha [2]
Affiliations
[1] REVA University, Department of Mathematics
[2] St. Joseph’s Institute of Technology, Department of Mathematics
Keywords
Naive Bayes; Maximum entropy; Stop words; Ensemble BOW model; Natural programing language(NPL);
DOI
10.1007/s42979-024-03178-5
Abstract
Email sorting is the process of organizing and categorizing incoming emails so that they can be managed and prioritized efficiently. With suitable sorting techniques, users can quickly identify important messages, reduce clutter, and improve overall productivity. The objective of this research is to implement a Naive Bayes classifier that labels emails as spam or not spam using conditional probability distributions. In this method, the bag of phrases is used along with the maximum entropy method for text classification. Stop-word removal reduces redundant terms, and the frequency of each word is a key factor in training the classifier. The feature sets are then labeled as positive or negative data using the binary values 0 and 1, and the corresponding probabilities are calculated with the Naive Bayes classifier: P(y = True | sentence) = 0.0073 and P(y = False | sentence) = 0.0123. The performance of the classifier is then assessed after normalizing these values, giving normalized P(y = True | sentence) = 0.848 and normalized P(y = False | sentence) = 0.1511.
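The pipeline outlined in the abstract (stop-word filtering, word-frequency counts, class probabilities from Naive Bayes, then normalization) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the toy training set, the stop-word list, the function names, the use of Laplace (add-one) smoothing, and the convention that label 1 means spam and 0 means not spam are all assumptions made for illustration only.

```python
import math
from collections import Counter

# Hypothetical stop-word list; the paper's actual list is not given here.
STOP_WORDS = {"a", "an", "the", "is", "to", "of", "and", "in", "for", "your", "on"}

def tokenize(text):
    """Lowercase, keep alphabetic words only, and drop stop words."""
    cleaned = "".join(ch.lower() if ch.isalpha() else " " for ch in text)
    return [w for w in cleaned.split() if w not in STOP_WORDS]

def train(samples):
    """Count word frequencies per class (assumed: 1 = spam, 0 = not spam)."""
    word_counts = {0: Counter(), 1: Counter()}
    doc_counts = Counter()
    for text, label in samples:
        doc_counts[label] += 1
        word_counts[label].update(tokenize(text))
    vocab = set(word_counts[0]) | set(word_counts[1])
    return word_counts, doc_counts, vocab

def posterior(text, model):
    """Return normalized class probabilities for one message."""
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    log_scores = {}
    for label in (0, 1):
        total_words = sum(word_counts[label].values())
        # Start from the class prior, then add one log-likelihood per word.
        score = math.log(doc_counts[label] / total_docs)
        for w in tokenize(text):
            if w in vocab:  # words never seen in training are ignored
                # Laplace smoothing (an assumption of this sketch).
                score += math.log(
                    (word_counts[label][w] + 1) / (total_words + len(vocab))
                )
        log_scores[label] = score
    # Normalize so the two class probabilities sum to 1.
    peak = max(log_scores.values())
    unnormalized = {k: math.exp(v - peak) for k, v in log_scores.items()}
    z = sum(unnormalized.values())
    return {k: v / z for k, v in unnormalized.items()}

# Toy data invented for this example.
TRAIN = [
    ("Win a free prize now", 1),
    ("Free money offer, claim your prize", 1),
    ("Meeting agenda for Monday", 0),
    ("Please review the project report", 0),
]

model = train(TRAIN)
probs = posterior("Claim your free prize", model)
print(probs)
```

The normalization step divides each unnormalized class score by the sum of both scores, so the two reported probabilities always add up to 1. Working in log space before normalizing avoids numerical underflow on long messages.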