Complement-Class Harmonized Naive Bayes Classifier

被引:2
|
作者
Alenazi, Fahad S. [1 ]
El Hindi, Khalil [1 ]
AsSadhan, Basil [2 ]
机构
[1] King Saud Univ, Dept Comp Sci, Riyadh 11543, Saudi Arabia
[2] King Saud Univ, Dept Elect Engn, Riyadh 11421, Saudi Arabia
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 08期
关键词
scarce data; harmonic average; attribute weighting; Naive Bayes;
D O I
10.3390/app13084852
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Naive Bayes (NB) classification performance degrades if the conditional independence assumption is not satisfied or if the conditional probability estimate is not realistic due to the attributes of correlation and scarce data, respectively. Many works address these two problems, but few works tackle them simultaneously. Existing methods heuristically employ information theory or applied gradient optimization to enhance NB classification performance, however, to the best of our knowledge, the enhanced model generalization capability deteriorated especially on scant data. In this work, we propose a fine-grained boosting of the NB classifier to identify hidden and potential discriminative attribute values that lead the NB model to underfit or overfit on the training data and to enhance their predictive power. We employ the complement harmonic average of the conditional probability terms to measure their distribution divergence and impact on the classification performance for each attribute value. The proposed method is subtle yet significant enough in capturing the attribute values' inter-correlation (between classes) and intra-correlation (within the class) and elegantly and effectively measuring their impact on the model's performance. We compare our proposed complement-class harmonized Naive Bayes classifier (CHNB) with the state-of-the-art Naive Bayes and imbalanced ensemble boosting methods on general and imbalanced machine-learning benchmark datasets, respectively. The empirical results demonstrate that CHNB significantly outperforms the compared methods.
引用
收藏
页数:18
相关论文
共 50 条
  • [1] Sentiment Analysis using Naive Bayes and Complement Naive Bayes Classifier Algorithms on Hadoop Framework
    Seref, Berna
    Bostanci, Erkan
    2018 2ND INTERNATIONAL SYMPOSIUM ON MULTIDISCIPLINARY STUDIES AND INNOVATIVE TECHNOLOGIES (ISMSIT), 2018, : 555 - 561
  • [2] A naive Bayes classifier for identifying Class II YSOs
    Wilson, Andrew J.
    Lakeland, Ben S.
    Wilson, Tom J.
    Naylor, Tim
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2023, 521 (01) : 354 - 388
  • [3] Estimating a one -class naive Bayes text classifier
    Zhang, Yihong
    Jatowt, Adam
    INTELLIGENT DATA ANALYSIS, 2020, 24 (03) : 567 - 579
  • [4] Complement Naive Bayes Classifier for Sentiment Analysis of Internet Movie Database
    Dewi, Christine
    Chen, Rung-Ching
    INTELLIGENT INFORMATION AND DATABASE SYSTEMS, ACIIDS 2022, PT I, 2022, 13757 : 81 - 93
  • [5] Naive Bayes text classifier
    Zhang, Haiyi
    Li, Di
    GRC: 2007 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING, PROCEEDINGS, 2007, : 708 - 711
  • [6] Performance Comparison of Naive Bayes and Complement Naive Bayes Algorithms
    Seref, Berna
    Bostanci, Erkan
    2019 6TH INTERNATIONAL CONFERENCE ON ELECTRICAL AND ELECTRONICS ENGINEERING (ICEEE 2019), 2019, : 131 - 138
  • [7] A FUZZY EXPONENTIAL NAIVE BAYES CLASSIFIER
    Moraes, R. M.
    Machado, L. S.
    UNCERTAINTY MODELLING IN KNOWLEDGE ENGINEERING AND DECISION MAKING, 2016, 10 : 207 - 212
  • [8] A Fuzzy Gamma Naive Bayes classifier
    de Moraes, Ronei Marcos
    de Melo Gomes Soares, Elaine Anita
    Machado, Liliane dos Santos
    DATA SCIENCE AND KNOWLEDGE ENGINEERING FOR SENSING DECISION SUPPORT, 2018, 11 : 691 - 699
  • [9] Learning an optimal naive Bayes classifier
    Martinez-Arroyo, Miriam
    Sucar, L. Enrique
    18TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOL 3, PROCEEDINGS, 2006, : 1236 - +
  • [10] The naive Bayes classifier for functional data
    Zhang, Yi-Chen
    Sakhanenko, Lyudmila
    STATISTICS & PROBABILITY LETTERS, 2019, 152 : 137 - 146