Complement-Class Harmonized Naive Bayes Classifier

被引:2
|
作者
Alenazi, Fahad S. [1 ]
El Hindi, Khalil [1 ]
AsSadhan, Basil [2 ]
机构
[1] King Saud Univ, Dept Comp Sci, Riyadh 11543, Saudi Arabia
[2] King Saud Univ, Dept Elect Engn, Riyadh 11421, Saudi Arabia
来源
APPLIED SCIENCES-BASEL | 2023年 / 13卷 / 08期
关键词
scarce data; harmonic average; attribute weighting; Naive Bayes;
D O I
10.3390/app13084852
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
Naive Bayes (NB) classification performance degrades if the conditional independence assumption is not satisfied or if the conditional probability estimate is not realistic due to the attributes of correlation and scarce data, respectively. Many works address these two problems, but few works tackle them simultaneously. Existing methods heuristically employ information theory or applied gradient optimization to enhance NB classification performance, however, to the best of our knowledge, the enhanced model generalization capability deteriorated especially on scant data. In this work, we propose a fine-grained boosting of the NB classifier to identify hidden and potential discriminative attribute values that lead the NB model to underfit or overfit on the training data and to enhance their predictive power. We employ the complement harmonic average of the conditional probability terms to measure their distribution divergence and impact on the classification performance for each attribute value. The proposed method is subtle yet significant enough in capturing the attribute values' inter-correlation (between classes) and intra-correlation (within the class) and elegantly and effectively measuring their impact on the model's performance. We compare our proposed complement-class harmonized Naive Bayes classifier (CHNB) with the state-of-the-art Naive Bayes and imbalanced ensemble boosting methods on general and imbalanced machine-learning benchmark datasets, respectively. The empirical results demonstrate that CHNB significantly outperforms the compared methods.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Exact Learning Augmented Naive Bayes Classifier
    Sugahara, Shouta
    Ueno, Maomi
    ENTROPY, 2021, 23 (12)
  • [22] Incremental discretization for Naive-Bayes classifier
    Lu, Jingli
    Yang, Ying
    Webb, Geoffrey I.
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2006, 4093 : 223 - 238
  • [23] Threshold-based Naive Bayes classifier
    Romano, Maurizio
    Contu, Giulia
    Mola, Francesco
    Conversano, Claudio
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2024, 18 (02) : 325 - 361
  • [24] Regularization and averaging of the selective Naive Bayes classifier
    Boulle, Marc
    2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 1680 - 1688
  • [25] An Extension of Tree Augmented Naive Bayes Classifier
    Wang, Zhongfeng
    Tian, Jianwei
    2011 SECOND ETP/IITA CONFERENCE ON TELECOMMUNICATION AND INFORMATION (TEIN 2011), VOL 1, 2011, : 243 - 246
  • [26] Federated Learning with Discriminative Naive Bayes Classifier
    Torrijos, Pablo
    Alfaro, Juan C.
    Gamez, Jose A.
    Puerta, Jose M.
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2024, PT II, 2025, 15347 : 328 - 339
  • [27] Understanding of the Naive Bayes Classifier in Spam Filtering
    Wei, Qijia
    6TH INTERNATIONAL CONFERENCE ON COMPUTER-AIDED DESIGN, MANUFACTURING, MODELING AND SIMULATION (CDMMS 2018), 2018, 1967
  • [28] Boosting the Tree Augmented Naive Bayes classifier
    Downs, T
    Tang, A
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING IDEAL 2004, PROCEEDINGS, 2004, 3177 : 708 - 713
  • [29] Multiple explanations driven Naive Bayes classifier
    Almonayyes, A
    JOURNAL OF UNIVERSAL COMPUTER SCIENCE, 2006, 12 (02) : 127 - 139
  • [30] A sequential naive Bayes classifier for DNA barcodes
    Anderson, Michael P.
    Dubnicka, Suzanne R.
    STATISTICAL APPLICATIONS IN GENETICS AND MOLECULAR BIOLOGY, 2014, 13 (04) : 423 - 434