Attribute grouping-based naive Bayesian classifier

被引:0
|
作者
He, Yulin [1 ,2 ]
Ou, Guiliang [2 ]
Fournier-Viger, Philippe [2 ]
Huang, Joshua Zhexue [1 ,2 ]
机构
[1] Guangdong Lab Artificial Intelligence & Digital Ec, Shenzhen 518107, Peoples R China
[2] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
基金
中国国家自然科学基金;
关键词
naive Bayesian classifier; attribute independence assumption; attribute grouping; dependent attribute group; posterior probability; class-conditional probability; DENSITY-ESTIMATION; ALGORITHMS;
D O I
10.1007/s11432-022-3728-2
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The naive Bayesian classifier (NBC) is a supervised machine learning algorithm having a simple model structure and good theoretical interpretability. However, the generalization performance of NBC is limited to a large extent by the assumption of attribute independence. To address this issue, this paper proposes a novel attribute grouping-based NBC (AG-NBC), which is a variant of the classical NBC trained with different attribute groups. AG-NBC first applies a novel effective objective function to automatically identify optimal dependent attribute groups (DAGs). Condition attributes in the same DAG are strongly dependent on the class attribute, whereas attributes in different DAGs are independent of one another. Then, for each DAG, a random vector functional link network with a SoftMax layer is trained to output posterior probabilities in the form of joint probability density estimation. The NBC is trained using the grouping attributes that correspond to the original condition attributes. Extensive experiments were conducted to validate the rationality, feasibility, and effectiveness of AG-NBC. Our findings showed that the attribute groups chosen for NBC can accurately represent attribute dependencies and reduce overlaps between different posterior probability densities. In addition, the comparative results with NBC, flexible NBC (FNBC), tree augmented Bayes network (TAN), gain ratio-based attribute weighted naive Bayes (GRAWNB), averaged one-dependence estimators (AODE), weighted AODE (WAODE), independent component analysis-based NBC (ICA-NBC), hidden naive Bayesian (HNB) classifier, and correlation-based feature weighting filter for naive Bayes (CFW) show that AG-NBC obtains statistically better testing accuracies, higher area under the receiver operating characteristic curves (AUCs), and fewer probability mean square errors (PMSEs) than other Bayesian classifiers. The experimental results demonstrate that AG-NBC is a valid and efficient approach for alleviating the attribute independence assumption when building NBCs.
引用
收藏
页数:25
相关论文
共 50 条
  • [31] Nomograms for visualization of naive Bayesian classifier
    Mozina, M
    Demsar, J
    Kattan, M
    Zupan, B
    KNOWLEDGE DISCOVERY IN DATABASES: PKDD 2004, PROCEEDINGS, 2004, 3202 : 337 - 348
  • [32] A semi-naive Bayes classifier with grouping of cases
    Abellan, Joaquin
    Cano, Andres
    Masegosa, Andres R.
    Moral, Serafin
    SYMBOLIC AND QUANTITATIVE APPROACHES TO REASONING WITH UNCERTAINTY, PROCEEDINGS, 2007, 4724 : 477 - +
  • [33] Selective Bayesian classifier: feature selection for the Naive Bayesian classifier using decision trees
    Ratanamahatana, C
    Gunopulos, D
    DATA MINING III, 2002, 6 : 613 - 623
  • [34] Classification of Micro-blog Sentiment Based on Naive Bayesian Classifier
    Ou, Xiaoheng
    Cao, Yan
    Mu, Xiangwei
    LISS 2013, 2015, : 585 - 589
  • [35] Public Bicycle System Fault Diagnosis Based on Naive Bayesian Classifier
    Shi Z.
    Hao W.
    Dong H.
    Zhongguo Jixie Gongcheng/China Mechanical Engineering, 2019, 30 (08): : 983 - 987
  • [36] Incident Duration Prediction Based on Latent Gaussian Naive Bayesian classifier
    Li, Dawei
    Cheng, Lin
    Ma, Jiangshan
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE SYSTEMS, 2011, 4 (03) : 345 - 352
  • [37] INCIDENT DURATION PREDICTION BASED ON TREE AUGMENTED NAIVE BAYESIAN CLASSIFIER
    Li, Dawei
    Cheng, Lin
    TRANSPORTATION AND URBAN SUSTAINABILITY, 2010, : 407 - 414
  • [38] Incident Duration Prediction Based on Latent Gaussian Naive Bayesian classifier
    Li D.
    Cheng L.
    Ma J.
    International Journal of Computational Intelligence Systems, 2011, 4 (3) : 345 - 352
  • [39] Robustness Analysis of Naive Bayesian Classifier-Based Collaborative Filtering
    Kaleli, Cihan
    Polat, Huseyin
    E-COMMERCE AND WEB TECHNOLOGIES, EC-WEB 2013, 2013, 152 : 202 - 209
  • [40] The improvement of Naive Bayesian Classifier based on the strategy of fuzzy feature selection
    Zhang, Xuefeng
    Liu, Peng
    Fan, Jinjin
    ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 1, 2006, : 377 - 382