Predicting the Performance of Ensemble Classification Using Conditional Joint Probability

Cited by: 1
Authors
Murtza, Iqbal [1, 2]
Kim, Jin-Young [3]
Adnan, Muhammad [4]
Affiliations
[1] Chonnam Natl Univ, Educ & Res Ctr IoT Convergence Intelligent City Sa, Gwangju 61186, South Korea
[2] Air Univ, Fac Comp & AI, Dept Creat Technol, Islamabad 44230, Pakistan
[3] Chonnam Natl Univ, Dept Intelligent Elect & Comp Engn, Gwangju 61186, South Korea
[4] UiT Arctic Univ Norway, Dept Technol & Safety, N-9019 Tromso, Norway
Funding
National Research Foundation, Singapore;
Keywords
machine learning; probability theory; ensemble classification; cost-sensitive learning; binary classification;
DOI
10.3390/math12162586
Chinese Library Classification (CLC) number
O1 [Mathematics];
Subject classification code
0701; 070101;
Abstract
In many machine learning applications, a single classifier does not achieve satisfactory performance. In such cases, an ensemble classifier is constructed from several weak base learners to reach the required performance. Unfortunately, ensemble construction is empirical: an ensemble is built and tried, and discarded if its performance is unsatisfactory. This paper considers the challenging analytical problem of estimating the performance of an ensemble classifier from the prediction performance of its base learners. The proposed formulation aims to estimate the performance of an ensemble without actually building it, and it is derived from probability theory by manipulating the decision probabilities of the base learners. For this purpose, the output of a base learner (true positive, true negative, false positive, or false negative) is treated as a random variable. The effects of logical disjunction-based and majority voting-based decision combination strategies are then analyzed from the perspective of conditional joint probability. The predicted ensemble performance is evaluated on publicly available standard datasets. The results show that the derived formulations are effective at estimating ensemble performance. In addition, the theoretical and experimental results show that the logical disjunction-based decision outperforms majority voting on imbalanced datasets and in cost-sensitive scenarios.
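The idea of estimating ensemble rates from base-learner rates can be illustrated with a minimal sketch. It is not the paper's formulation: it assumes the base learners' decisions are conditionally independent given the true class, whereas the abstract describes a conditional joint probability treatment. The function names `or_rate` and `majority_rate` and the example rates are hypothetical.

```python
from itertools import product
from math import prod


def or_rate(rates):
    """Probability that at least one learner outputs 'positive'.

    `rates` holds each learner's probability of outputting 'positive'
    for a fixed true class (TPRs for positives, FPRs for negatives).
    Assumes the learners decide independently given the true class.
    """
    return 1.0 - prod(1.0 - r for r in rates)


def majority_rate(rates):
    """Probability that strictly more than half of the learners output 'positive'.

    Enumerates every vote pattern, so it is only practical for small ensembles.
    Uses the same independence assumption as or_rate.
    """
    n = len(rates)
    total = 0.0
    for votes in product((0, 1), repeat=n):
        if 2 * sum(votes) > n:
            total += prod(r if v else 1.0 - r for r, v in zip(rates, votes))
    return total


if __name__ == "__main__":
    # Hypothetical per-learner rates, chosen for illustration only.
    tpr = [0.7, 0.6, 0.8]
    fpr = [0.1, 0.2, 0.1]
    print("OR       TPR=%.3f FPR=%.3f" % (or_rate(tpr), or_rate(fpr)))
    print("Majority TPR=%.3f FPR=%.3f" % (majority_rate(tpr), majority_rate(fpr)))
```

Under this independence assumption, the disjunction rule never yields a lower true positive rate than majority voting, and never a lower false positive rate either, which is why the choice between them depends on class imbalance and misclassification costs.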
Pages: 16