Predicting the Performance of Ensemble Classification Using Conditional Joint Probability

被引:1
|
作者
Murtza, Iqbal [1 ,2 ]
Kim, Jin-Young [3 ]
Adnan, Muhammad [4 ]
机构
[1] Chonnam Natl Univ, Educ & Res Ctr IoT Convergence Intelligent City Sa, Gwangju 61186, South Korea
[2] Air Univ, Fac Comp & AI, Dept Creat Technol, Islamabad 44230, Pakistan
[3] Chonnam Natl Univ, Dept Intelligent Elect & Comp Engn, Gwangju 61186, South Korea
[4] UiT Arctic Univ Norway, Dept Technol & Safety, N-9019 Tromso, Norway
基金
新加坡国家研究基金会;
关键词
machine learning; probability theory; ensemble classification; cost-sensitive learning; binary classification;
D O I
10.3390/math12162586
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
In many machine learning applications, there are many scenarios when performance is not satisfactory by single classifiers. In this case, an ensemble classification is constructed using several weak base learners to achieve satisfactory performance. Unluckily, the construction of the ensemble classification is empirical, i.e., to try an ensemble classification and if performance is not satisfactory then discard it. In this paper, a challenging analytical problem of the estimation of ensemble classification using the prediction performance of the base learners is considered. The proposed formulation is aimed at estimating the performance of ensemble classification without physically developing it, and it is derived from the perspective of probability theory by manipulating the decision probabilities of the base learners. For this purpose, the output of a base learner (which is either true positive, true negative, false positive, or false negative) is considered as a random variable. Then, the effects of logical disjunction-based and majority voting-based decision combination strategies are analyzed from the perspective of conditional joint probability. To evaluate the forecasted performance of ensemble classifier by the proposed methodology, publicly available standard datasets have been employed. The results show the effectiveness of the derived formulations to estimate the performance of ensemble classification. In addition to this, the theoretical and experimental results show that the logical disjunction-based decision outperforms majority voting in imbalanced datasets and cost-sensitive scenarios.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Intuitive ideas of 9th grade pupils about conditional probability and joint probability
    Fernandes, Jose Antonio
    Correia, Paulo Ferreira
    Miguel Contreras, Jose
    AVANCES DE INVESTIGACION EN EDUCACION MATEMATICA, 2013, (04): : 5 - 26
  • [32] High Performance Classification of Android Malware Using Ensemble Machine Learning
    Ouk, Pagnchakneat C.
    Pak, Wooguil
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (01): : 381 - 398
  • [34] Performance Improvement for Diagnosis of Colon Cancer by Using Ensemble Classification Methods
    Turhal, Ugur
    Babur, Sebahattin
    Avci, Cafer
    Akbas, Ahmet
    2013 INTERNATIONAL CONFERENCE ON TECHNOLOGICAL ADVANCES IN ELECTRICAL, ELECTRONICS AND COMPUTER ENGINEERING (TAEECE), 2013, : 271 - 275
  • [35] On the conditional performance of the synthetic chart with unknown process parameters using the exceedance probability criterion
    Hu, XueLong
    Tang, AnAn
    Qiao, YuLong
    Sun, JinSheng
    Guo, BaoCai
    PLOS ONE, 2020, 15 (10):
  • [36] Predicting phenotype transition probabilities via conditional algorithmic probability approximations
    Dingle, Kamaludin
    Novev, Javor K.
    Ahnert, Sebastian E.
    Louis, Ard A.
    JOURNAL OF THE ROYAL SOCIETY INTERFACE, 2022, 19 (197)
  • [37] Solder joint defect classification based on ensemble learning
    Wu, Hao
    SOLDERING & SURFACE MOUNT TECHNOLOGY, 2017, 29 (03) : 164 - 170
  • [38] Joint probability distributions and conditional probabilities in the tomographic representation of quantum states
    Man'ko, Margarita A.
    PHYSICA SCRIPTA, 2013, T153
  • [39] Probability-Weighted Voting Ensemble Learning for Classification ModelProbability-Weighted Voting Ensemble Learning for Classification Model
    Rojarath, Artitayapron
    Songpan, Wararat
    JOURNAL OF ADVANCES IN INFORMATION TECHNOLOGY, 2020, 11 (04) : 217 - 227
  • [40] TESTING PROBABILITY DISTRIBUTIONS USING CONDITIONAL SAMPLES
    Canonne, Clement L.
    Ron, Dana
    Servedio, Rocco A.
    SIAM JOURNAL ON COMPUTING, 2015, 44 (03) : 540 - 616