Cost-sensitive boosting algorithms: Do we really need them?

Cited by: 41
Authors
Nikolaou, Nikolaos [1]
Edakunni, Narayanan [1]
Kull, Meelis [2]
Flach, Peter [2]
Brown, Gavin [1]
Affiliations
[1] Univ Manchester, Sch Comp Sci, Kilburn Bldg,Oxford Rd, Manchester M13 9PL, Lancs, England
[2] Univ Bristol, Dept Comp Sci, Merchant Venturers Bldg,Woodland Rd, Bristol BS8 1UB, Avon, England
Funding
Engineering and Physical Sciences Research Council (EPSRC), UK
Keywords
Boosting; Cost-sensitive; Class imbalance; Classifier calibration
DOI
10.1007/s10994-016-5572-x
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
We provide a unifying perspective for two decades of work on cost-sensitive Boosting algorithms. When analyzing the literature from 1997 to 2016, we find 15 distinct cost-sensitive variants of the original algorithm; each of these has its own motivation and claims to superiority, so whom should we believe? In this work we critique the Boosting literature using four theoretical frameworks: Bayesian decision theory, the functional gradient descent view, margin theory, and probabilistic modelling. Our finding is that only three algorithms are fully supported, and the probabilistic model view suggests that all require their outputs to be calibrated for best performance. Experiments on 18 datasets across 21 degrees of imbalance support the hypothesis, showing that once calibrated, they perform equivalently and outperform all others. Our final recommendation, based on simplicity, flexibility and performance, is to use the original Adaboost algorithm with a shifted decision threshold and calibrated probability estimates.
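The final recommendation in the abstract can be illustrated with a minimal sketch. The abstract does not name a calibration method or give a threshold formula, so everything below is an assumption made for illustration: Platt-style logistic scaling stands in as the calibrator, the threshold c_FP / (c_FP + c_FN) is the standard Bayesian decision theory result when correct decisions cost nothing, and the function names (`cost_sensitive_threshold`, `fit_platt`, `predict`) are hypothetical. The input `scores` are assumed to be real-valued ensemble outputs from an already trained, cost-insensitive AdaBoost model.

```python
import math


def cost_sensitive_threshold(cost_fp, cost_fn):
    """Threshold on p(y=1|x) that minimises expected cost.

    Assumes correct decisions cost nothing: predict positive
    when p > cost_fp / (cost_fp + cost_fn).
    """
    if cost_fp <= 0 or cost_fn <= 0:
        raise ValueError("costs must be positive")
    return cost_fp / (cost_fp + cost_fn)


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def fit_platt(scores, labels, lr=0.1, epochs=2000):
    """Platt-style scaling (an assumed calibrator): fit p = sigmoid(a*s + b).

    Plain gradient descent on the mean log loss; labels are 0 or 1.
    """
    a, b = 1.0, 0.0
    n = len(scores)
    for _ in range(epochs):
        grad_a = 0.0
        grad_b = 0.0
        for s, y in zip(scores, labels):
            err = sigmoid(a * s + b) - y  # derivative of log loss w.r.t. the logit
            grad_a += err * s
            grad_b += err
        a -= lr * grad_a / n
        b -= lr * grad_b / n
    return a, b


def predict(score, a, b, cost_fp, cost_fn):
    """Calibrate the raw score, then compare with the shifted threshold."""
    p = sigmoid(a * score + b)
    return 1 if p > cost_sensitive_threshold(cost_fp, cost_fn) else 0


# Toy held-out scores and labels, invented for this sketch only.
scores = [-2.0, -1.0, -0.5, 0.5, 1.0, 2.0]
labels = [0, 0, 1, 0, 1, 1]
a, b = fit_platt(scores, labels)
# Missing a positive is assumed to cost four times a false alarm.
decisions = [predict(s, a, b, cost_fp=1.0, cost_fn=4.0) for s in scores]
```

In this arrangement the costs enter only at decision time, so the same trained ensemble and calibrator can be reused when the cost ratio changes; only the threshold moves.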
Pages: 359-384
Page count: 26