Improving Bayesian credibility intervals for classifier error rates using maximum entropy empirical priors

Cited by: 4
Authors
Gustafsson, Mats G. [1]
Waman, Mikael [1, 2, 3]
Bolin, Ulrika Wickenberg [1]
Goransson, Hanna [1]
Fryknas, M. [1]
Andersson, Claes R. [1]
Isaksson, Anders [1]
Affiliations
[1] Uppsala Univ, Dept Med Sci, Acad Hosp, S-75185 Uppsala, Sweden
[2] Fraunhofer Chalmers Res Ctr Ind Math, SE-41288 Gothenburg, Sweden
[3] Univ Oxford, Comp Lab, Computat Biol Grp, Oxford OX1 3QD, England
Funding
Swedish Research Council
Keywords
Classifier design; Performance evaluation; Small sample learning; Decision support system; Diagnosis; Prognosis; LOGISTIC-REGRESSION; INFORMATION-THEORY; DECISION-SUPPORT; MICROARRAY DATA; PREDICTION; SELECTION; MODEL; VALIDATION; DIAGNOSIS; VARIANCE;
DOI
10.1016/j.artmed.2010.02.004
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Objective: Successful use of classifiers that learn to make decisions from a set of patient examples requires robust methods for performance estimation. Many promising approaches for determining an upper bound on the error rate of a single classifier have recently been reported, but the Bayesian credibility interval (CI) obtained from a conventional holdout test still delivers one of the tightest bounds. The conventional Bayesian CI becomes unacceptably large in real-world applications where the test set contains fewer than a few hundred examples. The source of this problem is the fact that the CI is determined exclusively by the result on the test examples. In other words, no information at all is provided by the uniform prior density employed, which reflects a complete lack of prior knowledge about the unknown error rate. Therefore, the aim of the study reported here was to investigate a maximum entropy (ME) based approach to improved prior knowledge and Bayesian CIs, and to demonstrate its relevance for biomedical research and clinical practice. Method and material: It is demonstrated how a refined non-uniform prior density can be obtained by means of the ME principle, using empirical results from a few designs and tests based on non-overlapping sets of examples. Results: Experimental results show that ME-based priors improve the CIs when applied to four quite different simulated data sets and two real-world data sets. Conclusions: An empirically derived ME prior seems promising for improving the Bayesian CI for the unknown error rate of a designed classifier. (C) 2010 Elsevier B.V. All rights reserved.
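Illustration: the effect described in the abstract can be sketched numerically. With a uniform prior, k errors on n holdout examples give a Beta(k + 1, n - k + 1) posterior for the error rate, so the credibility bound depends only on the test result. The sketch below compares that bound with one from a non-uniform prior. The non-uniform prior is an assumption made for illustration only: it is the maximum entropy density on [0, 1] under a single mean constraint, which has the form exp(-lam * e). The abstract does not state which constraints the authors use. The function names, the grid size, and the prior mean of 0.1 are all hypothetical.

```python
import math


def truncated_exp_mean(lam):
    """Mean of the density proportional to exp(-lam * e) on [0, 1]."""
    return 1.0 / lam - math.exp(-lam) / (-math.expm1(-lam))


def me_rate_for_mean(target_mean, lo=1e-3, hi=1e4):
    """Bisection for lam so that the truncated exponential has target_mean.

    Assumption: 1e-4 < target_mean < 0.4999, i.e. the prior belief is that
    the error rate is below chance level.
    """
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if truncated_exp_mean(mid) > target_mean:
            lo = mid  # mean still too large, so a faster decay is needed
        else:
            hi = mid
    return 0.5 * (lo + hi)


def upper_bound(k, n, level=0.95, log_prior=None, m=20001):
    """One-sided upper credibility bound for the error rate.

    k errors were observed on n test examples. The posterior is evaluated
    on a midpoint grid, so any log-prior on (0, 1) can be plugged in.
    log_prior=None means the uniform prior.
    """
    grid = [(i + 0.5) / m for i in range(m)]  # midpoints avoid log(0)
    logp = []
    for e in grid:
        lp = k * math.log(e) + (n - k) * math.log(1.0 - e)
        if log_prior is not None:
            lp += log_prior(e)
        logp.append(lp)
    peak = max(logp)
    weights = [math.exp(v - peak) for v in logp]
    total = sum(weights)
    acc = 0.0
    for e, w in zip(grid, weights):
        acc += w / total
        if acc >= level:
            return e
    return grid[-1]


if __name__ == "__main__":
    k, n = 2, 20  # hypothetical holdout result
    lam = me_rate_for_mean(0.1)  # hypothetical prior mean error rate
    print("uniform prior bound:", upper_bound(k, n))
    print("mean-constrained ME prior bound:",
          upper_bound(k, n, log_prior=lambda e: -lam * e))
```

Because the assumed prior puts more mass on small error rates, the second bound is lower than the uniform-prior bound for the same test result. How much it tightens depends entirely on the constraint used to build the prior.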
Pages: 93-104
Number of pages: 12
Related papers
50 in total
  • [21] Improving Bayesian radiological profiling of waste drums using Dirichlet priors, Gaussian process priors, and hierarchical modeling
    Laloy, Eric
    Rogiers, Bart
    Bielen, An
    Borella, Alessandro
    Boden, Sven
    APPLIED RADIATION AND ISOTOPES, 2023, 194
  • [22] Improving Bayesian Classifier Using Vine Copula and Fuzzy Clustering Technique
    Che-Ngoc H.
    Nguyen-Trang T.
    Huynh-Van H.
    Vo-Van T.
    Annals of Data Science, 2024, 11 (02) : 709 - 732
  • [23] Bayesian inference for an extended simple regression measurement error model using skewed priors
    Rodrigues, Josemar
    Bolfarine, Heleno
    BAYESIAN ANALYSIS, 2007, 2 (02): : 349 - 364
  • [24] Improving Estimations of Spatial Distribution of Soil Respiration Using the Bayesian Maximum Entropy Algorithm and Soil Temperature as Auxiliary Data
    Hu, Junguo
    Zhou, Jian
    Zhou, Guomo
    Luo, Yiqi
    Xu, Xiaojun
    Li, Pingheng
    Liang, Junyi
    PLOS ONE, 2016, 11 (01):
  • [25] Improving fault delineation using maximum entropy multispectral coherence
    Lyu, Bin
    Qi, Jie
    Sinha, Saurabh
    Li, Jianjun
    Marfurt, Kurt J.
    INTERPRETATION-A JOURNAL OF SUBSURFACE CHARACTERIZATION, 2020, 8 (04): : T835 - T850
  • [26] Improving predictability of time series using maximum entropy methods
    Chliamovitch, G.
    Dupuis, A.
    Golub, A.
    Chopard, B.
    EPL, 2015, 110 (01)
  • [27] Improving Persian POS Tagging Using the Maximum Entropy Model
    Kardan, Ahmad A.
    Imani, Maryam Bahojb
    2014 IRANIAN CONFERENCE ON INTELLIGENT SYSTEMS (ICIS), 2014,
  • [28] Analysis and Comparison of Kidney Stone Detection using Gaussian Maximum Likelihood Classifier and Bayesian Classifier with Improved Accuracy
    Kishore, U.
    Ramadevi, R.
    CARDIOMETRY, 2022, (25): : 799 - 805
  • [29] Deflating Trees: Improving Bayesian Branch-Length Estimates using Informed Priors
    Nelson, Bradley J.
    Andersen, John J.
    Brown, Jeremy M.
    SYSTEMATIC BIOLOGY, 2015, 64 (03) : 441 - 447
  • [30] Empirical estimation of sequencing error rates using smoothing splines
    Xuan Zhu
    Jian Wang
    Bo Peng
    Sanjay Shete
    BMC Bioinformatics, 17