Improving Bayesian credibility intervals for classifier error rates using maximum entropy empirical priors

Cited by: 4

Authors:

Gustafsson, Mats G. [1]

Waman, Mikael [1,2,3]

Bolin, Ulrika Wickenberg [1]

Goransson, Hanna [1]

Fryknas, M. [1]

Andersson, Claes R. [1]

Isaksson, Anders [1]

Affiliations:

[1] Uppsala Univ, Dept Med Sci, Acad Hosp, S-75185 Uppsala, Sweden

[2] Fraunhofer Chalmers Res Ctr Ind Math, SE-41288 Gothenburg, Sweden

[3] Univ Oxford, Comp Lab, Computat Biol Grp, Oxford OX1 3QD, England

Source:

ARTIFICIAL INTELLIGENCE IN MEDICINE | 2010 / Vol. 49 / No. 02

Funding:

Swedish Research Council;

Keywords:

Classifier design; Performance evaluation; Small sample learning; Decision support system; Diagnosis; Prognosis; LOGISTIC-REGRESSION; INFORMATION-THEORY; DECISION-SUPPORT; MICROARRAY DATA; PREDICTION; SELECTION; MODEL; VALIDATION; DIAGNOSIS; VARIANCE;

DOI:

10.1016/j.artmed.2010.02.004

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification code:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Objective: Successful use of classifiers that learn to make decisions from a set of patient examples requires robust methods for performance estimation. Recently many promising approaches for determination of an upper bound for the error rate of a single classifier have been reported, but the Bayesian credibility interval (CI) obtained from a conventional holdout test still delivers one of the tightest bounds. The conventional Bayesian CI becomes unacceptably large in real world applications where the test set sizes are less than a few hundred. The source of this problem is the fact that the CI is determined exclusively by the result on the test examples. In other words, there is no information at all provided by the uniform prior density distribution employed, which reflects complete lack of prior knowledge about the unknown error rate. Therefore, the aim of the study reported here was to study a maximum entropy (ME) based approach to improved prior knowledge and Bayesian CIs, demonstrating its relevance for biomedical research and clinical practice. Method and material: It is demonstrated how a refined non-uniform prior density distribution can be obtained by means of the ME principle using empirical results from a few designs and tests using non-overlapping sets of examples. Results: Experimental results show that ME based priors improve the CIs when applied to four quite different simulated and two real world data sets. Conclusions: An empirically derived ME prior seems promising for improving the Bayesian CI for the unknown error rate of a designed classifier. (C) 2010 Elsevier B.V. All rights reserved.
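As a rough illustration of the holdout-test CI the abstract refers to, the sketch below computes a one-sided upper credibility bound for the error rate from the Beta posterior that a uniform prior produces. It is not the authors' method: the function names, the 95% level, and the `prior_a` / `prior_b` pseudo-counts are assumptions made here for illustration, and the informative Beta prior in the usage lines is only a stand-in for any non-uniform prior, not the maximum entropy prior of the paper.

```python
from math import comb


def beta_cdf_int(x, alpha, beta):
    """CDF of Beta(alpha, beta) at x, valid for positive integer alpha and beta.

    Uses the identity I_x(alpha, beta) = P(Binomial(alpha + beta - 1, x) >= alpha).
    """
    m = alpha + beta - 1
    return sum(comb(m, j) * x**j * (1 - x) ** (m - j) for j in range(alpha, m + 1))


def error_rate_upper_bound(errors, n_test, level=0.95, prior_a=1, prior_b=1):
    """One-sided upper credibility bound for the unknown error rate.

    prior_a = prior_b = 1 is the uniform prior; other positive integers give a
    Beta(prior_a, prior_b) prior (a hypothetical stand-in for an informative prior).
    """
    alpha = errors + prior_a            # posterior Beta parameters
    beta = n_test - errors + prior_b
    lo, hi = 0.0, 1.0
    for _ in range(60):                 # bisection on the monotone posterior CDF
        mid = (lo + hi) / 2
        if beta_cdf_int(mid, alpha, beta) < level:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


if __name__ == "__main__":
    # Same observed error rate (10%), different test set sizes.
    print(error_rate_upper_bound(5, 50))
    print(error_rate_upper_bound(50, 500))
    # Same small test set, with an assumed informative prior.
    print(error_rate_upper_bound(5, 50, prior_a=2, prior_b=18))
```

With the uniform prior the bound depends only on the test counts, so it tightens only as the test set grows; a non-uniform prior is the one other place where information can enter, which is the opening the abstract describes.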

Pages: 93-104

Page count: 12

50 items in total

[21] Improving Bayesian radiological profiling of waste drums using Dirichlet priors, Gaussian process priors, and hierarchical modeling
Laloy, Eric
Rogiers, Bart
Bielen, An
Borella, Alessandro
Boden, Sven
APPLIED RADIATION AND ISOTOPES, 2023, 194
[22] Improving Bayesian Classifier Using Vine Copula and Fuzzy Clustering Technique
Che-Ngoc H.
Nguyen-Trang T.
Huynh-Van H.
Vo-Van T.
Annals of Data Science, 2024, 11 (02) : 709 - 732
[23] Bayesian inference for an extended simple regression measurement error model using skewed priors
Rodrigues, Josemar
Bolfarine, Heleno
BAYESIAN ANALYSIS, 2007, 2 (02) : 349 - 364
[24] Improving Estimations of Spatial Distribution of Soil Respiration Using the Bayesian Maximum Entropy Algorithm and Soil Temperature as Auxiliary Data
Hu, Junguo
Zhou, Jian
Zhou, Guomo
Luo, Yiqi
Xu, Xiaojun
Li, Pingheng
Liang, Junyi
PLOS ONE, 2016, 11 (01)
[25] Improving fault delineation using maximum entropy multispectral coherence
Lyu, Bin
Qi, Jie
Sinha, Saurabh
Li, Jianjun
Marfurt, Kurt J.
INTERPRETATION-A JOURNAL OF SUBSURFACE CHARACTERIZATION, 2020, 8 (04) : T835 - T850
[26] Improving predictability of time series using maximum entropy methods
Chliamovitch, G.
Dupuis, A.
Golub, A.
Chopard, B.
EPL, 2015, 110 (01)
[27] Improving Persian POS Tagging Using the Maximum Entropy Model
Kardan, Ahmad A.
Imani, Maryam Bahojb
2014 IRANIAN CONFERENCE ON INTELLIGENT SYSTEMS (ICIS), 2014
[28] Analysis and Comparison of Kidney Stone Detection using Gaussian Maximum Likelihood Classifier and Bayesian Classifier with Improved Accuracy
Kishore, U.
Ramadevi, R.
CARDIOMETRY, 2022, (25) : 799 - 805
[29] Deflating Trees: Improving Bayesian Branch-Length Estimates using Informed Priors
Nelson, Bradley J.
Andersen, John J.
Brown, Jeremy M.
SYSTEMATIC BIOLOGY, 2015, 64 (03) : 441 - 447
[30] Empirical estimation of sequencing error rates using smoothing splines
Xuan Zhu
Jian Wang
Bo Peng
Sanjay Shete
BMC Bioinformatics, 17