Improving Bayesian credibility intervals for classifier error rates using maximum entropy empirical priors

Cited by: 4
Authors
Gustafsson, Mats G. [1]
Waman, Mikael [1,2,3]
Bolin, Ulrika Wickenberg [1]
Goransson, Hanna [1]
Fryknas, M. [1]
Andersson, Claes R. [1]
Isaksson, Anders [1]
Affiliations
[1] Uppsala Univ, Dept Med Sci, Acad Hosp, S-75185 Uppsala, Sweden
[2] Fraunhofer Chalmers Res Ctr Ind Math, SE-41288 Gothenburg, Sweden
[3] Univ Oxford, Comp Lab, Computat Biol Grp, Oxford OX1 3QD, England
Funding
Swedish Research Council
Keywords
Classifier design; Performance evaluation; Small sample learning; Decision support system; Diagnosis; Prognosis; LOGISTIC-REGRESSION; INFORMATION-THEORY; DECISION-SUPPORT; MICROARRAY DATA; PREDICTION; SELECTION; MODEL; VALIDATION; DIAGNOSIS; VARIANCE;
DOI
10.1016/j.artmed.2010.02.004
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Objective: Successful use of classifiers that learn to make decisions from a set of patient examples requires robust methods for performance estimation. Recently, many promising approaches for determining an upper bound for the error rate of a single classifier have been reported, but the Bayesian credibility interval (CI) obtained from a conventional holdout test still delivers one of the tightest bounds. However, the conventional Bayesian CI becomes unacceptably large in real-world applications where the test set sizes are less than a few hundred. The source of this problem is the fact that the CI is determined exclusively by the result on the test examples. In other words, no information at all is provided by the uniform prior density distribution employed, which reflects a complete lack of prior knowledge about the unknown error rate. Therefore, the aim of the study reported here was to study a maximum entropy (ME) based approach to improved prior knowledge and Bayesian CIs, demonstrating its relevance for biomedical research and clinical practice. Method and material: It is demonstrated how a refined non-uniform prior density distribution can be obtained by means of the ME principle using empirical results from a few designs and tests using non-overlapping sets of examples. Results: Experimental results show that ME based priors improve the CIs when applied to four quite different simulated and two real-world data sets. Conclusions: An empirically derived ME prior seems promising for improving the Bayesian CI for the unknown error rate of a designed classifier. © 2010 Elsevier B.V. All rights reserved.
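As a rough illustration of the idea in the abstract, the sketch below computes a one-sided Bayesian upper bound for an error rate from a holdout test, first with a uniform prior and then with a non-uniform prior. It is not the authors' method: the abstract does not state which constraints define their ME prior, so the sketch assumes a single mean constraint, for which the maximum entropy density on [0, 1] is a truncated exponential. The function names, the test counts (5 errors in 50 examples), and the prior mean of 0.1 are all hypothetical.

```python
import math

def credible_upper_bound(errors, n_test, log_prior=None, level=0.95, grid=20001):
    """Return b with P(error rate <= b | data) ~= level, using a grid posterior."""
    # Midpoint grid on (0, 1) so that log(e) and log(1 - e) are always finite.
    es = [(i + 0.5) / grid for i in range(grid)]
    log_post = []
    for e in es:
        # Binomial log-likelihood of observing `errors` mistakes in `n_test` cases.
        lp = errors * math.log(e) + (n_test - errors) * math.log(1.0 - e)
        if log_prior is not None:  # None means a uniform prior
            lp += log_prior(e)
        log_post.append(lp)
    peak = max(log_post)
    weights = [math.exp(lp - peak) for lp in log_post]
    total = sum(weights)
    acc = 0.0
    for e, w in zip(es, weights):
        acc += w
        if acc / total >= level:
            return e
    return 1.0

def me_rate_for_mean(mu, lo=1e-6, hi=500.0, iters=200):
    """Rate lam of the density proportional to exp(-lam * e) on [0, 1] with mean mu.

    ASSUMPTION: a single mean constraint; valid only for roughly 0.002 < mu < 0.5.
    """
    def mean(lam):
        return 1.0 / lam - 1.0 / math.expm1(lam)
    for _ in range(iters):  # bisection: the mean decreases as lam grows
        mid = 0.5 * (lo + hi)
        if mean(mid) > mu:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Hypothetical holdout result: 5 errors on 50 test examples.
ub_uniform = credible_upper_bound(5, 50)

# Hypothetical empirical prior: earlier designs and tests averaged a 10% error rate.
lam = me_rate_for_mean(0.10)
ub_me = credible_upper_bound(5, 50, log_prior=lambda e: -lam * e)

print(f"uniform prior upper bound: {ub_uniform:.3f}")
print(f"ME prior upper bound:      {ub_me:.3f}")
```

With these made-up numbers the informative prior pulls the upper bound below the uniform-prior bound, which is the qualitative effect the abstract describes; how much it tightens depends entirely on the assumed prior.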
Pages: 93-104
Number of pages: 12