Improving Bayesian credibility intervals for classifier error rates using maximum entropy empirical priors

被引:4
|
作者
Gustafsson, Mats G. [1 ]
Waman, Mikael [1 ,2 ,3 ]
Bolin, Ulrika Wickenberg [1 ]
Goransson, Hanna [1 ]
Fryknas, M. [1 ]
Andersson, Claes R. [1 ]
Isaksson, Anders [1 ]
机构
[1] Uppsala Univ, Dept Med Sci, Acad Hosp, S-75185 Uppsala, Sweden
[2] Fraunhofer Chalmers Res Ctr Ind Math, SE-41288 Gothenburg, Sweden
[3] Univ Oxford, Comp Lab, Computat Biol Grp, Oxford OX1 3QD, England
基金
瑞典研究理事会;
关键词
Classifier design; Performance evaluation; Small sample learning; Decision support system; Diagnosis; Prognosis; LOGISTIC-REGRESSION; INFORMATION-THEORY; DECISION-SUPPORT; MICROARRAY DATA; PREDICTION; SELECTION; MODEL; VALIDATION; DIAGNOSIS; VARIANCE;
D O I
10.1016/j.artmed.2010.02.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Objective: Successful use of classifiers that learn to make decisions from a set of patient examples require robust methods for performance estimation. Recently many promising approaches for determination of an upper bound for the error rate of a single classifier have been reported but the Bayesian credibility interval (Cl) obtained from a conventional holdout test still delivers one of the tightest bounds. The conventional Bayesian CI becomes unacceptably large in real world applications where the test set sizes are less than a few hundred. The source of this problem is that fact that the Cl is determined exclusively by the result on the test examples. In other words, there is no information at all provided by the uniform prior density distribution employed which reflects complete lack of prior knowledge about the unknown error rate. Therefore, the aim of the study reported here was to study a maximum entropy (ME) based approach to improved prior knowledge and Bayesian CIs, demonstrating its relevance for biomedical research and clinical practice. Method and material: It is demonstrated how a refined non-uniform prior density distribution can be obtained by means of the ME principle using empirical results from a few designs and tests using non-overlapping sets of examples. Results: Experimental results show that ME based priors improve the CIs when employed to four quite different simulated and two real world data sets. Conclusions: An empirically derived ME prior seems promising for improving the Bayesian Cl for the unknown error rate of a designed classifier. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:93 / 104
页数:12
相关论文
共 50 条
  • [41] Improving Semantic Information Retrieval Using Multinomial Naive Bayes Classifier and Bayesian Networks
    Chebil, Wiem
    Wedyan, Mohammad
    Alazab, Moutaz
    Alturki, Ryan
    Elshaweesh, Omar
    INFORMATION, 2023, 14 (05)
  • [42] Space-time mapping of soil salinity using probabilistic bayesian maximum entropy
    A. Douaik
    M. van Meirvenne
    T. Tóth
    M. Serre
    Stochastic Environmental Research and Risk Assessment, 2004, 18 : 219 - 227
  • [43] Space-time mapping of soil salinity using probabilistic bayesian maximum entropy
    Douaik, A
    van Meirvenne, M
    Tóth, T
    Serre, M
    STOCHASTIC ENVIRONMENTAL RESEARCH AND RISK ASSESSMENT, 2004, 18 (04) : 219 - 227
  • [44] Analysis of Indoor Radon Data Using Bayesian, Random Binning, and Maximum Entropy Methods
    Pylak, Maciej
    Fornalski, Krzysztof Wojciech
    Reszczynska, Joanna
    Kukulski, Piotr
    Waligorski, Michael P. R.
    Dobrzynski, Ludwik
    DOSE-RESPONSE, 2021, 19 (02):
  • [45] Modeling a syphilis outbreak through space and time using the Bayesian maximum entropy approach
    Law, Dionne C. Gesink
    Bernstein, Kyle T.
    Serre, Marc L.
    Schumacher, Christina M.
    Leone, Peter A.
    Zenilman, Jonathan M.
    Miller, William C.
    Rompalo, Anne M.
    ANNALS OF EPIDEMIOLOGY, 2006, 16 (11) : 797 - 804
  • [46] Geostatistical space-time mapping of house prices using Bayesian maximum entropy
    Hayunga, Darren K.
    Kolovos, Alexander
    INTERNATIONAL JOURNAL OF GEOGRAPHICAL INFORMATION SCIENCE, 2016, 30 (12) : 2339 - 2354
  • [47] THE DETERMINATION OF NUCLEAR-CHARGE DISTRIBUTIONS USING A BAYESIAN MAXIMUM-ENTROPY METHOD
    MACAULAY, VA
    BUCK, B
    NUCLEAR PHYSICS A, 1995, 591 (01) : 85 - 103
  • [48] Bayesian Reliability Estimation for Deteriorating Systems with Limited Samples Using the Maximum Entropy Approach
    Xiao, Ning-Cong
    Li, Yan-Feng
    Wang, Zhonglai
    Peng, Weiwen
    Huang, Hong-Zhong
    ENTROPY, 2013, 15 (12) : 5492 - 5509
  • [49] Ridge estimation in linear mixed measurement error models using generalized maximum entropy
    Janamiri, Fariba
    Rasekh, Abdolrahman
    Chaji, Alireza
    Babadi, Babak
    STATISTICS, 2022, 56 (05) : 1095 - 1112
  • [50] Improving Biochemical Named Entity Recognition Using PSO Classifier Selection and Bayesian Combination Methods
    Akkasi, Abbas
    Varoglu, Ekrem
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2017, 14 (06) : 1327 - 1338