Estimating Accuracy from Unlabeled Data: A Bayesian Approach

Cited by: 0
Authors
Platanios, Emmanouil Antonios [1]
Dubey, Avinava [1]
Mitchell, Tom [1]
Affiliations
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
Keywords
DISTRIBUTIONS;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We consider the question of how unlabeled data can be used to estimate the true accuracy of learned classifiers, and the related question of how outputs from several classifiers performing the same task can be combined based on their estimated accuracies. To answer these questions, we first present a simple graphical model that performs well in practice. We then provide two nonparametric extensions to it that improve its performance. Experiments on two real-world data sets produce accuracy estimates within a few percent of the true accuracy, using solely unlabeled data. Our models also outperform existing state-of-the-art solutions both in estimating accuracies and in combining multiple classifier outputs.
Pages: 10