Estimating Accuracy from Unlabeled Data: A Bayesian Approach

Cited by: 0
Authors
Platanios, Emmanouil Antonios [1]
Dubey, Avinava [1]
Mitchell, Tom [1]
Affiliations
[1] Carnegie Mellon Univ, Pittsburgh, PA 15213 USA
Keywords
DISTRIBUTIONS;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We consider the question of how unlabeled data can be used to estimate the true accuracy of learned classifiers, and the related question of how outputs from several classifiers performing the same task can be combined based on their estimated accuracies. To answer these questions, we first present a simple graphical model that performs well in practice. We then provide two nonparametric extensions to it that improve its performance. Experiments on two real-world data sets produce accuracy estimates within a few percent of the true accuracy, using solely unlabeled data. Our models also outperform existing state-of-the-art solutions in both estimating accuracies and combining multiple classifier outputs.
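To make the two questions in the abstract concrete, here is a minimal sketch of one simple way to estimate accuracies from agreement alone and then combine outputs. It is not the graphical model or the nonparametric extensions the abstract describes. It assumes three binary classifiers whose errors are independent, each better than chance, and balanced classes; the function names are hypothetical. Under those assumptions the agreement rate between classifiers i and j satisfies 2*a_ij - 1 = (2*acc_i - 1) * (2*acc_j - 1), which can be solved in closed form.

```python
import math


def agreement_rate(preds_a, preds_b):
    """Fraction of unlabeled examples on which two classifiers agree."""
    if len(preds_a) != len(preds_b) or not preds_a:
        raise ValueError("prediction lists must be non-empty and equal length")
    return sum(a == b for a, b in zip(preds_a, preds_b)) / len(preds_a)


def estimate_accuracies(a12, a13, a23):
    """Solve for three accuracies from pairwise agreement rates.

    Assumption (not from the source): errors are independent and every
    classifier is better than chance, so 2*a_ij - 1 = u_i * u_j with
    u_i = 2*acc_i - 1 > 0.
    """
    c12, c13, c23 = 2 * a12 - 1, 2 * a13 - 1, 2 * a23 - 1
    if min(c12, c13, c23) <= 0:
        raise ValueError("agreement rates must exceed 0.5 for this sketch")
    u1 = math.sqrt(c12 * c13 / c23)
    u2 = math.sqrt(c12 * c23 / c13)
    u3 = math.sqrt(c13 * c23 / c12)
    # Noisy agreement rates can push an estimate above 1, so clip it.
    return tuple(min(1.0, (1 + u) / 2) for u in (u1, u2, u3))


def weighted_vote(votes, accuracies):
    """Combine binary votes (0 or 1) using log-odds weights.

    Each classifier's vote counts in proportion to log(acc / (1 - acc)),
    the weighting that is appropriate under the independence and
    balanced-class assumptions stated above.
    """
    score = 0.0
    for vote, acc in zip(votes, accuracies):
        acc = min(max(acc, 1e-6), 1 - 1e-6)  # avoid division by zero
        weight = math.log(acc / (1 - acc))
        score += weight if vote == 1 else -weight
    return 1 if score > 0 else 0
```

In use, `agreement_rate` would be computed for each pair of classifiers on the unlabeled data, the three rates passed to `estimate_accuracies`, and the resulting estimates passed to `weighted_vote` for each example. When errors are correlated, the independence assumption fails and these estimates can be badly biased.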
Pages: 10
Related papers
  • [1] Estimating Accuracy from Unlabeled Data: A Probabilistic Logic Approach
    Platanios, Emmanouil A.
    Poon, Hoifung
    Mitchell, Tom M.
    Horvitz, Eric
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
  • [2] Estimating Accuracy from Unlabeled Data
    Platanios, Emmanouil Antonios
    Blum, Avrim
    Mitchell, Tom
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2014, : 682 - 691
  • [3] A Bayesian Approach for Estimating Causal Effects from Observational Data
    Pensar, Johan
    Talvitie, Topi
    Hyttinen, Antti
    Koivisto, Mikko
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 5395 - 5402
  • [4] Estimating genealogies from unlinked marker data: A Bayesian approach
    Gasbarra, Dario
    Pirinen, Matti
    Sillanpaa, Mikko J.
    Salmela, Elina
    Arjas, Elja
    THEORETICAL POPULATION BIOLOGY, 2007, 72 (03) : 305 - 322
  • [5] Estimating genealogies from linked marker data: a Bayesian approach
    Gasbarra, Dario
    Pirinen, Matti
    Sillanpaa, Mikko J.
    Arjas, Elja
    BMC BIOINFORMATICS, 2007, 8 (1)
  • [6] A Bayesian approach to estimating tectonic stress from seismological data
    Arnold, Richard
    Townend, John
    GEOPHYSICAL JOURNAL INTERNATIONAL, 2007, 170 (03) : 1336 - 1356
  • [7] A Bayesian approach for estimating bioterror attacks from patient data
    Ray, J.
    Marzouk, Y. M.
    Najm, H. N.
    STATISTICS IN MEDICINE, 2011, 30 (02) : 101 - 126
  • [8] Estimating genealogies from linked marker data: a Bayesian approach
    Dario Gasbarra
    Matti Pirinen
    Mikko J Sillanpää
    Elja Arjas
    BMC Bioinformatics, 8
  • [9] Detecting Errors and Estimating Accuracy on Unlabeled Data with Self-training Ensembles
    Chen, Jiefeng
    Liu, Frederick
    Avci, Besim
    Wu, Xi
    Liang, Yingyu
    Jha, Somesh
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [10] A Bayesian Semisupervised Approach to Keyword Extraction with Only Positive and Unlabeled Data
    Wang, Guanshen
    Cheng, Yichen
    Xia, Yusen
    Ling, Qiang
    Wang, Xinlei
    INFORMS JOURNAL ON COMPUTING, 2023, 35 (03) : 675 - 691