Near-optimal Bayesian active learning with correlated and noisy tests

Authors
Chen, Yuxin [1]
Hassani, S. Hamed [2]
Krause, Andreas [3]
Affiliations
[1] CALTECH, Pasadena, CA 91125 USA
[2] Univ Penn, Philadelphia, PA 19104 USA
[3] Swiss Fed Inst Technol, Zurich, Switzerland
Source
ELECTRONIC JOURNAL OF STATISTICS | 2017 / Vol. 11 / No. 2
Funding
European Research Council;
Keywords
Bayesian active learning; information gathering; decision making; noisy observation; approximation algorithms;
DOI
10.1214/17-EJS1336SI
Chinese Library Classification
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Subject classification codes
020208 ; 070103 ; 0714 ;
Abstract
We consider the Bayesian active learning and experimental design problem, where the goal is to learn the value of some unknown target variable through a sequence of informative, noisy tests. In contrast to prior work, we focus on the challenging, yet practically relevant setting where test outcomes can be conditionally dependent given the hidden target variable. Under such assumptions, common heuristics, such as greedily performing tests that maximize the reduction in uncertainty of the target, often perform poorly. We propose ECED, a novel, efficient active learning algorithm, and prove strong theoretical guarantees that hold with correlated, noisy tests. Rather than directly optimizing the prediction error, at each step, ECED picks the test that maximizes the gain in a surrogate objective, which takes into account the dependencies between tests. Our analysis relies on an information-theoretic auxiliary function to track the progress of ECED, and utilizes adaptive submodularity to attain the approximation bound. We demonstrate strong empirical performance of ECED on three problem instances, including a Bayesian experimental design task intended to distinguish among economic theories of how people make risky decisions, an active preference learning task via pairwise comparisons, and a third application on pool-based active learning.
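The baseline heuristic the abstract mentions, greedily choosing the test that most reduces uncertainty about the target, can be sketched as follows. This is not ECED, whose surrogate objective is not specified in this record. It is a minimal illustration that assumes a discrete target, a finite set of outcomes per test, and tests that are conditionally independent given the target (the simpler setting the paper contrasts itself with). All names (`expected_information_gain`, `pick_greedy_test`, the `tests` dictionary) are hypothetical.

```python
import math


def entropy(p):
    """Shannon entropy (in bits) of a discrete distribution."""
    return -sum(q * math.log2(q) for q in p if q > 0)


def expected_information_gain(prior, likelihood):
    """Expected reduction in entropy of the target after one test.

    prior[y]         = P(target = y)
    likelihood[y][x] = P(test outcome = x | target = y)   (assumed known)
    """
    n_outcomes = len(likelihood[0])
    expected_posterior_entropy = 0.0
    for x in range(n_outcomes):
        # Joint probability of (target = y, outcome = x) for every y.
        joint = [prior[y] * likelihood[y][x] for y in range(len(prior))]
        p_x = sum(joint)
        if p_x > 0:
            posterior = [j / p_x for j in joint]
            expected_posterior_entropy += p_x * entropy(posterior)
    return entropy(prior) - expected_posterior_entropy


def pick_greedy_test(prior, tests):
    """Return the name of the test with the largest expected information gain."""
    return max(tests, key=lambda name: expected_information_gain(prior, tests[name]))


# Hypothetical example: two equally likely target values, two candidate tests.
prior = [0.5, 0.5]
tests = {
    "informative": [[0.9, 0.1], [0.1, 0.9]],    # outcome agrees with target 90% of the time
    "uninformative": [[0.5, 0.5], [0.5, 0.5]],  # outcome is independent of the target
}
best = pick_greedy_test(prior, tests)
```

Here the heuristic selects the "informative" test, because the other test leaves the posterior equal to the prior. According to the abstract, this kind of greedy uncertainty reduction often performs poorly once test outcomes are conditionally dependent given the target, which is the setting ECED is designed for.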
Pages: 4969-5017
Number of pages: 49