Optimal learning with a local parametric belief model

被引:0
|
作者
Bolong Cheng
Arta Jamshidi
Warren B. Powell
机构
[1] Princeton University,Department of Electrical Engineering
[2] Princeton University,Department of Operations Research and Financial Engineering
来源
关键词
Ranking and selection; Optimal learning; Local parametric model; Stochastic search;
D O I
暂无
中图分类号
学科分类号
摘要
We are interested in maximizing smooth functions where observations are noisy and expensive to compute, as might arise in computer simulations or laboratory experimentations. We derive a knowledge gradient policy, which chooses measurements which maximize the expected value of information, while using a locally parametric belief model that uses linear approximations with radial basis functions. The method uses a compact representation of the function which avoids storing the entire history, as is typically required by nonparametric methods. Our technique uses the expected value of a measurement in terms of its ability to improve our estimate of the optimum, capturing correlations in our beliefs about neighboring regions of the function, without posing any assumptions on the global shape of the underlying function a priori. Experimental work suggests that the method adapts to a range of arbitrary, continuous functions, and appears to reliably find the optimal solution. Moreover, the policy is shown to be asymptotically optimal.
引用
收藏
页码:401 / 425
页数:24
相关论文
共 50 条
  • [21] Optimal N-Parametric Type Iterative Learning Control
    Ali Madady
    Hamid-Reza Reza-Alikhani
    Sara Zamiri
    International Journal of Control, Automation and Systems, 2018, 16 : 2187 - 2202
  • [22] Optimal N-Parametric Type Iterative Learning Control
    Madady, Ali
    Reza-Alikhani, Hamid-Reza
    Zamiri, Sara
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2018, 16 (05) : 2187 - 2202
  • [23] A Bayesian belief network model of a virtual learning community
    Daniel, Ben K.
    Schwier, Richard A.
    International Journal of Web Based Communities, 2007, 3 (02) : 151 - 169
  • [24] A belief-focused process model of organizational learning
    Williams, APO
    JOURNAL OF MANAGEMENT STUDIES, 2001, 38 (01) : 67 - 85
  • [25] SOCIAL-LEARNING THEORY AND THE HEALTH BELIEF MODEL
    ROSENSTOCK, IM
    STRECHER, VJ
    BECKER, MH
    HEALTH EDUCATION QUARTERLY, 1988, 15 (02): : 175 - 183
  • [26] OPTIMAL LOCAL PROPENSITIES FOR MODEL PROTEINS
    GOVINDARAJAN, S
    GOLDSTEIN, RA
    PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 22 (04): : 413 - 418
  • [27] Local bandit approximation for optimal learning problems
    Duff, MO
    Barto, AG
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 9: PROCEEDINGS OF THE 1996 CONFERENCE, 1997, 9 : 1019 - 1025
  • [28] Optimal Belief Approximation
    Leike, Reimar H.
    Ensslin, Torsten A.
    ENTROPY, 2017, 19 (08):
  • [29] A parametric model for local dependence among test items
    Hoskens, M
    DeBoeck, P
    PSYCHOLOGICAL METHODS, 1997, 2 (03) : 261 - 277
  • [30] Reinforcement Learning with Partial Parametric Model Knowledge
    Wang, Shuyuan
    Loewen, Philip D.
    Lawrence, Nathan P.
    Forbes, Michael G.
    Gopaluni, R. Bhushan
    IFAC PAPERSONLINE, 2023, 56 (02): : 8012 - 8017