Monte Carlo Hierarchical Model Learning

被引:0
|
作者
Menashe, Jacob [1 ]
Stone, Peter [1 ]
机构
[1] Univ Texas Austin, Austin, TX 78712 USA
关键词
Single and multi-agent learning techniques; Reinforcement Learning; Factored Domains; Model Learning; Hierarchical Skill Learning; Monte Carlo Methods;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Reinforcement learning (RL) is a well-established paradigm for enabling autonomous agents to learn from experience. To enable RL to scale to any but the smallest domains, it is necessary to make use of abstraction and generalization of the state-action space, for example with a factored representation. However, to make effective use of such a representation, it is necessary to determine which state variables are relevant in which situations. In this work, we introduce T-UCT, a novel model-based RL approach for learning and exploiting the dynamics of structured hierarchical environments. When learning the dynamics while acting, a partial or inaccurate model may do more harm than good. T-UCT uses graph-based planning and Monte Carlo simulations to exploit models that may be incomplete or inaccurate, allowing it to both maximize cumulative rewards and ignore trajectories that are unlikely to succeed. T-UCT incorporates new experiences in the form of more accurate plans that span a greater area of the state space. T-UCT is fully implemented and compared empirically against B-VISA, the best known prior approach to the same problem. We show that T-UCT learns hierarchical models with fewer samples than B-VISA and that this effect is magnified at deeper levels of hierarchical complexity.
引用
收藏
页码:1985 / 1986
页数:2
相关论文
共 50 条
  • [31] Prediction of electron beam parameters of a Monte Carlo model using machine learning
    Wagner, A.
    Boni, K. Brou
    Rault, E.
    Crop, F.
    Lacornerie, T.
    Van Gestel, D.
    Reynaert, N.
    RADIOTHERAPY AND ONCOLOGY, 2020, 152 : S716 - S717
  • [32] Assessing Protein Loop Flexibility by Hierarchical Monte Carlo Sampling
    Nilmeier, Jerome
    Hua, Lan
    Coutsias, Evangelos A.
    Jacobson, Matthew P.
    JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2011, 7 (05) : 1564 - 1574
  • [34] Hamiltonian Monte Carlo and Borrowing Strength in Hierarchical Inverse Problems
    Nagel, Joseph B.
    Sudret, Bruno
    ASCE-ASME JOURNAL OF RISK AND UNCERTAINTY IN ENGINEERING SYSTEMS PART A-CIVIL ENGINEERING, 2016, 2 (03):
  • [35] Hierarchical models, data augmentation, and Markov chain Monte Carlo
    van Dyk, DA
    STATISTICAL CHALLENGES IN ASTRONOMY, 2003, : 41 - 56
  • [36] Constrained Hierarchical Monte Carlo Belief-State Planning
    Jamgochian, Arec (arec@stanford.edu), 1600, Institute of Electrical and Electronics Engineers Inc.
  • [37] Hierarchical Monte Carlo Tree Search for Latent Skill Planning
    Pei, Yue
    2023 2ND ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING, CACML 2023, 2023, : 6 - 12
  • [38] A Hierarchical Automatic Stopping Condition for Monte Carlo Global Illumination
    Dammertz, Holger
    Hanika, Johannes
    Keller, Alexander
    Lensch, Hendrik P. A.
    WSCG 2010: FULL PAPERS PROCEEDINGS, 2010, : 159 - +
  • [40] A Monte Carlo approach for the bouncer model
    Diaz, Gabriel
    Yoshida, Makoto
    Leonel, Edson D.
    PHYSICS LETTERS A, 2017, 381 (42) : 3636 - 3640