Monte Carlo Hierarchical Model Learning

被引：0

作者：

Menashe, Jacob ^{[1
]}

Stone, Peter ^{[1
]}

机构：

[1] Univ Texas Austin, Austin, TX 78712 USA

来源：

PROCEEDINGS OF THE 2015 INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS & MULTIAGENT SYSTEMS (AAMAS'15) | 2015年

关键词：

Single and multi-agent learning techniques; Reinforcement Learning; Factored Domains; Model Learning; Hierarchical Skill Learning; Monte Carlo Methods;

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Reinforcement learning (RL) is a well-established paradigm for enabling autonomous agents to learn from experience. To enable RL to scale to any but the smallest domains, it is necessary to make use of abstraction and generalization of the state-action space, for example with a factored representation. However, to make effective use of such a representation, it is necessary to determine which state variables are relevant in which situations. In this work, we introduce T-UCT, a novel model-based RL approach for learning and exploiting the dynamics of structured hierarchical environments. When learning the dynamics while acting, a partial or inaccurate model may do more harm than good. T-UCT uses graph-based planning and Monte Carlo simulations to exploit models that may be incomplete or inaccurate, allowing it to both maximize cumulative rewards and ignore trajectories that are unlikely to succeed. T-UCT incorporates new experiences in the form of more accurate plans that span a greater area of the state space. T-UCT is fully implemented and compared empirically against B-VISA, the best known prior approach to the same problem. We show that T-UCT learns hierarchical models with fewer samples than B-VISA and that this effect is magnified at deeper levels of hierarchical complexity.

引用

页码：1985 / 1986

页数：2

共 50 条

[31] Prediction of electron beam parameters of a Monte Carlo model using machine learning
Wagner, A.
Boni, K. Brou
Rault, E.
Crop, F.
Lacornerie, T.
Van Gestel, D.
Reynaert, N.
RADIOTHERAPY AND ONCOLOGY, 2020, 152 : S716 - S717
[32] Assessing Protein Loop Flexibility by Hierarchical Monte Carlo Sampling
Nilmeier, Jerome
Hua, Lan
Coutsias, Evangelos A.
Jacobson, Matthew P.
JOURNAL OF CHEMICAL THEORY AND COMPUTATION, 2011, 7 (05) : 1564 - 1574
[33] Dynamically Rescaled Hamiltonian Monte Carlo for Bayesian Hierarchical Models
Kleppe, Tore Selland
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2019, 28 (03) : 493 - 507
[34] Hamiltonian Monte Carlo and Borrowing Strength in Hierarchical Inverse Problems
Nagel, Joseph B.
Sudret, Bruno
ASCE-ASME JOURNAL OF RISK AND UNCERTAINTY IN ENGINEERING SYSTEMS PART A-CIVIL ENGINEERING, 2016, 2 (03):
[35] Hierarchical models, data augmentation, and Markov chain Monte Carlo
van Dyk, DA
STATISTICAL CHALLENGES IN ASTRONOMY, 2003, : 41 - 56
[36] Constrained Hierarchical Monte Carlo Belief-State Planning
Jamgochian, Arec (arec@stanford.edu), 1600, Institute of Electrical and Electronics Engineers Inc.
[37] Hierarchical Monte Carlo Tree Search for Latent Skill Planning
Pei, Yue
2023 2ND ASIA CONFERENCE ON ALGORITHMS, COMPUTING AND MACHINE LEARNING, CACML 2023, 2023, : 6 - 12
[38] A Hierarchical Automatic Stopping Condition for Monte Carlo Global Illumination
Dammertz, Holger
Hanika, Johannes
Keller, Alexander
Lensch, Hendrik P. A.
WSCG 2010: FULL PAPERS PROCEEDINGS, 2010, : 159 - +
[39] A comparison between the Monte Carlo shell model method and the Monte Carlo spectroscopic method
Puddu, G
JOURNAL OF PHYSICS G-NUCLEAR AND PARTICLE PHYSICS, 2003, 29 (09) : 2179 - 2185
[40] A Monte Carlo approach for the bouncer model
Diaz, Gabriel
Yoshida, Makoto
Leonel, Edson D.
PHYSICS LETTERS A, 2017, 381 (42) : 3636 - 3640

← 1 2 3 4 5 →