Variational Inference MPC for Bayesian Model-based Reinforcement Learning

被引:0
|
作者
Okada, Masashi [1 ]
Taniguchi, Tadahiro [1 ,2 ]
机构
[1] Panason Corp, Kadoma, Osaka, Japan
[2] Ritsumeikan Univ, Kyoto, Japan
来源
关键词
model predictive control; variational inference; model-based reinforcement learning; PREDICTIVE CONTROL; OPTIMIZATION;
D O I
暂无
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
In recent studies on model-based reinforcement learning (MBRL), incorporating uncertainty in forward dynamics is a state-of-the-art strategy to enhance learning performance, making MBRLs competitive to cutting-edge model-free methods, especially in simulated robotics tasks. Probabilistic ensembles with trajectory sampling (PETS) is a leading type of MBRL, which employs Bayesian inference to dynamics modeling and model predictive control (MPC) with stochastic optimization via the cross entropy method (CEM). In this paper, we propose a novel extension to the uncertainty-aware MBRL. Our main contributions are twofold: Firstly, we introduce a variational inference MPC (VI-MPC), which reformulates various stochastic methods, including CEM, in a Bayesian fashion. Secondly, we propose a novel instance of the framework, called probabilistic action ensembles with trajectory sampling (PaETS). As a result, our Bayesian MBRL can involve multimodal uncertainties both in dynamics and optimal trajectories. In comparison to PETS, our method consistently improves asymptotic performance on several challenging locomotion tasks.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Nonparametric model-based reinforcement learning
    Atkeson, CG
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 10, 1998, 10 : 1008 - 1014
  • [22] Model-based Clustering with Noise: Bayesian Inference and Estimation
    H. Bensmail
    J. J. Meulman
    Journal of Classification, 2003, 20 : 049 - 076
  • [23] The ubiquity of model-based reinforcement learning
    Doll, Bradley B.
    Simon, Dylan A.
    Daw, Nathaniel D.
    CURRENT OPINION IN NEUROBIOLOGY, 2012, 22 (06) : 1075 - 1081
  • [24] Reduced-order model-based variational inference with normalizing flows for Bayesian elliptic inverse problems
    Wu, Zhizhang
    Zhang, Cheng
    Zhang, Zhiwen
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2024, 441
  • [25] Model-based clustering with noise: Bayesian inference and estimation
    Bensmail, H
    Meulman, JJ
    JOURNAL OF CLASSIFICATION, 2003, 20 (01) : 49 - 76
  • [26] Multiple model-based reinforcement learning
    Doya, K
    Samejima, K
    Katagiri, K
    Kawato, M
    NEURAL COMPUTATION, 2002, 14 (06) : 1347 - 1369
  • [27] Bayesian model-based inference of transcription factor activity
    Simon Rogers
    Raya Khanin
    Mark Girolami
    BMC Bioinformatics, 8
  • [28] Model-based Bayesian Inference for ROC Data Analysis
    Lei, Tianhu
    Bae, K. Ty
    MEDICAL IMAGING 2013: IMAGE PERCEPTION, OBSERVER PERFORMANCE, AND TECHNOLOGY ASSESSMENT, 2013, 8673
  • [29] A survey on model-based reinforcement learning
    Luo, Fan-Ming
    Xu, Tian
    Lai, Hang
    Chen, Xiong-Hui
    Zhang, Weinan
    Yu, Yang
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (02)
  • [30] Bayesian model-based inference of transcription factor activity
    Rogers, Simon
    Khanin, Raya
    Girolami, Mark
    BMC BIOINFORMATICS, 2007, 8 (Suppl 2)