Approximate Dynamic Programming Using Model-Free Bellman Residual Elimination

Authors
Bethke, Brett
How, Jonathan P.
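Affiliations
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Subject classification code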
0812 ;
Abstract
This paper presents a modification to the method of Bellman Residual Elimination (BRE) [1], [2] for approximate dynamic programming. While prior work on BRE has focused on learning an approximate policy for an underlying Markov Decision Process (MDP) when the state transition model of the MDP is known, this work proposes a model-free variant of BRE that does not require knowledge of the state transition model. Instead, state trajectories of the system, generated using simulation and/or observations of the real system in operation, are used to build stochastic approximations of the quantities needed to carry out the BRE algorithm. The resulting algorithm can be shown to converge to the policy produced by the nominal, model-based BRE algorithm in the limit of observing an infinite number of trajectories. To validate the performance of the approach, we compare model-based and model-free BRE against LSPI [3], a well-known approximate dynamic programming algorithm. Measuring performance in terms of both computational complexity and policy quality, we present results showing that BRE performs at least as well as, and sometimes significantly better than, LSPI on a standard benchmark problem.
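The idea of replacing a model-based expectation with a stochastic approximation built from observed trajectories can be illustrated with a minimal Python sketch. It is not the BRE algorithm from the paper, whose details the abstract does not give. It only compares the Bellman residual of a fixed candidate cost-to-go function computed from a known transition matrix with the same residual estimated from simulated state trajectories. The three-state chain `P`, the stage costs `g`, the candidate values `J`, the discount factor `alpha`, and all function names are assumptions made up for illustration.

```python
import random


def model_based_residual(P, g, J, alpha):
    """Bellman residual J(i) - (g(i) + alpha * E[J(next) | i]) using the known model P."""
    n = len(g)
    return [
        J[i] - (g[i] + alpha * sum(P[i][j] * J[j] for j in range(n)))
        for i in range(n)
    ]


def simulate(P, start, steps, rng):
    """Generate one state trajectory (stands in for simulation or real observations)."""
    traj = [start]
    for _ in range(steps):
        current = traj[-1]
        traj.append(rng.choices(range(len(P)), weights=P[current])[0])
    return traj


def model_free_residual(trajectories, g, J, alpha):
    """Same residual, with the expectation replaced by a sample average over
    observed transitions; the transition matrix is never used here."""
    n = len(g)
    total = [0.0] * n   # running sum of J(next state) per current state
    count = [0] * n     # number of observed transitions out of each state
    for traj in trajectories:
        for s, s_next in zip(traj, traj[1:]):
            total[s] += J[s_next]
            count[s] += 1
    return [
        J[i] - (g[i] + alpha * total[i] / count[i]) if count[i] else None
        for i in range(n)
    ]


# Hypothetical problem data (assumed for illustration only).
P = [[0.9, 0.1, 0.0],
     [0.1, 0.8, 0.1],
     [0.0, 0.2, 0.8]]
g = [1.0, 2.0, 0.0]
J = [10.0, 12.0, 8.0]
alpha = 0.95

rng = random.Random(0)
trajectories = [simulate(P, start=k % 3, steps=200, rng=rng) for k in range(200)]

exact = model_based_residual(P, g, J, alpha)
approx = model_free_residual(trajectories, g, J, alpha)
max_gap = max(abs(a - e) for a, e in zip(approx, exact))
print("largest gap between model-free and model-based residuals:", max_gap)
```

With more or longer trajectories the sample averages approach the exact expectations, which mirrors in miniature the limiting behavior the abstract describes for the model-free variant.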
Pages: 4146 - 4151
Page count: 6
Related papers
50 in total
  • [41] An Improved Reinforcement Learning Based Heuristic Dynamic Programming Algorithm for Model-Free Optimal Control
    Li, Jia
    Yuan, Zhaolin
    Ban, Xiaojuan
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 282 - 294
  • [42] Intelligent Questionnaires Using Approximate Dynamic Programming
    Logé F.
    Le Pennec E.
    Amadou-Boubacar H.
    i-com, 2021, 19 (03) : 227 - 237
  • [43] Empirical model based control of nonlinear processes using approximate dynamic programming
    Lee, JM
    Lee, JH
    PROCEEDINGS OF THE 2004 AMERICAN CONTROL CONFERENCE, VOLS 1-6, 2004, : 3041 - 3046
  • [44] Dynamic Site Layout Planning Using Approximate Dynamic Programming
    El-Rayes, Khaled
    Said, Hisham
    JOURNAL OF COMPUTING IN CIVIL ENGINEERING, 2009, 23 (02) : 119 - 127
  • [45] Model-free frequency regulation in islanded microgrids: An event-triggered adaptive dynamic programming approach
    Shi, Jing
    Peng, Chen
    Zhang, Jin
    Xie, Xiangpeng
    International Journal of Electrical Power and Energy Systems, 2024, 155
  • [47] Stackelberg games for model-free continuous-time stochastic systems based on adaptive dynamic programming
    Liu, Xikui
    Ge, Yingying
    Li, Yan
    APPLIED MATHEMATICS AND COMPUTATION, 2019, 363
  • [48] Adaptive Dynamic Programming for Model-Free Global Stabilization of Control Constrained Continuous-Time Systems
    Rizvi, Syed Ali Asad
    Lin, Zongli
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (02) : 1048 - 1060
  • [49] AUTOMATED STRUCTURAL DYNAMIC MODELLING USING MODEL-FREE HEALTH MONITORING RESULTS
    Tondut, Jeanne
    Chase, J. Geoffrey
    Zhou, Cong
    BULLETIN OF THE NEW ZEALAND SOCIETY FOR EARTHQUAKE ENGINEERING, 2020, 53 (04) : 189 - 202
  • [50] Self-Triggered Adaptive Dynamic Programming for Model-Free Nonlinear Systems via Generalized Fuzzy Hyperbolic Model
    Ming, Zhongyang
    Zhang, Huaguang
    Yan, Yuqing
    Sun, Jiayue
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (05) : 2792 - 2801