Approximate Dynamic Programming Using Model-Free Bellman Residual Elimination

Cited by: 0
Authors
Bethke, Brett
How, Jonathan P.
Affiliations
Keywords
DOI
Not available
Chinese Library Classification number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
This paper presents a modification to the method of Bellman Residual Elimination (BRE) [1], [2] for approximate dynamic programming. While prior work on BRE has focused on learning an approximate policy for an underlying Markov Decision Process (MDP) when the state transition model of the MDP is known, this work proposes a model-free variant of BRE that does not require knowledge of the state transition model. Instead, state trajectories of the system, generated using simulation and/or observations of the real system in operation, are used to build stochastic approximations of the quantities needed to carry out the BRE algorithm. The resulting algorithm can be shown to converge to the policy produced by the nominal, model-based BRE algorithm in the limit of observing an infinite number of trajectories. To validate the performance of the approach, we compare model-based and model-free BRE against LSPI [3], a well-known approximate dynamic programming algorithm. Measuring performance in terms of both computational complexity and policy quality, we present results showing that BRE performs at least as well as, and sometimes significantly better than, LSPI on a standard benchmark problem.
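The core idea in the abstract, replacing a known transition model with averages over sampled transitions, can be illustrated with a minimal sketch. This is not the authors' BRE algorithm: it only compares a model-based Bellman residual with a sampled estimate of the same quantity for a fixed policy. The 3-state chain, the costs, the discount factor, the candidate value function, and all function names are hypothetical and chosen purely for illustration.

```python
import random

# Hypothetical 3-state Markov chain under a fixed policy (illustrative only).
# P[s][t] is the probability of moving from state s to state t.
P = [
    [0.7, 0.3, 0.0],
    [0.1, 0.6, 0.3],
    [0.0, 0.2, 0.8],
]
COST = [1.0, 0.5, 0.0]   # assumed per-state cost
ALPHA = 0.9              # assumed discount factor


def sample_next_state(state, rng):
    """Stand-in for a simulator or the real system: returns one successor."""
    return rng.choices(range(len(P)), weights=P[state])[0]


def model_based_residual(values, state):
    """Bellman residual at `state`, using the transition model P directly."""
    expected_next = sum(p * v for p, v in zip(P[state], values))
    return values[state] - (COST[state] + ALPHA * expected_next)


def model_free_residual(values, state, num_samples, rng):
    """Same residual, with the expectation replaced by a sample average."""
    total = 0.0
    for _ in range(num_samples):
        total += values[sample_next_state(state, rng)]
    return values[state] - (COST[state] + ALPHA * total / num_samples)


rng = random.Random(0)
candidate_values = [5.0, 3.0, 1.0]  # arbitrary candidate value function
exact = model_based_residual(candidate_values, 1)
for n in (10, 1000, 20000):
    approx = model_free_residual(candidate_values, 1, n, rng)
    print(f"samples={n:6d}  model-based={exact:.4f}  model-free={approx:.4f}")
```

As the number of sampled transitions grows, the sampled residual approaches the model-based one, which mirrors (in a much simpler setting) the convergence behavior the abstract describes for an infinite number of trajectories.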
Pages: 4146-4151
Page count: 6
Related papers
50 in total
  • [1] Approximate Dynamic Programming Using Bellman Residual Elimination and Gaussian Process Regression
    Bethke, Brett
    How, Jonathan P.
    2009 AMERICAN CONTROL CONFERENCE, VOLS 1-9, 2009, : 745 - +
  • [2] Model-free approximate dynamic programming schemes for linear systems
    Al-Tamimi, Asma
    Vrabie, Draguna
    Abu-Khalaf, Murad
    Lewis, Frank L.
    2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 371 - +
  • [3] An Approximate Dynamic Programming Approach for Model-free Control of Switched Systems
    Lu, Wenjie
    Ferrari, Silvia
    2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013, : 3837 - 3844
  • [4] Online and Model-Free Supplementary Learning Control Based on Approximate Dynamic Programming
    Guo, Wentao
    Liu, Feng
    Si, Jennie
    Mei, Shengwei
    26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 1316 - 1321
  • [5] Model-Free Approximate Dynamic Programming for Continuous-Time Linear Systems
    Lee, Jae Young
    Park, Jin Bae
    Choi, Yoon Ho
    PROCEEDINGS OF THE 48TH IEEE CONFERENCE ON DECISION AND CONTROL, 2009 HELD JOINTLY WITH THE 2009 28TH CHINESE CONTROL CONFERENCE (CDC/CCC 2009), 2009, : 5009 - 5014
  • [6] Model-Free Control Design using Incremental Approximate Dynamic Programming and Generalized Extended State Observer
    Kim, Juyoung
    Lee, Hanna
    Lee, Youngjun
    Park, Jongho
    Kim, Youdan
    2024 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS, ICUAS, 2024, : 505 - 511
  • [7] Model-Free Cooperative Control for Multi-Agent Systems Using the Approximate Dynamic Programming Approach
    Qu, Yanhua
    Wang, Anna
    Liu, Jinglu
    IEEE ACCESS, 2018, 6 : 37195 - 37203
  • [8] Model-free incremental adaptive dynamic programming based approximate robust optimal regulation
    Li, Cong
    Wang, Yongchao
    Liu, Fangzhou
    Liu, Qingchen
    Buss, Martin
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (05) : 2662 - 2682
  • [9] Model-Free Dual Heuristic Dynamic Programming
    Ni, Zhen
    He, Haibo
    Zhong, Xiangnan
    Prokhorov, Danil V.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (08) : 1834 - 1839
  • [10] Model-free model elimination: A new step in the model-free dynamic analysis of NMR relaxation data
    d'Auvergne, Edward J.
    Gooley, Paul R.
    JOURNAL OF BIOMOLECULAR NMR, 2006, 35 (02) : 117 - 135