Approximate Dynamic Programming Using Model-Free Bellman Residual Elimination

被引：0

作者：

Bethke, Brett

How, Jonathan P.

机构：

来源：

2010 AMERICAN CONTROL CONFERENCE | 2010年

关键词：

D O I：

暂无

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents an modification to the method of Bellman Residual Elimination (BRE) [1], [2] for approximate dynamic programming. While prior work on BRE has focused on learning an approximate policy for an underlying Markov Decision Process (MDP) when the state transition model of the MDP is known, this work proposes a model-free variant of BRE that does not require knowledge of the state transition model. Instead, state trajectories of the system, generated using simulation and/or observations of the real system in operation, are used to build stochastic approximations of the quantities needed to carry out the BRE algorithm. The resulting algorithm can be shown to converge to the policy produced by the nominal, model-based BRE algorithm in the limit of observing an infinite number of trajectories. To validate the performance of the approach, we compare model-based and model-free BRE against LSPI [3], a well-known approximate dynamic programming algorithm. Measuring performance in terms of both computational complexity and policy quality, we present results showing that BRE performs at least as well as, and sometimes significantly better than, LSPI on a standard benchmark problem.

引用

页码：4146 / 4151

页数：6

共 50 条

[1] Approximate Dynamic Programming Using Bellman Residual Elimination and Gaussian Process Regression
Bethke, Brett
How, Jonathan P.
2009 AMERICAN CONTROL CONFERENCE, VOLS 1-9, 2009, : 745 - +
[2] Model-free approximate dynamic programming schemes for linear systems
Al-Tamimi, Asma
Vrabie, Draguna
Abu-Khalaf, Murad
Lewis, Frank L.
2007 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-6, 2007, : 371 - +
[3] An Approximate Dynamic Programming Approach for Model-free Control of Switched Systems
Lu, Wenjie
Ferrari, Silvia
2013 IEEE 52ND ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2013, : 3837 - 3844
[4] Online and Model-Free Supplementary Learning Control Based on Approximate Dynamic Programming
Guo, Wentao
Liu, Feng
Si, Jennie
Mei, Shengwei
26TH CHINESE CONTROL AND DECISION CONFERENCE (2014 CCDC), 2014, : 1316 - 1321
[5] Model-Free Approximate Dynamic Programming for Continuous-Time Linear Systems
Lee, Jae Young
Park, Jin Bae
Choi, Yoon Ho
PROCEEDINGS OF THE 48TH IEEE CONFERENCE ON DECISION AND CONTROL, 2009 HELD JOINTLY WITH THE 2009 28TH CHINESE CONTROL CONFERENCE (CDC/CCC 2009), 2009, : 5009 - 5014
[6] Model-Free Control Design using Incremental Approximate Dynamic Programming and Generalized Extended State Observer
Kim, Juyoung
Lee, Hanna
Lee, Youngjun
Park, Jongho
Kim, Youdan
2024 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS, ICUAS, 2024, : 505 - 511
[7] Model-Free Cooperative Control for Multi-Agent Systems Using the Approximate Dynamic Programming Approach
Qu, Yanhua
Wang, Anna
Liu, Jinglu
IEEE ACCESS, 2018, 6 : 37195 - 37203
[8] Model-free incremental adaptive dynamic programming based approximate robust optimal regulation
Li, Cong
Wang, Yongchao
Liu, Fangzhou
Liu, Qingchen
Buss, Martin
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2022, 32 (05) : 2662 - 2682
[9] Model-Free Dual Heuristic Dynamic Programming
Ni, Zhen
He, Haibo
Zhong, Xiangnan
Prokhorov, Danil V.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (08) : 1834 - 1839
[10] Model-free model elimination: A new step in the model-free dynamic analysis of NMR relaxation data
d'Auvergne, Edward J.
Gooley, Paul R.
JOURNAL OF BIOMOLECULAR NMR, 2006, 35 (02) : 117 - 135

← 1 2 3 4 5 →