CONTINUOUS-TIME ROBUST DYNAMIC PROGRAMMING

被引:37
|
作者
Bian, Tao [1 ]
Jiang, Zhong-Ping [2 ]
机构
[1] Bank Amer Merrill Lynch, One Bryant Pk, New York, NY 10036 USA
[2] NYU, Tandon Sch Engn, Dept Elect & Comp Engn, 6 Metrotech Ctr, Brooklyn, NY 11201 USA
基金
美国国家科学基金会;
关键词
dynamic programming; stochastic optimal control; adaptive optimal control; robust control; STOCHASTIC-APPROXIMATION; STABILIZATION; STABILITY; SYSTEMS; INPUT;
D O I
10.1137/18M1214147
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a new theory, known as robust dynamic programming, for a class of continuous-time dynamical systems. Different from traditional dynamic programming (DP) methods, this new theory serves as a fundamental tool to analyze the robustness of DP algorithms, and, in particular, to develop novel adaptive optimal control and reinforcement learning methods. In order to demonstrate the potential of this new framework, two illustrative applications in the fields of stochastic and decentralized optimal control are presented. Two numerical examples arising from both finance and engineering industries are also given, along with several possible extensions of the proposed framework.
引用
收藏
页码:4150 / 4174
页数:25
相关论文
共 50 条