CONTINUOUS-TIME ROBUST DYNAMIC PROGRAMMING

被引:37
|
作者
Bian, Tao [1 ]
Jiang, Zhong-Ping [2 ]
机构
[1] Bank Amer Merrill Lynch, One Bryant Pk, New York, NY 10036 USA
[2] NYU, Tandon Sch Engn, Dept Elect & Comp Engn, 6 Metrotech Ctr, Brooklyn, NY 11201 USA
基金
美国国家科学基金会;
关键词
dynamic programming; stochastic optimal control; adaptive optimal control; robust control; STOCHASTIC-APPROXIMATION; STABILIZATION; STABILITY; SYSTEMS; INPUT;
D O I
10.1137/18M1214147
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a new theory, known as robust dynamic programming, for a class of continuous-time dynamical systems. Different from traditional dynamic programming (DP) methods, this new theory serves as a fundamental tool to analyze the robustness of DP algorithms, and, in particular, to develop novel adaptive optimal control and reinforcement learning methods. In order to demonstrate the potential of this new framework, two illustrative applications in the fields of stochastic and decentralized optimal control are presented. Two numerical examples arising from both finance and engineering industries are also given, along with several possible extensions of the proposed framework.
引用
收藏
页码:4150 / 4174
页数:25
相关论文
共 50 条
  • [31] ROBUST LYAPUNOV GAMES - THE CONTINUOUS-TIME CASE
    DEISSENBERG, C
    LECTURE NOTES IN ECONOMICS AND MATHEMATICAL SYSTEMS, 1991, 353 : 65 - 83
  • [32] Optimal dynamic output feedback control of unknown linear continuous-time systems by adaptive dynamic programming☆
    Xie, Kedi
    Zheng, Yiwei
    Jiang, Yi
    Lan, Weiyao
    Yu, Xiao
    AUTOMATICA, 2024, 163
  • [33] ROBUST CONSENSUS FOR CONTINUOUS-TIME MULTIAGENT DYNAMICS
    Shi, Guodong
    Johansson, Karl Henrik
    SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 2013, 51 (05) : 3673 - 3691
  • [34] Robust stability condition for continuous-time systems
    Chughtai, SS
    Munro, N
    ELECTRONICS LETTERS, 2004, 40 (16) : 978 - 979
  • [35] Robust stability of continuous-time difference systems
    Kharitonov, VL
    INTERNATIONAL JOURNAL OF CONTROL, 1996, 64 (05) : 985 - 990
  • [36] Design of a robust NTF for continuous-time ΔΣ modulators
    Mirzaei, Mahdi
    Shamsi, Hossein
    IEICE ELECTRONICS EXPRESS, 2010, 7 (17): : 1323 - 1328
  • [37] Extended Form of Robust Solutions for Uncertain Continuous-Time Linear Programming Problems with Time-Dependent Matrices
    Wu, Hsien-Chung
    AXIOMS, 2022, 11 (05)
  • [38] A pseudospectral method for continuous-time nonlinear fractional programming
    Yang, Yin
    Skandari, M. H. Noori
    Zhang, Jiaqi
    FILOMAT, 2024, 38 (06) : 1947 - 1961
  • [39] Parametric continuous-time linear fractional programming problems
    Wu, Hsien-Chung
    JOURNAL OF INEQUALITIES AND APPLICATIONS, 2015, : 1 - 22
  • [40] Parametric continuous-time linear fractional programming problems
    Hsien-Chung Wu
    Journal of Inequalities and Applications, 2015