CONTINUOUS-TIME ROBUST DYNAMIC PROGRAMMING

Cited by: 37
Authors
Bian, Tao [1]
Jiang, Zhong-Ping [2]
Affiliations
[1] Bank Amer Merrill Lynch, One Bryant Pk, New York, NY 10036 USA
[2] NYU, Tandon Sch Engn, Dept Elect & Comp Engn, 6 Metrotech Ctr, Brooklyn, NY 11201 USA
Funding
National Science Foundation (USA)
Keywords
dynamic programming; stochastic optimal control; adaptive optimal control; robust control; STOCHASTIC-APPROXIMATION; STABILIZATION; STABILITY; SYSTEMS; INPUT;
DOI
10.1137/18M1214147
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper presents a new theory, known as robust dynamic programming, for a class of continuous-time dynamical systems. Different from traditional dynamic programming (DP) methods, this new theory serves as a fundamental tool to analyze the robustness of DP algorithms, and, in particular, to develop novel adaptive optimal control and reinforcement learning methods. In order to demonstrate the potential of this new framework, two illustrative applications in the fields of stochastic and decentralized optimal control are presented. Two numerical examples arising from both finance and engineering industries are also given, along with several possible extensions of the proposed framework.
Pages: 4150-4174
Page count: 25
Related papers
50 in total
  • [1] Robust adaptive dynamic programming for continuous-time linear stochastic systems
    Bian, Tao
    Jiang, Zhong-Ping
    2014 IEEE INTERNATIONAL SYMPOSIUM ON INTELLIGENT CONTROL (ISIC), 2014, : 536 - 541
  • [2] ON CONTINUOUS-TIME DISCOUNTED STOCHASTIC DYNAMIC-PROGRAMMING
    LAI, HC
    TANAKA, K
    APPLIED MATHEMATICS AND OPTIMIZATION, 1991, 23 (02): : 155 - 169
  • [3] Continuous-Time Differential Dynamic Programming with Terminal Constraints
    Sun, Wei
    Theodorou, Evangelos A.
    Tsiotras, Panagiotis
    2014 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING (ADPRL), 2014, : 289 - 294
  • [4] Biologically inspired scheme for continuous-time approximate dynamic programming
    Vrabie, Draguna
    Lewis, Frank
    Abu-Khalaf, Murad
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2008, 30 (3-4) : 207 - 223
  • [5] Continuous-Time Stochastic Policy Iteration of Adaptive Dynamic Programming
    Wei, Qinglai
    Zhou, Tianmin
    Lu, Jingwei
    Liu, Yu
    Su, Shuai
    Xiao, Jun
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (10): : 6375 - 6387
  • [6] Finite-difference methods for continuous-time dynamic programming
    Candler, GV
    COMPUTATIONAL METHODS FOR THE STUDY OF DYNAMIC ECONOMIES, 1999, : 172 - 194
  • [7] Global Adaptive Dynamic Programming for Continuous-Time Nonlinear Systems
    Jiang, Yu
    Jiang, Zhong-Ping
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2015, 60 (11) : 2917 - 2929
  • [8] Adaptive dynamic programming for robust neural control of unknown continuous-time non-linear systems
    Yang, Xiong
    He, Haibo
    Liu, Derong
    Zhu, Yuanheng
    IET CONTROL THEORY AND APPLICATIONS, 2017, 11 (14): : 2307 - 2316
  • [9] CONTINUOUS-TIME MATRIX PROGRAMMING
    SINGH, C
    KIRAN, M
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1993, 175 (01) : 280 - 291
  • [10] Numerical Method for Solving the Robust Continuous-Time Linear Programming Problems
    Wu, Hsien-Chung
    MATHEMATICS, 2019, 7 (05)