CONTINUOUS-TIME ROBUST DYNAMIC PROGRAMMING

Cited by: 37
Authors
Bian, Tao [1]
Jiang, Zhong-Ping [2]
Affiliations
[1] Bank Amer Merrill Lynch, One Bryant Pk, New York, NY 10036 USA
[2] NYU, Tandon Sch Engn, Dept Elect & Comp Engn, 6 Metrotech Ctr, Brooklyn, NY 11201 USA
Funding
National Science Foundation (USA)
Keywords
dynamic programming; stochastic optimal control; adaptive optimal control; robust control; STOCHASTIC-APPROXIMATION; STABILIZATION; STABILITY; SYSTEMS; INPUT;
DOI
10.1137/18M1214147
Chinese Library Classification number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This paper presents a new theory, known as robust dynamic programming, for a class of continuous-time dynamical systems. Unlike traditional dynamic programming (DP) methods, this new theory serves as a fundamental tool for analyzing the robustness of DP algorithms and, in particular, for developing novel adaptive optimal control and reinforcement learning methods. To demonstrate the potential of this new framework, two illustrative applications in the fields of stochastic and decentralized optimal control are presented. Two numerical examples arising from the finance and engineering industries are also given, along with several possible extensions of the proposed framework.
Pages: 4150 - 4174
Number of pages: 25
Related papers
50 in total
  • [21] Robust Multi-Parametric Control of Continuous-Time Linear Dynamic Systems
    Sun, Muxin
    Villanueva, Mario E.
    Pistikopoulos, Efstratios N.
    Chachuat, Benoit
    IFAC PAPERSONLINE, 2017, 50 (01) : 4660 - 4665
  • [22] NETWORK PROGRAMMING IN CONTINUOUS-TIME WITH NODE STORAGE
    PHILPOTT, AB
    LECTURE NOTES IN ECONOMICS AND MATHEMATICAL SYSTEMS, 1985, 259 : 136 - 153
  • [23] SUFFICIENT OPTIMALITY CRITERIA IN CONTINUOUS-TIME PROGRAMMING
    KAUL, RN
    KAUR, S
    JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1982, 88 (01) : 37 - 47
  • [24] Vector continuous-time programming without differentiability
    de Oliveira, Valeriano Antunes
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2010, 234 (03) : 924 - 933
  • [25] DUALITY FOR A CLASS OF CONTINUOUS-TIME PROGRAMMING PROBLEMS
    Preda, Vasile
    PROCEEDINGS OF THE ROMANIAN ACADEMY SERIES A-MATHEMATICS PHYSICS TECHNICAL SCIENCES INFORMATION SCIENCE, 2009, 10 (03) : 222 - 225
  • [26] Quantitative Programming and Continuous-Time Markov Chains
    Todoran, Eneia Nicolae
    2023 25TH INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING, SYNASC 2023, 2023 : 104 - 113
  • [27] Continuous-Time Dynamic Network Embeddings
    Nguyen, Giang Hoang
    Lee, John Boaz
    Rossi, Ryan A.
    Ahmed, Nesreen K.
    Koh, Eunyee
    Kim, Sungchul
    COMPANION PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2018 (WWW 2018), 2018 : 969 - 976
  • [28] Approximate Dynamic Programming with Gaussian Processes for Optimal Control of Continuous-Time Nonlinear Systems
    Beppu, Hirofumi
    Maruta, Ichiro
    Fujimoto, Kenji
    IFAC PAPERSONLINE, 2020, 53 (02) : 6715 - 6722
  • [29] Event-Triggered Adaptive Dynamic Programming for Continuous-Time Systems With Control Constraints
    Dong, Lu
    Zhong, Xiangnan
    Sun, Changyin
    He, Haibo
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (08) : 1941 - 1952
  • [30] Convergence Analysis of Value Iteration Adaptive Dynamic Programming for Continuous-Time Nonlinear Systems
    Xiao, Geyang
    Zhang, Huaguang
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (03) : 1639 - 1649