CONTINUOUS-TIME ROBUST DYNAMIC PROGRAMMING

Cited by: 37

Authors:

Bian, Tao [1]

Jiang, Zhong-Ping [2]

Affiliations:

[1] Bank Amer Merrill Lynch, One Bryant Pk, New York, NY 10036 USA

[2] NYU, Tandon Sch Engn, Dept Elect & Comp Engn, 6 Metrotech Ctr, Brooklyn, NY 11201 USA

Source:

SIAM JOURNAL ON CONTROL AND OPTIMIZATION | 2019 / Vol. 57 / No. 06

Funding:

National Science Foundation (USA);

Keywords:

dynamic programming; stochastic optimal control; adaptive optimal control; robust control; STOCHASTIC-APPROXIMATION; STABILIZATION; STABILITY; SYSTEMS; INPUT;

DOI:

10.1137/18M1214147

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

This paper presents a new theory, known as robust dynamic programming, for a class of continuous-time dynamical systems. Different from traditional dynamic programming (DP) methods, this new theory serves as a fundamental tool to analyze the robustness of DP algorithms, and, in particular, to develop novel adaptive optimal control and reinforcement learning methods. In order to demonstrate the potential of this new framework, two illustrative applications in the fields of stochastic and decentralized optimal control are presented. Two numerical examples arising from both finance and engineering industries are also given, along with several possible extensions of the proposed framework.


Pages: 4150 - 4174

Number of pages: 25
