Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming

Cited by: 2

Authors:

Lock, Jonathan [1]

McKelvey, Tomas [1]

Affiliations:

[1] Chalmers Univ Technol, Dept Elect Engn, S-41296 Gothenburg, Sweden

Source:

INTERNATIONAL JOURNAL OF CONTROL | 2022 / Vol. 95 / No. 10

Keywords:

Approximate dynamic programming; control policy; undiscounted infinite-horizon; optimal control

DOI:

10.1080/00207179.2021.1939892

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology]

Discipline classification code:

0812

Abstract:

We present a numerical method for generating the state-feedback control policy associated with general undiscounted, constant-setpoint, infinite-horizon, nonlinear optimal control problems with continuous state variables. The method is based on approximate dynamic programming and is closely related to approximate policy iteration. Existing methods typically terminate based on the convergence of the control policy and either require a discounted problem formulation or require the cost function to lie in a specific subclass of functions. The presented method extends existing termination criteria by requiring both the control policy and the resulting system state to converge, allowing for use with undiscounted cost functions that are bounded and continuous. This paper defines the numerical method, derives the relevant underlying mathematical properties, and validates the numerical method with representative examples. A MATLAB implementation with the shown examples is freely available.
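
The termination idea described in the abstract (stop only when both the control policy and the closed-loop state have converged) can be illustrated with a minimal Python sketch. This is not the authors' algorithm or their MATLAB implementation: it swaps in plain value-iteration-style Bellman backups on a grid instead of approximate policy iteration, and the dynamics `step`, the stage cost `cost`, the grids `X` and `U`, and all tolerances are hypothetical choices made only for illustration.

```python
import numpy as np

# Hypothetical toy problem (not from the paper): scalar state, setpoint x = 0.
X = np.linspace(-1.0, 1.0, 21)              # state grid
U = np.array([-1.0, -0.5, 0.0, 0.5, 1.0])   # candidate controls

def step(x, u):
    """Assumed dynamics x+ = x + 0.2*u, saturated to the grid range."""
    return np.clip(x + 0.2 * u, X[0], X[-1])

def cost(x, u):
    """Assumed bounded, continuous stage cost in [0, 1), zero only at (0, 0)."""
    return -np.expm1(-(x**2 + 0.1 * u**2))

def greedy_backup(V):
    """One undiscounted Bellman backup; V is interpolated linearly on X."""
    x, u = np.meshgrid(X, U, indexing="ij")
    Q = cost(x, u) + np.interp(step(x, u), X, V)
    best = Q.argmin(axis=1)
    return Q[np.arange(X.size), best], best

def closed_loop_final_state(policy_idx, n_steps=200):
    """Simulate from every grid point with nearest-grid-point policy lookup."""
    x = X.copy()
    h = X[1] - X[0]
    for _ in range(n_steps):
        i = np.clip(np.rint((x - X[0]) / h).astype(int), 0, X.size - 1)
        x = step(x, U[policy_idx[i]])
    return x

def solve(max_sweeps=500, state_tol=1e-6):
    V = np.zeros(X.size)
    policy = np.full(X.size, -1)
    for sweep in range(1, max_sweeps + 1):
        V, new_policy = greedy_backup(V)
        policy_converged = np.array_equal(new_policy, policy)
        policy = new_policy
        # Criterion 1: the greedy policy did not change in this sweep.
        if policy_converged:
            # Criterion 2: the closed-loop state settles at the setpoint.
            x_end = closed_loop_final_state(policy)
            if np.max(np.abs(x_end)) < state_tol:
                return V, U[policy], sweep, True
    return V, U[policy], max_sweeps, False

V, pi, sweeps, converged = solve()
```

In this toy setting an unchanged policy alone is not accepted as a stopping signal: a policy that keeps the state away from the setpoint fails the second check, so the backups continue. With no discount factor, the cost-to-go stays finite only along trajectories that reach the zero-cost setpoint, which is why the state check is paired with the policy check here.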


Pages: 2854-2864

Number of pages: 11

