Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming

被引：2

作者：

Lock, Jonathan ^{[1
]}

McKelvey, Tomas ^{[1
]}

机构：

[1] Chalmers Univ Technol, Dept Elect Engn, S-41296 Gothenburg, Sweden

来源：

INTERNATIONAL JOURNAL OF CONTROL | 2022年 / 95卷 / 10期

关键词：

Approximate dynamic programming; control policy; undiscounted infinite-horizon; optimal control;

D O I：

10.1080/00207179.2021.1939892

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We present a numerical method for generating the state-feedback control policy associated with general undiscounted, constant-setpoint, infinite-horizon, nonlinear optimal control problems with continuous state variables. The method is based on approximate dynamic programming, and is closely related to approximate policy iteration. Existing methods typically terminate based on the convergence of the control policy and either require a discounted problem formulation or demand the cost function to lie in a specific subclass of functions. The presented method extends on existing termination criteria by requiring both the control policy and the resulting system state to converge, allowing for use with undiscounted cost functions that are bounded and continuous. This paper defines the numerical method, derives the relevant underlying mathematical properties, and validates the numerical method with representative examples. A MATLAB implementation with the shown examples is freely available.

引用

页码：2854 / 2864

页数：11

共 50 条

[41] NECESSARY AND SUFFICIENT DYNAMIC PROGRAMMING CONDITIONS FOR CONTINUOUS TIME STOCHASTIC OPTIMAL CONTROL
RISHEL, RW
NOTICES OF THE AMERICAN MATHEMATICAL SOCIETY, 1970, 17 (01): : 252 - &
[42] Proposal of the Continuous-Valued Penalty Avoiding Rational Policy Making Algorithm
Miyazaki, Kazuteru
JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2012, 16 (02) : 183 - 190
[43] Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control
Luo, Biao
Liu, Derong
Wu, Huai-Ning
Wang, Ding
Lewis, Frank L.
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (10) : 3341 - 3354
[44] Approximate Optimal tracking Control for Nonlinear Discrete-time Switched Systems via Approximate Dynamic Programming
Qin, Chunbin
Huang, Yizhe
Yang, Yabin
Zhang, Jishi
Liu, Xianxing
PROCEEDINGS OF THE 2019 31ST CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2019), 2019, : 1456 - 1461
[45] Approximate Dynamic Programming for Self-Learning Control
Derong Liu Department of Electrical and Computer Engineering University of Illinois Chicago IL USA
自动化学报, 2005, (01) : 13 - 18
[46] Nonlinear Control of Quadcopters via Approximate Dynamic Programming
Romero, Angel
Beuchat, Paul N.
Sturz, Yvonne R.
Smith, Roy S.
Lygeros, John
2019 18TH EUROPEAN CONTROL CONFERENCE (ECC), 2019, : 3752 - 3759
[47] Shared Control of Human and Robot by Approximate Dynamic Programming
Li, Yanan
Tee, Keng Peng
Yan, Rui
Limbu, Dilip Kumar
Ge, Shuzhi Sam
2015 AMERICAN CONTROL CONFERENCE (ACC), 2015, : 1167 - 1172
[48] Multiple approximate dynamic programming controllers for congestion control
Xiang, Yanping
Yi, Jianqiang
Zhao, Dongbin
ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGS, 2007, 4491 : 368 - +
[49] Adaptive polyhedral meshing for approximate dynamic programming in control
Sala, Antonio
Armesto, Leopoldo
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 107
[50] Approximate Dynamic Programming in Tracking Control of a Robotic Manipulator
Szuster, Marcin
Gierlak, Piotr
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2016, 13

← 1 2 3 4 5 →