Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming

被引：2

作者：

Lock, Jonathan ^{[1
]}

McKelvey, Tomas ^{[1
]}

机构：

[1] Chalmers Univ Technol, Dept Elect Engn, S-41296 Gothenburg, Sweden

来源：

INTERNATIONAL JOURNAL OF CONTROL | 2022年 / 95卷 / 10期

关键词：

Approximate dynamic programming; control policy; undiscounted infinite-horizon; optimal control;

D O I：

10.1080/00207179.2021.1939892

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We present a numerical method for generating the state-feedback control policy associated with general undiscounted, constant-setpoint, infinite-horizon, nonlinear optimal control problems with continuous state variables. The method is based on approximate dynamic programming, and is closely related to approximate policy iteration. Existing methods typically terminate based on the convergence of the control policy and either require a discounted problem formulation or demand the cost function to lie in a specific subclass of functions. The presented method extends on existing termination criteria by requiring both the control policy and the resulting system state to converge, allowing for use with undiscounted cost functions that are bounded and continuous. This paper defines the numerical method, derives the relevant underlying mathematical properties, and validates the numerical method with representative examples. A MATLAB implementation with the shown examples is freely available.

引用

页码：2854 / 2864

页数：11

共 50 条

[21] Optimal Switching and Control of Nonlinear Switching Systems Using Approximate Dynamic Programming
Heydari, Ali
Balakrishnan, Sivasubramanya N.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (06) : 1106 - 1117
[22] Near-optimal Control of Motor Drives via Approximate Dynamic Programming
Wang, Yebin
Chakrabarty, Ankush
Zhou, Meng-Chu
Zhang, Jinyun
2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 3679 - 3686
[23] Solving optimal predictor-feedback control using approximate dynamic programming
Wang, Hongxia
Zhao, Fuyu
Zhang, Zhaorong
Xu, Juanjuan
Li, Xun
AUTOMATICA, 2024, 170
[24] Optimal Tracking Control for a Class of Continuous Time Complex-valued Systems Based on Adaptive Dynamic Programming Algorithm
Song, Ruizhuo
Lewis, Frank L.
Wei, Qinglai
Xiao, Wendong
2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 8968 - 8972
[25] Learning Algorithm of Decision Tree Generation for Continuous-valued Attribute
Li Hua
Hu Xiaojuan
Sun Haizhen
PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE, 2010, : 2642 - 2644
[26] Approximate dynamic programming approach for process control
Lee, Jay H.
Wong, Weechin
JOURNAL OF PROCESS CONTROL, 2010, 20 (09) : 1038 - 1048
[27] On the handling of fuzziness for continuous-valued attributes in decision tree generation
Wang, XZ
Hong, JR
FUZZY SETS AND SYSTEMS, 1998, 99 (03) : 283 - 290
[28] Approximate dynamic programming for ship course control
Bai, Xuerui
Yi, Jianqiang
Zhao, Dongbin
ADVANCES IN NEURAL NETWORKS - ISNN 2007, PT 1, PROCEEDINGS, 2007, 4491 : 349 - +
[29] Approximate Dynamic Programming for Output Feedback Control
Jiang Yu
Jiang Zhong-Ping
PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE, 2010, : 5815 - 5820
[30] Approximate dynamic programming approach for process control
Lee, Jay H.
INTERNATIONAL CONFERENCE ON CONTROL, AUTOMATION AND SYSTEMS (ICCAS 2010), 2010, : 459 - 464

← 1 2 3 4 5 →