Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming

被引：2

作者：

Lock, Jonathan ^{[1
]}

McKelvey, Tomas ^{[1
]}

机构：

[1] Chalmers Univ Technol, Dept Elect Engn, S-41296 Gothenburg, Sweden

来源：

INTERNATIONAL JOURNAL OF CONTROL | 2022年 / 95卷 / 10期

关键词：

Approximate dynamic programming; control policy; undiscounted infinite-horizon; optimal control;

D O I：

10.1080/00207179.2021.1939892

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

We present a numerical method for generating the state-feedback control policy associated with general undiscounted, constant-setpoint, infinite-horizon, nonlinear optimal control problems with continuous state variables. The method is based on approximate dynamic programming, and is closely related to approximate policy iteration. Existing methods typically terminate based on the convergence of the control policy and either require a discounted problem formulation or demand the cost function to lie in a specific subclass of functions. The presented method extends on existing termination criteria by requiring both the control policy and the resulting system state to converge, allowing for use with undiscounted cost functions that are bounded and continuous. This paper defines the numerical method, derives the relevant underlying mathematical properties, and validates the numerical method with representative examples. A MATLAB implementation with the shown examples is freely available.

引用

页码：2854 / 2864

页数：11

共 50 条

[11] Optimal control of heavy haul train based on approximate dynamic programming
Wang, Xi
Tang, Tao
He, Hui
ADVANCES IN MECHANICAL ENGINEERING, 2017, 9 (04) : 1 - 15
[12] Approximate optimal control for an uncertain robot based on adaptive dynamic programming
Kong, Linghuan
Zhang, Shuang
Yu, Xinbo
NEUROCOMPUTING, 2021, 423 : 308 - 317
[13] Approximate Dynamic Programming Recurrence Relations for a Hybrid Optimal Control Problem
Lu, W.
Ferrari, S.
Fierro, R.
Wettergren, T. A.
UNMANNED SYSTEMS TECHNOLOGY XIV, 2012, 8387
[14] Implementation of Dynamic Programming for Optimal Control Problems With Continuous States
van Berkel, Koos
de Jager, Bram
Hofman, Theo
Steinbuch, Maarten
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2015, 23 (03) : 1172 - 1179
[15] Piecewise linear continuous optimal control by iterative dynamic programming
Luus, Rein
Industrial and Engineering Chemistry Research, 1993, 32 (05): : 859 - 865
[16] Value and Policy Iterations in Optimal Control and Adaptive Dynamic Programming
Bertsekas, Dimitri P.
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, 28 (03) : 500 - 509
[17] Symbolic Music Generation Conditioned on Continuous-Valued Emotions
Sulun S.
Davies M.E.P.
Viana P.
IEEE Access, 2022, 10 : 44617 - 44626
[18] ON THE HANDLING OF CONTINUOUS-VALUED ATTRIBUTES IN DECISION TREE GENERATION
FAYYAD, UM
IRANI, KB
MACHINE LEARNING, 1992, 8 (01) : 87 - 102
[19] Approximate Dynamic Programming Methods Applied to Far Trajectory Planning in Optimal Control
Wahl, Hans-Georg
Holzaepfel, Marc
Gauterin, Frank
2014 IEEE INTELLIGENT VEHICLES SYMPOSIUM PROCEEDINGS, 2014, : 1085 - 1090
[20] Optimal Tracking Control for Ship Course Using Approximate Dynamic Programming Method
Xie Qingqing
Luo Bin
Tan Fuxiao
2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 2911 - 2916

← 1 2 3 4 5 →