Solving optimal predictor-feedback control using approximate dynamic programming

被引：1

作者：

Wang, Hongxia ^{[1
]}

Zhao, Fuyu ^{[1
]}

Zhang, Zhaorong ^{[2
]}

Xu, Juanjuan ^{[3
]}

Li, Xun ^{[4
]}

机构：

[1] Shandong Univ Sci & Technol, Sch Elect Engn & Automat, Qingdao, Peoples R China

[2] Shandong Univ, Sch Comp Sci & Technol, Qingdao, Peoples R China

[3] Shandong Univ, Sch Control Sci & Engn, Jinan, Peoples R China

[4] Hong Kong Polytech Univ, Dept Appl Math, Hong Kong, Peoples R China

来源：

AUTOMATICA | 2024年 / 170卷

基金：

中国国家自然科学基金;

关键词：

Stochastic system; Optimal control; Input delay; Approximate dynamic programming; ADAPTIVE OPTIMAL-CONTROL; DISCRETE-TIME-SYSTEMS; LINEAR-SYSTEMS; LEARNING ALGORITHM; DELAYS; STATE;

D O I：

10.1016/j.automatica.2024.111848

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper is concerned with approximately solving the optimal predictor-feedback control problem of multiplicative-noise systems with input delay in infinite horizon. The optimal predictor-feedback control, provided by the analytical method, is determined by Riccati-ZXL equations and is hard to obtain in the case of unknown system dynamics. We aim to propose a policy iteration (PI) algorithm for solving the optimal solution by approximate dynamic programming. For convergence analysis of the algorithm, we first develop a necessary and sufficient stabilizing condition, in the form of several new Lyapunov-type equations, which parameterizes all predictor-feedback controllers and can be seen as an important addition to Lyapunov stability theory. We then propose an iterative scheme for the Riccati-ZXL equations computations, along with convergence analysis, based on the condition. Inspired by this scheme, a data-driven online PI algorithm, convergence implied in that of the iterative scheme, is proposed for the optimal predictor-feedback control problem without full system dynamics. Finally, a numerical example is used to evaluate the proposed PI algorithm. (c) 2024 Published by Elsevier Ltd.

引用

页数：8

共 50 条

[31] Adaptive railway traffic control using approximate dynamic programming
Ghasempour, Taha
Heydecker, Benjamin
TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 113 : 91 - 107
[32] Optimal control of a fed-batch bioreactor using simulation-based approximate dynamic programming
Peroni, CV
Kaisare, NS
Lee, JH
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2005, 13 (05) : 786 - 790
[33] Approximate Dynamic Programming Methods Applied to Far Trajectory Planning in Optimal Control
Wahl, Hans-Georg
Holzaepfel, Marc
Gauterin, Frank
2014 IEEE INTELLIGENT VEHICLES SYMPOSIUM PROCEEDINGS, 2014, : 1085 - 1090
[34] Near-optimal Control of Motor Drives via Approximate Dynamic Programming
Wang, Yebin
Chakrabarty, Ankush
Zhou, Meng-Chu
Zhang, Jinyun
2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 3679 - 3686
[35] Nonlinear predictor-feedback cooperative adaptive cruise control of vehicles with nonlinear dynamics and input delay
Bekiaris-Liberis, Nikolaos
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (10) : 6683 - 6698
[36] AN APPROXIMATE METHOD FOR SOLVING OPTIMAL CONTROL PROBLEMS
CHANG, CS
DERUSSO, PM
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1964, AC 9 (04) : 554 - &
[37] Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming
Lock, Jonathan
McKelvey, Tomas
INTERNATIONAL JOURNAL OF CONTROL, 2022, 95 (10) : 2854 - 2864
[38] Optimal signal control using adaptive dynamic programming
Kim, CO
Park, Y
Baek, JG
COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2005, VOL 4, PROCEEDINGS, 2005, 3483 : 148 - 160
[39] Event-triggered adaptive optimal control using output feedback: An adaptive dynamic programming approach
Zhao, Fuyu
Jiang, Zhong-Ping
Liu, Tengfei
PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 1979 - 1984
[40] Discrete-Time LQR Optimal Tracking Control Problems Using Approximate Dynamic Programming Algorithm with Disturbance
Xie, Qingqing
Luo, Bin
Tan, Fuxiao
PROCEEDINGS OF THE 2013 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2013, : 716 - 721

← 1 2 3 4 5 →