Solving optimal predictor-feedback control using approximate dynamic programming

被引:1
|
作者
Wang, Hongxia [1 ]
Zhao, Fuyu [1 ]
Zhang, Zhaorong [2 ]
Xu, Juanjuan [3 ]
Li, Xun [4 ]
机构
[1] Shandong Univ Sci & Technol, Sch Elect Engn & Automat, Qingdao, Peoples R China
[2] Shandong Univ, Sch Comp Sci & Technol, Qingdao, Peoples R China
[3] Shandong Univ, Sch Control Sci & Engn, Jinan, Peoples R China
[4] Hong Kong Polytech Univ, Dept Appl Math, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Stochastic system; Optimal control; Input delay; Approximate dynamic programming; ADAPTIVE OPTIMAL-CONTROL; DISCRETE-TIME-SYSTEMS; LINEAR-SYSTEMS; LEARNING ALGORITHM; DELAYS; STATE;
D O I
10.1016/j.automatica.2024.111848
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper is concerned with approximately solving the optimal predictor-feedback control problem of multiplicative-noise systems with input delay in infinite horizon. The optimal predictor-feedback control, provided by the analytical method, is determined by Riccati-ZXL equations and is hard to obtain in the case of unknown system dynamics. We aim to propose a policy iteration (PI) algorithm for solving the optimal solution by approximate dynamic programming. For convergence analysis of the algorithm, we first develop a necessary and sufficient stabilizing condition, in the form of several new Lyapunov-type equations, which parameterizes all predictor-feedback controllers and can be seen as an important addition to Lyapunov stability theory. We then propose an iterative scheme for the Riccati-ZXL equations computations, along with convergence analysis, based on the condition. Inspired by this scheme, a data-driven online PI algorithm, convergence implied in that of the iterative scheme, is proposed for the optimal predictor-feedback control problem without full system dynamics. Finally, a numerical example is used to evaluate the proposed PI algorithm. (c) 2024 Published by Elsevier Ltd.
引用
收藏
页数:8
相关论文
共 50 条
  • [31] Adaptive railway traffic control using approximate dynamic programming
    Ghasempour, Taha
    Heydecker, Benjamin
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 113 : 91 - 107
  • [32] Optimal control of a fed-batch bioreactor using simulation-based approximate dynamic programming
    Peroni, CV
    Kaisare, NS
    Lee, JH
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2005, 13 (05) : 786 - 790
  • [33] Approximate Dynamic Programming Methods Applied to Far Trajectory Planning in Optimal Control
    Wahl, Hans-Georg
    Holzaepfel, Marc
    Gauterin, Frank
    2014 IEEE INTELLIGENT VEHICLES SYMPOSIUM PROCEEDINGS, 2014, : 1085 - 1090
  • [34] Near-optimal Control of Motor Drives via Approximate Dynamic Programming
    Wang, Yebin
    Chakrabarty, Ankush
    Zhou, Meng-Chu
    Zhang, Jinyun
    2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 3679 - 3686
  • [35] Nonlinear predictor-feedback cooperative adaptive cruise control of vehicles with nonlinear dynamics and input delay
    Bekiaris-Liberis, Nikolaos
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (10) : 6683 - 6698
  • [36] AN APPROXIMATE METHOD FOR SOLVING OPTIMAL CONTROL PROBLEMS
    CHANG, CS
    DERUSSO, PM
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1964, AC 9 (04) : 554 - &
  • [37] Undiscounted control policy generation for continuous-valued optimal control by approximate dynamic programming
    Lock, Jonathan
    McKelvey, Tomas
    INTERNATIONAL JOURNAL OF CONTROL, 2022, 95 (10) : 2854 - 2864
  • [38] Optimal signal control using adaptive dynamic programming
    Kim, CO
    Park, Y
    Baek, JG
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2005, VOL 4, PROCEEDINGS, 2005, 3483 : 148 - 160
  • [39] Event-triggered adaptive optimal control using output feedback: An adaptive dynamic programming approach
    Zhao, Fuyu
    Jiang, Zhong-Ping
    Liu, Tengfei
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 1979 - 1984
  • [40] Discrete-Time LQR Optimal Tracking Control Problems Using Approximate Dynamic Programming Algorithm with Disturbance
    Xie, Qingqing
    Luo, Bin
    Tan, Fuxiao
    PROCEEDINGS OF THE 2013 FOURTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2013, : 716 - 721