Solving optimal predictor-feedback control using approximate dynamic programming

被引:1
|
作者
Wang, Hongxia [1 ]
Zhao, Fuyu [1 ]
Zhang, Zhaorong [2 ]
Xu, Juanjuan [3 ]
Li, Xun [4 ]
机构
[1] Shandong Univ Sci & Technol, Sch Elect Engn & Automat, Qingdao, Peoples R China
[2] Shandong Univ, Sch Comp Sci & Technol, Qingdao, Peoples R China
[3] Shandong Univ, Sch Control Sci & Engn, Jinan, Peoples R China
[4] Hong Kong Polytech Univ, Dept Appl Math, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Stochastic system; Optimal control; Input delay; Approximate dynamic programming; ADAPTIVE OPTIMAL-CONTROL; DISCRETE-TIME-SYSTEMS; LINEAR-SYSTEMS; LEARNING ALGORITHM; DELAYS; STATE;
D O I
10.1016/j.automatica.2024.111848
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper is concerned with approximately solving the optimal predictor-feedback control problem of multiplicative-noise systems with input delay in infinite horizon. The optimal predictor-feedback control, provided by the analytical method, is determined by Riccati-ZXL equations and is hard to obtain in the case of unknown system dynamics. We aim to propose a policy iteration (PI) algorithm for solving the optimal solution by approximate dynamic programming. For convergence analysis of the algorithm, we first develop a necessary and sufficient stabilizing condition, in the form of several new Lyapunov-type equations, which parameterizes all predictor-feedback controllers and can be seen as an important addition to Lyapunov stability theory. We then propose an iterative scheme for the Riccati-ZXL equations computations, along with convergence analysis, based on the condition. Inspired by this scheme, a data-driven online PI algorithm, convergence implied in that of the iterative scheme, is proposed for the optimal predictor-feedback control problem without full system dynamics. Finally, a numerical example is used to evaluate the proposed PI algorithm. (c) 2024 Published by Elsevier Ltd.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] Approximate Dynamic Programming for Output Feedback Control
    Jiang Yu
    Jiang Zhong-Ping
    PROCEEDINGS OF THE 29TH CHINESE CONTROL CONFERENCE, 2010, : 5815 - 5820
  • [2] Nonlinear Predictor-Feedback Cooperative Adaptive Cruise Control
    Bekiaris-Liberis, Nikolaos
    2023 62ND IEEE CONFERENCE ON DECISION AND CONTROL, CDC, 2023, : 932 - 937
  • [3] Adaptive feedback control by constrained approximate dynamic programming
    Ferrari, Silvia
    Steck, James E.
    Chandramohan, Rajeev
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04): : 982 - 987
  • [4] Optimal Tracking Control for Ship Course Using Approximate Dynamic Programming Method
    Xie Qingqing
    Luo Bin
    Tan Fuxiao
    2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 2911 - 2916
  • [5] Optimal Switching and Control of Nonlinear Switching Systems Using Approximate Dynamic Programming
    Heydari, Ali
    Balakrishnan, Sivasubramanya N.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2014, 25 (06) : 1106 - 1117
  • [6] Nonlinear Adaptive Flight Control Using Incremental Approximate Dynamic Programming and Output Feedback
    Zhou, Ye
    van Kampen, Erik-Jan
    Chu, QiPing
    JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 2017, 40 (02) : 493 - 500
  • [7] Using computer algebra for solving some optimal control problems by dynamic programming
    Boulehmi, M
    Calvet, JL
    COMPUTER AIDED CONTROL SYSTEMS DESIGN (CACSD'97), 1997, : 39 - 43
  • [8] Approximation of optimal feedback control: a dynamic programming approach
    Bao-Zhu Guo
    Tao-Tao Wu
    Journal of Global Optimization, 2010, 46 : 395 - 422
  • [9] Approximation of optimal feedback control: a dynamic programming approach
    Guo, Bao-Zhu
    Wu, Tao-Tao
    JOURNAL OF GLOBAL OPTIMIZATION, 2010, 46 (03) : 395 - 422
  • [10] A Dynamic Programming Approach for Approximate Optimal Control for Cancer Therapy
    Nowakowski, A.
    Popa, A.
    JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2013, 156 (02) : 365 - 379