Solving optimal predictor-feedback control using approximate dynamic programming

被引:1
|
作者
Wang, Hongxia [1 ]
Zhao, Fuyu [1 ]
Zhang, Zhaorong [2 ]
Xu, Juanjuan [3 ]
Li, Xun [4 ]
机构
[1] Shandong Univ Sci & Technol, Sch Elect Engn & Automat, Qingdao, Peoples R China
[2] Shandong Univ, Sch Comp Sci & Technol, Qingdao, Peoples R China
[3] Shandong Univ, Sch Control Sci & Engn, Jinan, Peoples R China
[4] Hong Kong Polytech Univ, Dept Appl Math, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Stochastic system; Optimal control; Input delay; Approximate dynamic programming; ADAPTIVE OPTIMAL-CONTROL; DISCRETE-TIME-SYSTEMS; LINEAR-SYSTEMS; LEARNING ALGORITHM; DELAYS; STATE;
D O I
10.1016/j.automatica.2024.111848
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper is concerned with approximately solving the optimal predictor-feedback control problem of multiplicative-noise systems with input delay in infinite horizon. The optimal predictor-feedback control, provided by the analytical method, is determined by Riccati-ZXL equations and is hard to obtain in the case of unknown system dynamics. We aim to propose a policy iteration (PI) algorithm for solving the optimal solution by approximate dynamic programming. For convergence analysis of the algorithm, we first develop a necessary and sufficient stabilizing condition, in the form of several new Lyapunov-type equations, which parameterizes all predictor-feedback controllers and can be seen as an important addition to Lyapunov stability theory. We then propose an iterative scheme for the Riccati-ZXL equations computations, along with convergence analysis, based on the condition. Inspired by this scheme, a data-driven online PI algorithm, convergence implied in that of the iterative scheme, is proposed for the optimal predictor-feedback control problem without full system dynamics. Finally, a numerical example is used to evaluate the proposed PI algorithm. (c) 2024 Published by Elsevier Ltd.
引用
收藏
页数:8
相关论文
共 50 条
  • [41] Robust Adaptive Optimal Tracking Switching Control of Nonlinear Systems Using Embedding Transformation and Approximate Dynamic Programming
    Wang, Weizhe
    Fu, Yue
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2025,
  • [42] Discrete-Time Optimal Control of State-Constrained Nonlinear Systems Using Approximate Dynamic Programming
    Song, Shijie
    Gong, Dawei
    Zhu, Minglei
    Zhao, Yuyang
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2025, 35 (03) : 858 - 871
  • [43] Predictor-Feedback Control of a Model of Microfluidic Process With Hydraulic Input-Dependent Input Delay
    Bekiaris-Liberis, Nikolaos
    Bresch-Pietri, Delphine
    Petit, Nicolas
    2023 EUROPEAN CONTROL CONFERENCE, ECC, 2023,
  • [44] Approximate Dynamic Programming, Local or Global Optimal Solution?
    Heydari, Ali
    Balakrishnan, S. N.
    2014 AMERICAN CONTROL CONFERENCE (ACC), 2014, : 1237 - 1242
  • [45] ε-Optimal Value and Approximate Multidimensional Dual Dynamic Programming
    Nowakowski, A.
    ASIAN JOURNAL OF CONTROL, 2013, 15 (02) : 444 - 452
  • [46] Min-max control using parametric approximate dynamic programming
    Nosair, Hussam
    Yang, Yu
    Lee, Jong Min
    CONTROL ENGINEERING PRACTICE, 2010, 18 (02) : 190 - 197
  • [47] Production control in a complex production system using approximate dynamic programming
    Wu, Han
    Evans, Gerald
    Bae, Ki-Hwan
    INTERNATIONAL JOURNAL OF PRODUCTION RESEARCH, 2016, 54 (08) : 2419 - 2432
  • [48] On-policy Approximate Dynamic Programming for Optimal Control of non-linear systems
    Shalini, K.
    Vrushabh, D.
    Sonam, K.
    2020 7TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT'20), VOL 1, 2020, : 1058 - 1062
  • [49] An approximate dynamic programming method for the optimal control of Alkai-Surfactant-Polymer flooding
    Ge, Yulei
    Li, Shurong
    Chan, Peng
    JOURNAL OF PROCESS CONTROL, 2018, 64 : 15 - 26
  • [50] Automata Theory Meets Approximate Dynamic Programming: Optimal Control with Temporal Logic Constraints
    Papusha, Ivan
    Fu, Jie
    Topcu, Ufuk
    Murray, Richard M.
    2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016, : 434 - 440