Online adaptive optimal control algorithm based on synchronous integral reinforcement learning with explorations

被引:7
|
作者
Guo, Lei [1 ]
Zhao, Han [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Neural networks; Adaptive control; Actor; -critic; Explorations; TIME LINEAR-SYSTEMS; NONLINEAR-SYSTEMS;
D O I
10.1016/j.neucom.2022.11.055
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, we present a novel algorithm, based on synchronous policy iteration, to solve the continuous-time infinite-horizon optimal control problem of input affine system dynamics. The integral reinforcement is measured as an excitation signal to estimate the solution to the Hamilton-Jacobi-Bell man equation. In addition, the proposed method is completely model-free, that is, no a priori knowledge of the system is required. Using the adaptive tuning law, the actor and critic neural networks can simultaneously approximate the optimal value function and policy. The persistence of excitation condition is required to guarantee the convergence of the two networks. Unlike in traditional policy iteration algorithms, the restriction of the initial admissible policy was eliminated using this method. The effectiveness of the proposed algorithm is verified through numerical simulations. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:250 / 261
页数:12
相关论文
共 50 条
  • [1] Online adaptive algorithm for optimal control with integral reinforcement learning
    Vamvoudakis, Kyriakos G.
    Vrabie, Draguna
    Lewis, Frank L.
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2014, 24 (17) : 2686 - 2710
  • [2] Optimal adaptive control of drug dosing using integral reinforcement learning
    Padmanabhan, Regina
    Meskin, Nader
    Haddad, Wassim M.
    MATHEMATICAL BIOSCIENCES, 2019, 309 : 131 - 142
  • [3] An Adaptive Online Parameter Control Algorithm for Particle Swarm Optimization Based on Reinforcement Learning
    Liu, Yaxian
    Lu, Hui
    Cheng, Shi
    Shi, Yuhui
    2019 IEEE CONGRESS ON EVOLUTIONARY COMPUTATION (CEC), 2019, : 815 - 822
  • [4] Online optimal and adaptive integral tracking control for varying discrete-time systems using reinforcement learning
    Sanusi, Ibrahim
    Mills, Andrew
    Dodd, Tony
    Konstantopoulos, George
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2020, 34 (08) : 971 - 991
  • [5] A novel adaptive control algorithm based on reinforcement learning
    Qian Zheng
    Sun Liang
    Ruan Xiaogang
    Proceedings of the 24th Chinese Control Conference, Vols 1 and 2, 2005, : 651 - 654
  • [6] Finite-Horizon Optimal Control for Nonlinear Multi-Input Systems With Online Adaptive Integral Reinforcement Learning
    Lv, Yongfeng
    Zhang, Wan
    Zhao, Jun
    Zhao, Xiaowei
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2025, 22 : 802 - 812
  • [7] Online Learning Algorithm Based on Adaptive Control Theory
    Liu, Jian-Wei
    Zhou, Jia-Jia
    Kamel, Mohamed S.
    Luo, Xiong-Lin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2278 - 2293
  • [8] Optimal Tracking Control Based on Integral Reinforcement Learning for An Underactuated Drone
    Li, Shaobao
    Durdevic, Petar
    Yang, Zhenyu
    IFAC PAPERSONLINE, 2019, 52 (08): : 55 - 60
  • [9] Event-triggered-based online integral reinforcement learning for optimal control of unknown constrained nonlinear systems
    Han, Xiumei
    Zhao, Xudong
    Wang, Ding
    Wang, Bohui
    INTERNATIONAL JOURNAL OF CONTROL, 2024, 97 (02) : 213 - 225
  • [10] Adaptive optimal trajectory tracking control of AUVs based on reinforcement learning
    Li, Zhifu
    Wang, Ming
    Ma, Ge
    ISA TRANSACTIONS, 2023, 137 : 122 - 132