Online adaptive optimal control algorithm based on synchronous integral reinforcement learning with explorations

被引:7
|
作者
Guo, Lei [1 ]
Zhao, Han [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Neural networks; Adaptive control; Actor; -critic; Explorations; TIME LINEAR-SYSTEMS; NONLINEAR-SYSTEMS;
D O I
10.1016/j.neucom.2022.11.055
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, we present a novel algorithm, based on synchronous policy iteration, to solve the continuous-time infinite-horizon optimal control problem of input affine system dynamics. The integral reinforcement is measured as an excitation signal to estimate the solution to the Hamilton-Jacobi-Bell man equation. In addition, the proposed method is completely model-free, that is, no a priori knowledge of the system is required. Using the adaptive tuning law, the actor and critic neural networks can simultaneously approximate the optimal value function and policy. The persistence of excitation condition is required to guarantee the convergence of the two networks. Unlike in traditional policy iteration algorithms, the restriction of the initial admissible policy was eliminated using this method. The effectiveness of the proposed algorithm is verified through numerical simulations. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:250 / 261
页数:12
相关论文
共 50 条
  • [41] Adaptive Algorithm for Selecting the Optimal Trading Strategy Based on Reinforcement Learning for Managing a Hedge Fund
    Belyakov, B.
    Sizykh, D.
    IEEE ACCESS, 2024, 12 : 189047 - 189063
  • [42] An Adaptive AQM Algorithm Based on Neuron Reinforcement Learning
    Zhou, Chuan
    Di, Dongjie
    Chen, Qingwei
    Guo, Jian
    2009 IEEE INTERNATIONAL CONFERENCE ON CONTROL AND AUTOMATION, VOLS 1-3, 2009, : 1342 - 1346
  • [43] Online Adaptive Optimal Control of Discrete-time Linear Systems via Synchronous Q-learning
    Li, Xinxing
    Wang, Xueyuan
    Zha, Wenzhong
    2021 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2021, : 2024 - 2029
  • [44] Optimal Reinforcement Learning-Based Control Algorithm for a Class of Nonlinear Macroeconomic Systems
    Ding, Qing
    Jahanshahi, Hadi
    Wang, Ye
    Bekiros, Stelios
    Alassafi, Madini O.
    MATHEMATICS, 2022, 10 (03)
  • [45] Adaptive PI Controller Based on a Reinforcement Learning Algorithm for Speed Control of a DC Motor
    Alejandro-Sanjines, Ulbio
    Maisincho-Jivaja, Anthony
    Asanza, Victor
    Lorente-Leyva, Leandro L.
    Peluffo-Ordonez, Diego H.
    BIOMIMETICS, 2023, 8 (05)
  • [46] Adaptive Discretization in Online Reinforcement Learning
    Sinclair, Sean R.
    Banerjee, Siddhartha
    Yu, Christina Lee
    OPERATIONS RESEARCH, 2023, 71 (05) : 1636 - 1652
  • [47] Online optimal scheduling of a microgrid based on deep reinforcement learning
    Ji, Ying
    Wang, Jian-Hui
    Kongzhi yu Juece/Control and Decision, 2022, 37 (07): : 1675 - 1684
  • [48] Online Adaptive Decoding of Motor Imagery Based on Reinforcement Learning
    Li, Jingmeng
    Qu, Shen
    Chen, Weihai
    Chu, Junsheng
    Sun, Yu
    PROCEEDINGS OF THE 2019 14TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA 2019), 2019, : 522 - 527
  • [49] Optimal dynamic Control Allocation with guaranteed constraints and online Reinforcement Learning
    Kolaric, Patrik
    Lopez, Victor G.
    Lewis, Frank L.
    AUTOMATICA, 2020, 122
  • [50] Online Continual Safe Reinforcement Learning-based Optimal Control of Mobile Robot Formations
    Ganie, Irfan
    Jagannathan, S.
    2024 IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS, CCTA 2024, 2024, : 519 - 524