Online adaptive optimal control algorithm based on synchronous integral reinforcement learning with explorations

被引:7
|
作者
Guo, Lei [1 ]
Zhao, Han [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
Reinforcement learning; Neural networks; Adaptive control; Actor; -critic; Explorations; TIME LINEAR-SYSTEMS; NONLINEAR-SYSTEMS;
D O I
10.1016/j.neucom.2022.11.055
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study, we present a novel algorithm, based on synchronous policy iteration, to solve the continuous-time infinite-horizon optimal control problem of input affine system dynamics. The integral reinforcement is measured as an excitation signal to estimate the solution to the Hamilton-Jacobi-Bell man equation. In addition, the proposed method is completely model-free, that is, no a priori knowledge of the system is required. Using the adaptive tuning law, the actor and critic neural networks can simultaneously approximate the optimal value function and policy. The persistence of excitation condition is required to guarantee the convergence of the two networks. Unlike in traditional policy iteration algorithms, the restriction of the initial admissible policy was eliminated using this method. The effectiveness of the proposed algorithm is verified through numerical simulations. (c) 2022 Elsevier B.V. All rights reserved.
引用
收藏
页码:250 / 261
页数:12
相关论文
共 50 条
  • [21] Policy iteration-based integral reinforcement learning for online adaptive trajectory tracking of mobile robot
    Ashida T.
    Ichihara H.
    SICE Journal of Control, Measurement, and System Integration, 2021, 14 (01) : 233 - 241
  • [22] Adaptive Load Frequency Control of Wind Power System Based on Online Reinforcement Learning
    Yang L.
    Sun Y.
    Xu J.
    Liao S.
    Peng L.
    Dianli Xitong Zidonghua/Automation of Electric Power Systems, 2020, 44 (12): : 74 - 83
  • [23] V-learning: An adaptive reinforcement learning algorithm for the optimal stopping problem
    Li, Xiying
    Lee, Chi-Guhn
    EXPERT SYSTEMS WITH APPLICATIONS, 2023, 231
  • [24] Distributed adaptive cooperative optimal output regulation via integral reinforcement learning
    Lin, Liquan
    Huang, Jie
    AUTOMATICA, 2024, 170
  • [25] Online Reinforcement Learning Control Algorithm for Concentration of Thickener Underflow
    Yuan Z.-L.
    He R.-Z.
    Yao C.
    Li J.
    Ban X.-J.
    Zidonghua Xuebao/Acta Automatica Sinica, 2021, 47 (07): : 1558 - 1571
  • [26] Constrained adaptive optimal control using a reinforcement learning agent
    Lin, Wei-Song
    Zheng, Chen-Hong
    AUTOMATICA, 2012, 48 (10) : 2614 - 2619
  • [27] Reinforcement learning and optimal adaptive control: An overview and implementation examples
    Khan, Said G.
    Herrmann, Guido
    Lewis, Frank L.
    Pipe, Tony
    Melhuish, Chris
    ANNUAL REVIEWS IN CONTROL, 2012, 36 (01) : 42 - 59
  • [28] Adaptive linearization control based on reinforcement learning
    Hwang, KS
    Chiou, JY
    2002 IEEE REGION 10 CONFERENCE ON COMPUTERS, COMMUNICATIONS, CONTROL AND POWER ENGINEERING, VOLS I-III, PROCEEDINGS, 2002, : 1483 - 1486
  • [29] Space manipulator optimal impedance control using integral reinforcement learning
    Wu, Han
    Hu, Qinglei
    Shi, Yongxia
    Zheng, Jianying
    Sun, Kaipeng
    Wang, Jiawen
    AEROSPACE SCIENCE AND TECHNOLOGY, 2023, 139
  • [30] DC Motor Control based on Integral Reinforcement Learning
    Bujgoi, Gheorghe
    Sendrescu, Dorin
    2022 23RD INTERNATIONAL CARPATHIAN CONTROL CONFERENCE (ICCC), 2022, : 282 - 286