Optimized Leader-Follower Consensus Control Using Reinforcement Learning for a Class of Second-Order Nonlinear Multiagent Systems

被引:46
|
作者
Wen, Guoxing [1 ]
Li, Bin [2 ]
机构
[1] Binzhou Univ, Coll Sci, Binzhou 256600, Shandong, Peoples R China
[2] Qilu Univ Technol, Sch Math & Stat, Shandong Acad Sci, Jinan 250353, Shandong, Peoples R China
基金
中国国家自然科学基金;
关键词
Optimal control; Multi-agent systems; Artificial neural networks; Heuristic algorithms; Reinforcement learning; Consensus control; Topology; Double integrator dynamic; multiagent system; neural network (NN); optimal control; reinforcement learning (RL); unknown nonlinear dynamic; HJB EQUATION; NETWORKS;
D O I
10.1109/TSMC.2021.3130070
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this article, an optimized leader-follower consensus control is proposed for a class of second-order unknown nonlinear dynamical multiagent system. Different with the first-order multiagent consensus, the second-order case needs to achieve the agreement not only on position but also on velocity, therefore this optimized control is more challenging and interesting. To derive the control, reinforcement learning (RL) can be a natural consideration because it can overcome the difficulty of solving the Hamilton-Jacobi-Bellman (HJB) equation. To implement RL, it needs to iterate both adaptive critic and actor networks each other. However, if this optimized control learns RL from most existing optimal methods that derives the critic and actor adaptive laws from the negative gradient of square of the approximating function of the HJB equation, this control algorithm will be very intricate, because the HJB equation correlated to a second-order nonlinear multiagent system will become very complex due to strong state coupling and nonlinearity. In this work, since the two RL adaptive laws are derived via implementing the gradient descent method to a simple positive function, which is obtained on the basis of a partial derivative of the HJB equation, this optimized control is significantly simple. Meanwhile, it not merely can avoid the requirement of known dynamic acknowledge, but also can release the condition of persistent excitation, which is demanded in most RL optimization methods for training the adaptive parameter more sufficiently. Finally, the proposed control is demonstrated by both theory and computer simulation.
引用
收藏
页码:5546 / 5555
页数:10
相关论文
共 50 条
  • [31] Leader-Follower finite-time consensus of multiagent systems with nonlinear dynamics by intermittent protocol
    He, Shengchao
    Liu, Xiangdong
    Lu, Pingli
    Liu, Haikuo
    Du, Changkun
    Journal of the Franklin Institute, 2022, 359 (06) : 2646 - 2662
  • [32] Optimized leader-follower consensus control using combination of reinforcement learning and sliding mode mechanism for multiple robot manipulator system
    Song, Yanfen
    Li, Zijun
    Li, Bin
    Wen, Guoxing
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024, 34 (08) : 5212 - 5228
  • [33] Composite Backstepping Consensus Algorithms of Leader-Follower Higher-Order Nonlinear Multiagent Systems Subject to Mismatched Disturbances
    Wang, Xiangyu
    Li, Shihua
    Chen, Michael Z. Q.
    IEEE TRANSACTIONS ON CYBERNETICS, 2018, 48 (06) : 1935 - 1946
  • [34] Distributed Control of Leader-follower Systems under Adversarial Inputs Using Reinforcement Learning
    Moghadam, Rohollah
    Wei, Qinglai
    Modares, Hamidreza
    2017 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2017, : 1885 - 1892
  • [35] Stochastic Consensus Control of Second-Order Nonlinear Multiagent Systems With External Disturbances
    Ji, Huihui
    Zhang, Hai-Tao
    Ye, Zhiyong
    Zhang, He
    Xu, Bowen
    Chen, Guanrong
    IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2018, 5 (04): : 1585 - 1596
  • [36] Pinning Control of Lag-Consensus for Second-Order Nonlinear Multiagent Systems
    Wang, Yi
    Ma, Zhongjun
    Zheng, Song
    Chen, Guanrong
    IEEE TRANSACTIONS ON CYBERNETICS, 2017, 47 (08) : 2203 - 2211
  • [37] Leader-Follower Consensus of Linear Multiagent Systems With Magnitude and Rate Saturation and an Active Leader
    Li, Pengyuan
    Jabbari, Faryar
    Sun, Xi-Ming
    IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2023, 68 (09) : 5584 - 5591
  • [38] Second Order Consensus for Leader-follower Multi-agent Systems with Prescribed Performance
    Chen, Fei
    Dimarogonas, Dimos, V
    IFAC PAPERSONLINE, 2019, 52 (20): : 103 - 108
  • [39] Second-Order Consensus in Multiagent Systems via Nonlinear Protocol
    Pan, Huan
    Nian, Xiaohong
    Guo, Ling
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2013, 2013
  • [40] Leader-follower consensus of multiagent systems via reset observer-based control approach
    Zhong, Guang-Xin
    Xiao, Qian-Cheng
    Li, Jian-Ning
    Li, Jian
    Long, Yue
    Zhao, Xiao-Qi
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2024, 361 (03): : 1555 - 1565