Optimized Leader-Follower Consensus Control Using Reinforcement Learning for a Class of Second-Order Nonlinear Multiagent Systems

被引：46

作者：

Wen, Guoxing ^{[1
]}

Li, Bin ^{[2
]}

机构：

[1] Binzhou Univ, Coll Sci, Binzhou 256600, Shandong, Peoples R China

[2] Qilu Univ Technol, Sch Math & Stat, Shandong Acad Sci, Jinan 250353, Shandong, Peoples R China

来源：

IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2022年 / 52卷 / 09期

基金：

中国国家自然科学基金;

关键词：

Optimal control; Multi-agent systems; Artificial neural networks; Heuristic algorithms; Reinforcement learning; Consensus control; Topology; Double integrator dynamic; multiagent system; neural network (NN); optimal control; reinforcement learning (RL); unknown nonlinear dynamic; HJB EQUATION; NETWORKS;

D O I：

10.1109/TSMC.2021.3130070

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this article, an optimized leader-follower consensus control is proposed for a class of second-order unknown nonlinear dynamical multiagent system. Different with the first-order multiagent consensus, the second-order case needs to achieve the agreement not only on position but also on velocity, therefore this optimized control is more challenging and interesting. To derive the control, reinforcement learning (RL) can be a natural consideration because it can overcome the difficulty of solving the Hamilton-Jacobi-Bellman (HJB) equation. To implement RL, it needs to iterate both adaptive critic and actor networks each other. However, if this optimized control learns RL from most existing optimal methods that derives the critic and actor adaptive laws from the negative gradient of square of the approximating function of the HJB equation, this control algorithm will be very intricate, because the HJB equation correlated to a second-order nonlinear multiagent system will become very complex due to strong state coupling and nonlinearity. In this work, since the two RL adaptive laws are derived via implementing the gradient descent method to a simple positive function, which is obtained on the basis of a partial derivative of the HJB equation, this optimized control is significantly simple. Meanwhile, it not merely can avoid the requirement of known dynamic acknowledge, but also can release the condition of persistent excitation, which is demanded in most RL optimization methods for training the adaptive parameter more sufficiently. Finally, the proposed control is demonstrated by both theory and computer simulation.

引用

页码：5546 / 5555

页数：10

共 50 条

[21] Adaptive Neural Network Leader-Follower Formation Control for a Class of Second-Order Nonlinear Multi-Agent Systems With Unknown Dynamics
Wen, Guoxing
Zhang, Chenyang
Hu, Ping
Cui, Yang
IEEE ACCESS, 2020, 8 (08): : 148149 - 148156
[22] Adaptive Event-Triggered Consensus Control of a Class of Second-Order Nonlinear Multiagent Systems
Yang, Yang
Li, Yanfei
Yue, Dong
Yue, Wenbin
IEEE TRANSACTIONS ON CYBERNETICS, 2020, 50 (12) : 5010 - 5020
[23] Leader-Follower Weighted Consensus of Nonlinear Fractional-Order Multiagent Systems Using Current and Time Delay State Information
Chen, Liping
Liu, Chuang
Chu, Zhaobi
Lopes, Antonio M.
Chen, Yangquan
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 54 (09): : 5814 - 5823
[24] Leader-follower consensus control of Lipschitz nonlinear systems by output feedback
Isira, Ahmad Sadhiqin Mohd
Zuo, Zongyu
Ding, Zhengtao
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2016, 47 (16) : 3772 - 3781
[25] A multiagent fuzzy policy reinforcement learning algorithm with application to leader-follower robotic systems
Yang, Erfu
Gu, Dongbing
2006 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-12, 2006, : 3197 - +
[26] LQR-Based Optimal Leader-Follower Consensus of Second-Order Multi-agent Systems
Li, Zonggang
Zhang, Tongzhou
Xie, Guangming
PROCEEDINGS OF THE 2015 CHINESE INTELLIGENT SYSTEMS CONFERENCE, VOL 2, 2016, 360 : 353 - 361
[27] Leader-follower consensus control for a class of nonlinear multi-agent systems using dynamical neural networks
Munoz, Filiberto
Valdovinos, Jose Manuel
Cervantes-Rojas, Jorge Said
Cruz, Sergio Salazar
Santana, Alejandro Morfin
NEUROCOMPUTING, 2023, 561
[28] Finite-time leader-follower consensus control of multiagent systems with mismatched disturbances
Gu, Lixue
Zhao, Zhanshan
Sun, Jie
Wang, Zhangang
ASIAN JOURNAL OF CONTROL, 2022, 24 (02) : 722 - 731
[29] Higher Order Barrier Certificates for Leader-Follower Multiagent Systems
Sharifi, Maryam
Dimarogonas, Dimos V.
IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2023, 10 (02): : 900 - 911
[30] Leader-Follower finite-time consensus of multiagent systems with nonlinear dynamics by intermittent protocol
He, Shengchao
Liu, Xiangdong
Lu, Pingli
Liu, Haikuo
Du, Changkun
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2022, 359 (06): : 2646 - 2662

← 1 2 3 4 5 →