Optimal Consensus Control for Continuous-time Multi-agent Systems via Actor-Critic Neural Networks

Citations: 0

Authors:

Jia, Xiao [1]

Wolter, Katinka [1]

Affiliations:

[1] Free Univ Berlin, Dept Math & Comp Sci, Berlin, Germany

Source:

2022 8TH INTERNATIONAL CONFERENCE ON AUTOMATION, ROBOTICS AND APPLICATIONS (ICARA 2022) | 2022

Keywords:

optimal consensus control; reinforcement learning; policy iteration; actor-critic neural network; SYNCHRONIZATION; TRACKING;

DOI:

10.1109/ICARA55094.2022.9738588

Chinese Library Classification:

TP [Automation and Computer Technology];

Discipline classification code:

0812;

Abstract:

This paper investigates the optimal consensus control problem for continuous-time multi-agent systems with switching topology within a reinforcement learning framework. A leader-follower continuous-time high-order multi-agent system is formulated and the corresponding Hamilton-Jacobi-Bellman equation is presented. To compute the performance index and the optimal consensus control law, a policy iteration (PI) algorithm is proposed and its convergence for the multi-agent system is analyzed. Furthermore, an actor-critic neural network is used to implement the PI algorithm, which does not require knowledge of the multi-agent system dynamics. A simulation example shows the effectiveness of the proposed optimal consensus control scheme.
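
As a rough illustration of the policy iteration idea mentioned in the abstract, the sketch below alternates policy evaluation and policy improvement for a single scalar linear system with a quadratic cost. It is not the paper's algorithm: the paper treats high-order multi-agent systems with switching topology and a model-free actor-critic approximation, whereas this toy version assumes the dynamics (a, b) are known and uses closed-form updates. The function names, the cost weights q and r, and the initial gain k0 are all hypothetical choices made for the example.

```python
import math


def policy_iteration_scalar(a, b, q, r, k0, iters=20):
    """Model-based policy iteration for dx/dt = a*x + b*u with cost
    integral of (q*x^2 + r*u^2) dt and linear policy u = -k*x.

    Assumption: k0 is stabilizing, i.e. a - b*k0 < 0.
    Returns the value coefficient p (V(x) = p*x^2) and the final gain k.
    """
    k = k0
    p = None
    for _ in range(iters):
        a_cl = a - b * k  # closed-loop dynamics under the current policy
        if a_cl >= 0:
            raise ValueError("current gain is not stabilizing")
        # Policy evaluation: solve the scalar Lyapunov equation
        # 2*a_cl*p + q + r*k^2 = 0 for p.
        p = (q + r * k * k) / (-2.0 * a_cl)
        # Policy improvement: minimize the Hamiltonian with respect to u.
        k = b * p / r
    return p, k


def riccati_scalar(a, b, q, r):
    """Closed-form positive root of 2*a*p - (b^2/r)*p^2 + q = 0, for comparison."""
    return r * (a + math.sqrt(a * a + b * b * q / r)) / (b * b)


if __name__ == "__main__":
    p, k = policy_iteration_scalar(a=1.0, b=1.0, q=1.0, r=1.0, k0=3.0)
    print(p, k, riccati_scalar(1.0, 1.0, 1.0, 1.0))
```

In this toy setting the iterates approach the positive solution of the scalar algebraic Riccati equation, which plays the role that the Hamilton-Jacobi-Bellman equation plays in the general problem.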


Pages: 191-195

Page count: 5


