Optimal Consensus Control for Continuous-time Multi-agent Systems via Actor-Critic Neural Networks

Cited by: 0
Authors
Jia, Xiao [1]
Wolter, Katinka [1]
Affiliations
[1] Free Univ Berlin, Dept Math & Comp Sci, Berlin, Germany
Source
2022 8TH INTERNATIONAL CONFERENCE ON AUTOMATION, ROBOTICS AND APPLICATIONS (ICARA 2022) | 2022
Keywords
optimal consensus control; reinforcement learning; policy iteration; actor-critic neural network; SYNCHRONIZATION; TRACKING;
DOI
10.1109/ICARA55094.2022.9738588
Chinese Library Classification number
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
This paper investigates the optimal consensus control problem for continuous-time multi-agent systems with switching topology by utilizing the framework of reinforcement learning. A leader-follower continuous-time high-order multi-agent system is formulated and the corresponding Hamilton-Jacobi-Bellman equation is presented. To calculate the performance index and the optimal consensus control law, a policy iteration (PI) algorithm is proposed and the convergence analysis of multi-agent systems for the algorithm is derived. Furthermore, an actor-critic neural network is applied for the PI algorithm, which does not require the knowledge of multi-agent system dynamics. A simulation example shows the effectiveness of the proposed optimal consensus control scheme.
Pages: 191 - 195
Number of pages: 5
Related papers
50 in total
  • [21] Event-triggered consensus control of continuous-time stochastic multi-agent systems
    Cao, Xiangyang
    Zhang, Chenghui
    Zhao, Daduan
    Sun, Bo
    Li, Yan
    AUTOMATICA, 2022, 137
  • [22] Localized data-driven consensus control for continuous-time multi-agent systems
    Chang, Zeze
    Li, Zhongkui
    INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2024,
  • [23] A DDPG-based solution for optimal consensus of continuous-time linear multi-agent systems
    Li, Ye
    Liu, ZhongXin
    Lan, Ge
    Sader, Malika
    Chen, ZengQiang
    SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2023, 66 (08) : 2441 - 2453
  • [24] A DDPG-based solution for optimal consensus of continuous-time linear multi-agent systems
    Ye Li
    ZhongXin Liu
    Ge Lan
    Malika Sader
    ZengQiang Chen
    Science China Technological Sciences, 2023, 66 : 2441 - 2453
  • [25] A DDPG-based solution for optimal consensus of continuous-time linear multi-agent systems
    LI Ye
    LIU ZhongXin
    LAN Ge
    SADER Malika
    CHEN ZengQiang
    Science China(Technological Sciences), 2023, (08) : 2441 - 2453
  • [26] A DDPG-based solution for optimal consensus of continuous-time linear multi-agent systems
    LI Ye
    LIU ZhongXin
    LAN Ge
    SADER Malika
    CHEN ZengQiang
    Science China(Technological Sciences), 2023, 66 (08) : 2441 - 2453
  • [27] Consensus design for continuous-time multi-agent systems with communication delay
    Zhenhua Wang
    Keyou You
    Juanjuan Xu
    Huanshui Zhang
    Journal of Systems Science and Complexity, 2014, 27 : 701 - 711
  • [28] Actor-Critic Algorithms for Constrained Multi-agent Reinforcement Learning
    Diddigi, Raghuram Bharadwaj
    Reddy, D. Sai Koti
    Prabuchandran, K. J.
    Bhatnagar, Shalabh
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 1931 - 1933
  • [29] Multi-Agent Natural Actor-Critic Reinforcement Learning Algorithms
    Prashant Trivedi
    Nandyala Hemachandra
    Dynamic Games and Applications, 2023, 13 : 25 - 55
  • [30] A New Advantage Actor-Critic Algorithm For Multi-Agent Environments
    Paczolay, Gabor
    Harmati, Istvan
    2020 23RD IEEE INTERNATIONAL SYMPOSIUM ON MEASUREMENT AND CONTROL IN ROBOTICS (ISMCR), 2020,