Optimal consensus control for multi-agent systems: Multi-step policy gradient adaptive dynamic programming method

被引:4
|
作者
Ji, Lianghao [1 ,3 ]
Jian, Kai [1 ]
Zhang, Cuijuan [1 ]
Yang, Shasha [1 ]
Guo, Xing [1 ]
Li, Huaqing [2 ]
机构
[1] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing, Peoples R China
[2] Southwest Univ, Coll Elect & Informat Engn, Chongqing, Peoples R China
[3] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China
来源
IET CONTROL THEORY AND APPLICATIONS | 2023年 / 17卷 / 11期
基金
中国国家自然科学基金;
关键词
complex networks; dynamic programming; intelligent control; multi-agent systems; optimal control; OPTIMAL TRACKING CONTROL; ALGORITHM; FRAMEWORK;
D O I
10.1049/cth2.12473
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel adaptive dynamic programming (ADP) method to solve the optimal consensus problem for a class of discrete-time multi-agent systems with completely unknown dynamics. Different from the classical RL-based optimal control algorithms based on one-step temporal difference method, a multi-step-based (also call n-step) policy gradient ADP (MS-PGADP) algorithm, which have been proved to be more efficient owing to its faster propagation of the reward, is proposed to obtain the iterative control policies. Moreover, a novel Q-function is defined, which estimates the performance of performing an action in the current state. Then, through the Lyapunov stability theorem and functional analysis, the proof of optimality of the performance index function is given and the stability of the error system is also proved. Furthermore, the actor-critic neural networks are used to implement the proposed method. Inspired by deep Q network, the target network is also introduced to guarantee the stability of NNs in the process of training. Finally, two simulations are conducted to verify the effectiveness of the proposed algorithm.
引用
收藏
页码:1443 / 1457
页数:15
相关论文
共 50 条
  • [1] Pinning consensus control for switched multi-agent systems: A switched adaptive dynamic programming method
    Qi, Yiwen
    Geng, Honglin
    NONLINEAR ANALYSIS-HYBRID SYSTEMS, 2023, 48
  • [2] Pinning consensus control for switched multi-agent systems: A switched adaptive dynamic programming method
    Qi, Yiwen
    Geng, Honglin
    NONLINEAR ANALYSIS-HYBRID SYSTEMS, 2023, 48
  • [3] Optimal Consensus Control of Unknown Nonlinear Multi-Agent Systems Using Adaptive Dynamic Programming via MRAC
    Fu, Hao
    Chen, Xin
    Wang, Wei
    Li, Jinbin
    Zhang, Yaodong
    2018 37TH CHINESE CONTROL CONFERENCE (CCC), 2018, : 7141 - 7146
  • [4] Optimal containment control for multi-agent systems using fast adaptive dynamic programming
    Cao, Ao
    Wang, Fuyong
    Liu, Zhongxin
    Chen, Zengqiang
    ASIAN JOURNAL OF CONTROL, 2024,
  • [5] Dynamic Event-Triggered Consensus Control for Multi-Agent Systems Using Adaptive Dynamic Programming
    Zhang, Qi
    Yang, Yang
    Xie, Xiaoran
    Xu, Chunming
    Yang, Han
    IEEE ACCESS, 2022, 10 : 110285 - 110293
  • [6] Optimal Dynamic Average Consensus of Multi-Agent Systems
    Dong, Hangning
    Li, Chaoyong
    Chen, Sai
    IFAC PAPERSONLINE, 2019, 52 (24): : 179 - 183
  • [7] Adaptive Dynamic Programing for Optimal Consensus of Multi-agent Systems Using ANFIS
    Yu, Yinsheng
    Xie, Songbo
    Zhang, Xianxia
    2022 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2022), 2022, : 539 - 544
  • [8] Distributed optimal consensus control for nonlinear multi-agent systems with input saturation based on event-triggered adaptive dynamic programming method
    Shi, Zhengqing
    Zhou, Chuan
    INTERNATIONAL JOURNAL OF CONTROL, 2022, 95 (02) : 282 - 294
  • [9] Optimal consensus control of multi-agent systems with regular/irregular optimal control method
    Zhang H.-S.
    Xu J.-J.
    Kongzhi yu Juece/Control and Decision, 2023, 38 (08): : 2203 - 2210
  • [10] Online optimal consensus control of unknown linear multi-agent systems via time-based adaptive dynamic programming
    Liu, Yifan
    Li, Tieshan
    Shan, Qihe
    Yu, Renhai
    Wu, Yue
    Chen, C. L. Philip
    NEUROCOMPUTING, 2020, 404 : 137 - 144