Optimal consensus control for multi-agent systems: Multi-step policy gradient adaptive dynamic programming method

被引:4
|
作者
Ji, Lianghao [1 ,3 ]
Jian, Kai [1 ]
Zhang, Cuijuan [1 ]
Yang, Shasha [1 ]
Guo, Xing [1 ]
Li, Huaqing [2 ]
机构
[1] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing, Peoples R China
[2] Southwest Univ, Coll Elect & Informat Engn, Chongqing, Peoples R China
[3] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China
来源
IET CONTROL THEORY AND APPLICATIONS | 2023年 / 17卷 / 11期
基金
中国国家自然科学基金;
关键词
complex networks; dynamic programming; intelligent control; multi-agent systems; optimal control; OPTIMAL TRACKING CONTROL; ALGORITHM; FRAMEWORK;
D O I
10.1049/cth2.12473
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel adaptive dynamic programming (ADP) method to solve the optimal consensus problem for a class of discrete-time multi-agent systems with completely unknown dynamics. Different from the classical RL-based optimal control algorithms based on one-step temporal difference method, a multi-step-based (also call n-step) policy gradient ADP (MS-PGADP) algorithm, which have been proved to be more efficient owing to its faster propagation of the reward, is proposed to obtain the iterative control policies. Moreover, a novel Q-function is defined, which estimates the performance of performing an action in the current state. Then, through the Lyapunov stability theorem and functional analysis, the proof of optimality of the performance index function is given and the stability of the error system is also proved. Furthermore, the actor-critic neural networks are used to implement the proposed method. Inspired by deep Q network, the target network is also introduced to guarantee the stability of NNs in the process of training. Finally, two simulations are conducted to verify the effectiveness of the proposed algorithm.
引用
收藏
页码:1443 / 1457
页数:15
相关论文
共 50 条
  • [21] Multi-step heuristic dynamic programming for optimal control of nonlinear discrete-time systems
    Luo, Biao
    Liu, Derong
    Huang, Tingwen
    Yang, Xiong
    Ma, Hongwen
    INFORMATION SCIENCES, 2017, 411 : 66 - 83
  • [22] Adaptive consensus estimation of multi-agent systems
    Demetriou, Michael A.
    Nestinger, Stephen S.
    2011 50TH IEEE CONFERENCE ON DECISION AND CONTROL AND EUROPEAN CONTROL CONFERENCE (CDC-ECC), 2011, : 354 - 359
  • [23] A Survey on Optimal Consensus of Multi-agent Systems
    Sun, Hui
    Liu, Yungang
    Li, Fengzhong
    Niu, Xinglong
    2017 CHINESE AUTOMATION CONGRESS (CAC), 2017, : 4978 - 4983
  • [24] Randomized optimal consensus of multi-agent systems
    Shi, Guodong
    Johansson, Karl Henrik
    AUTOMATICA, 2012, 48 (12) : 3018 - 3030
  • [25] Distributed consensus of linear multi-agent systems with adaptive dynamic protocols
    Li, Zhongkui
    Ren, Wei
    Liu, Xiangdong
    Xie, Lihua
    AUTOMATICA, 2013, 49 (07) : 1986 - 1995
  • [26] Fuzzy adaptive dynamic programming-based optimal leader-following consensus for heterogeneous nonlinear multi-agent systems
    Yuliang Cai
    Huaguang Zhang
    Kun Zhang
    Chong Liu
    Neural Computing and Applications, 2020, 32 : 8763 - 8781
  • [27] Fuzzy adaptive dynamic programming-based optimal leader-following consensus for heterogeneous nonlinear multi-agent systems
    Cai, Yuliang
    Zhang, Huaguang
    Zhang, Kun
    Liu, Chong
    NEURAL COMPUTING & APPLICATIONS, 2020, 32 (13): : 8763 - 8781
  • [28] Dynamic consensus of linear multi-agent systems
    Li, Z.
    Duan, Z.
    Chen, G.
    IET CONTROL THEORY AND APPLICATIONS, 2011, 5 (01): : 19 - 28
  • [29] Consensus of multi-agent linear dynamic systems
    Wang, Jinhuan
    Cheng, Daizhan
    Hu, Xiaoming
    ASIAN JOURNAL OF CONTROL, 2008, 10 (02) : 144 - 155
  • [30] Adaptive H∞ Consensus Control of Multi-Agent Systems on Directed Graph
    Miyasato, Yoshihiko
    2015 54TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2015, : 7592 - 7597