Optimal consensus control for multi-agent systems: Multi-step policy gradient adaptive dynamic programming method

被引:4
|
作者
Ji, Lianghao [1 ,3 ]
Jian, Kai [1 ]
Zhang, Cuijuan [1 ]
Yang, Shasha [1 ]
Guo, Xing [1 ]
Li, Huaqing [2 ]
机构
[1] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing, Peoples R China
[2] Southwest Univ, Coll Elect & Informat Engn, Chongqing, Peoples R China
[3] Chongqing Univ Posts & Telecommun, Chongqing Key Lab Image Cognit, Chongqing 400065, Peoples R China
来源
IET CONTROL THEORY AND APPLICATIONS | 2023年 / 17卷 / 11期
基金
中国国家自然科学基金;
关键词
complex networks; dynamic programming; intelligent control; multi-agent systems; optimal control; OPTIMAL TRACKING CONTROL; ALGORITHM; FRAMEWORK;
D O I
10.1049/cth2.12473
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a novel adaptive dynamic programming (ADP) method to solve the optimal consensus problem for a class of discrete-time multi-agent systems with completely unknown dynamics. Different from the classical RL-based optimal control algorithms based on one-step temporal difference method, a multi-step-based (also call n-step) policy gradient ADP (MS-PGADP) algorithm, which have been proved to be more efficient owing to its faster propagation of the reward, is proposed to obtain the iterative control policies. Moreover, a novel Q-function is defined, which estimates the performance of performing an action in the current state. Then, through the Lyapunov stability theorem and functional analysis, the proof of optimality of the performance index function is given and the stability of the error system is also proved. Furthermore, the actor-critic neural networks are used to implement the proposed method. Inspired by deep Q network, the target network is also introduced to guarantee the stability of NNs in the process of training. Finally, two simulations are conducted to verify the effectiveness of the proposed algorithm.
引用
收藏
页码:1443 / 1457
页数:15
相关论文
共 50 条
  • [31] Adaptive consensus of multi-agent systems via odd impulsive control
    Ma, Tiedong
    Zhang, Zhengle
    Cui, Bing
    NEUROCOMPUTING, 2018, 321 : 139 - 145
  • [32] Adaptive H∞ Consensus Control of Multi-Agent Systems with Time Delays
    Miyasato, Yoshihiko
    2015 54TH ANNUAL CONFERENCE OF THE SOCIETY OF INSTRUMENT AND CONTROL ENGINEERS OF JAPAN (SICE), 2015, : 572 - 577
  • [33] Optimized adaptive consensus control for multi-agent systems with prescribed performance
    Yan, Lei
    Liu, Zhi
    Chen, C. L. Philip
    Zhang, Yun
    Wu, Zongze
    INFORMATION SCIENCES, 2022, 613 : 649 - 666
  • [34] Adaptive H∞ consensus control of multi-agent systems on directed graph
    20161402196846
    (1) Department of Mathematical Analysis and Statistical Inference, Institute of Statistical Mathematics, Tachikawa, Tokyo; 190-8562, Japan, 1600, Cybernet Systems; et al.; Kozo Keikaku Engineering (KKE); MathWorks; Mitsubishi Electric; Springer (Institute of Electrical and Electronics Engineers Inc.):
  • [35] γ-adaptive consensus control for multi-agent systems with adjustable convergence speed
    Shi, Guanghui
    Xi, Jianxiang
    Fan, Zhiliang
    Zheng, Tang
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 8559 - 8564
  • [36] Optimized adaptive consensus control for multi-agent systems with prescribed performance
    Yan, Lei
    Liu, Zhi
    Philip Chen, C.L.
    Zhang, Yun
    Wu, Zongze
    Information Sciences, 2022, 613 : 649 - 666
  • [37] Adaptive impulsive consensus of multi-agent systems with control gain error
    Zhang, Liuyang
    Li, Teng
    Huang, Tao
    Huang, Junhao
    Ma, Tiedong
    2017 29TH CHINESE CONTROL AND DECISION CONFERENCE (CCDC), 2017, : 4610 - 4615
  • [38] Value Iteration Algorithm for Optimal Consensus Control of Multi-agent Systems
    Zhang, Qichao
    Zhao, Dongbin
    NEURAL INFORMATION PROCESSING (ICONIP 2018), PT VII, 2018, 11307 : 200 - 208
  • [39] Bipartite consensus of descriptor multi-agent systems via adaptive control
    Shi Weimin
    Cui Yulong
    Chen Wenhai
    Gao Lixin
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 8503 - 8508
  • [40] Adaptive Iterative Learning Control for Multi-Agent Systems Consensus Tracking
    Yang, Shiping
    Xu, Jian-Xin
    PROCEEDINGS 2012 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2012, : 2803 - 2808