Constrained predictive control for consensus of nonlinear multi-agent systems by using game Q-learning

被引:0
|
作者
Wang, Yan [1 ]
Xue, Huiwen [1 ]
Wen, Jiwei [1 ]
Liu, Jinfeng [2 ]
Luan, Xiaoli [1 ]
机构
[1] Jiangnan Univ, Sch Internet Things Engn, Key Lab Adv Proc Control Light Ind, Minist Educ, Wuxi 214122, Peoples R China
[2] Univ Alberta, Dept Chem & Mat Engn, Edmonton, AB T6G 2R3, Canada
基金
中国国家自然科学基金;
关键词
Multi-agent systems; Joint constraints; Learning predictive control; Barrier function;
D O I
10.1007/s11071-024-10698-5
中图分类号
TH [机械、仪表工业];
学科分类号
0802 ;
摘要
This paper develops constrained learning predictive control for achieving consensus in nonlinear multi-agent systems. First, a general predictive and learning framework is constructed for the optimization of control policies by employing an Identifier-Actor-Critic network. Specifically, the Identifier neural network is utilized to approximately characterize the dynamics of the nonlinear system and generate predictive data for available datasets. Each time point within the predictive horizon, regarded as a participant in a non-zero-sum game (NZSG), executes distributed policy and is fed into the Actor-Critic network. When the constrained control policies at all time points reach optimality via the policy gradient algorithm (PGA), the NZSG achieves Nash equilibrium. Subsequently, a gradient recentered self-concordant barrier function is employed to address the joint constraints on tracking error and control input. Moreover, by introducing incremental adjustments, the learning rate factors within the PGA are optimized to enhance the learning efficiency of the Actor-Critic network. Finally, simulation results demonstrate the effectiveness and the rapidity of achieving consensus of the learning predictive control approach compared to the general predictive control methodology.
引用
收藏
页码:11683 / 11700
页数:18
相关论文
共 50 条
  • [21] Robust H∞ Output Consensus in Heterogeneous Multi-agent Discrete-Time Systems Using Q-Learning Algorithm
    Valadbeigi, Amir Parviz
    Soltanian, Farzad
    Shasadeghi, Mokhtar
    IRANIAN JOURNAL OF SCIENCE AND TECHNOLOGY-TRANSACTIONS OF ELECTRICAL ENGINEERING, 2025,
  • [22] Continuous Q-Learning for Multi-Agent Cooperation
    Hwang, Kao-Shing
    Jiang, Wei-Cheng
    Lin, Yu-Hong
    Lai, Li-Hsin
    CYBERNETICS AND SYSTEMS, 2012, 43 (03) : 227 - 256
  • [23] Untangling Braids with Multi-Agent Q-Learning
    Khan, Abdullah
    Vernitski, Alexei
    Lisitsa, Alexei
    2021 23RD INTERNATIONAL SYMPOSIUM ON SYMBOLIC AND NUMERIC ALGORITHMS FOR SCIENTIFIC COMPUTING (SYNASC 2021), 2021, : 135 - 139
  • [24] Q-learning with FCMAC in multi-agent cooperation
    Hwang, Kao-Shing
    Chen, Yu-Jen
    Lin, Tzung-Feng
    ADVANCES IN NEURAL NETWORKS - ISNN 2006, PT 1, 2006, 3971 : 599 - 606
  • [25] An iterative Q-learning based global consensus of discrete-time saturated multi-agent systems
    Long, Mingkang
    Su, Housheng
    Wang, Xiaoling
    Jiang, Guo-Ping
    Wang, Xiaofan
    CHAOS, 2019, 29 (10)
  • [26] Trajectory Optimization for Nonlinear Multi-Agent Systems using Decentralized Learning Model Predictive Control
    Zhu, Edward L.
    Sturz, Yvonne R.
    Rosolia, Ugo
    Borrelli, Francesco
    2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 6198 - 6203
  • [27] Adaptive consensus control for output-constrained nonlinear multi-agent systems with actuator faults
    Sun, Yuan
    Shi, Peng
    Lim, Cheng-Chew
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2022, 359 (09): : 4216 - 4232
  • [28] Optimistic-Pessimistic Q-Learning Algorithm for Multi-Agent Systems
    Akchurina, Natalia
    MULTIAGENT SYSTEM TECHNOLOGIES, PROCEEDINGS, 2008, 5244 : 13 - 24
  • [29] A novel multi-agent Q-learning algorithm in cooperative multi-agent system
    Ou, HT
    Zhang, WD
    Zhang, WY
    Xu, XM
    PROCEEDINGS OF THE 3RD WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-5, 2000, : 272 - 276
  • [30] On consensus performance of nonlinear multi-agent systems with hybrid control
    Hu, Bin
    Guan, Zhi-Hong
    Jiang, Xiao-Wei
    Chi, Ming
    Yu, Li
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2016, 353 (13): : 3133 - 3150