Asynchronous iterative Q-learning based tracking control for nonlinear discrete-time multi-agent systems

Citations: 0
Authors
Shen, Ziwen [1]
Dong, Tao [1]
Huang, Tingwen [2]
Affiliations
[1] Southwest Univ, Coll Elect & Informat Engn, Chongqing 400715, Peoples R China
[2] Shenzhen Univ Adv Technol, Fac Comp Sci & Control Engn, Shenzhen 518055, Peoples R China
Keywords
Multi-agent; Discrete-time; Asynchronous iterative Q-learning; Tracking control; Optimal consensus control
DOI
10.1016/j.neunet.2024.106667
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper addresses the tracking control problem of nonlinear discrete-time multi-agent systems (MASs). First, a local neighborhood error system (LNES) is constructed. Then, a novel tracking algorithm based on asynchronous iterative Q-learning (AIQL) is developed, which transforms the tracking problem into the optimal regulation of the LNES. The AIQL-based algorithm maintains two Q values, Q_i^A and Q_i^B, for each agent i: Q_i^A is used to improve the control policy, and Q_i^B is used to evaluate the control policy. Moreover, a convergence analysis is given: the LNES is shown to converge to 0, so the tracking problem is solved. A neural network-based actor-critic framework is used to implement AIQL. The critic of AIQL is composed of two neural networks, which approximate Q_i^A and Q_i^B respectively. Finally, simulation results are given to verify the performance of the developed algorithm. They show that the AIQL-based tracking algorithm achieves a lower cost value and faster convergence than the IQL-based tracking algorithm.
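The abstract does not state how the LNES is defined. As a rough illustration only, the sketch below uses the form of local neighborhood tracking error that is common in the multi-agent consensus literature, e_i = sum_j a_ij (x_i - x_j) + b_i (x_i - x_0), with scalar agent states. The function name, the sign convention, the weights a_ij and b_i, and the 3-agent topology are all assumptions made for this example and are not taken from the paper.

```python
from typing import List


def local_neighborhood_errors(
    states: List[float],
    leader_state: float,
    adjacency: List[List[float]],
    pinning: List[float],
) -> List[float]:
    """Return e_i = sum_j a_ij * (x_i - x_j) + b_i * (x_i - x_0) for each agent.

    states       -- scalar state x_i of each follower agent (assumed scalar)
    leader_state -- state x_0 of the reference (leader) to be tracked
    adjacency    -- a_ij > 0 if agent i receives information from agent j
    pinning      -- b_i > 0 if agent i observes the leader directly
    """
    n = len(states)
    errors = []
    for i in range(n):
        # Disagreement with the neighbors that agent i can observe.
        neighbor_term = sum(
            adjacency[i][j] * (states[i] - states[j]) for j in range(n)
        )
        # Disagreement with the leader, if agent i is pinned to it.
        leader_term = pinning[i] * (states[i] - leader_state)
        errors.append(neighbor_term + leader_term)
    return errors


# Hypothetical chain topology: agent 0 sees the leader,
# agent 1 sees agent 0, and agent 2 sees agent 1.
adjacency = [
    [0.0, 0.0, 0.0],
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
]
pinning = [1.0, 0.0, 0.0]

errors = local_neighborhood_errors([1.0, 2.0, 4.0], 0.5, adjacency, pinning)
print(errors)
```

Under these assumptions, every local error is zero when all agents sit at the leader state, which is why regulating such an error system to 0 corresponds to solving the tracking problem.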
Pages: 13