Distributed Minmax Strategy for Consensus Tracking in Differential Graphical Games: A Model-Free Approach

Cited by: 3
Authors
Zhou, Yan [1]
Zhou, Jialing [2]
Wen, Guanghui [3]
Gan, Minggang [4]
Yang, Tao [5]
Affiliations
[1] Southeast Univ, Sch Cyber Sci & Engn, Nanjing 211189, Peoples R China
[2] Beijing Inst Technol, Adv Res Inst Multidisciplinary Sci, Beijing 100081, Peoples R China
[3] Southeast Univ, Dept Syst Sci, Nanjing 211189, Peoples R China
[4] Beijing Inst Technol, State Key Lab Intelligent Control & Decis Complex, Sch Automat, Beijing 100081, Peoples R China
[5] Northeastern Univ, State Key Lab Synthet Automat Proc Ind, Shenyang 110819, Peoples R China
Source
Funding
National Natural Science Foundation of China
Keywords
Asymptotic stability; Sufficient conditions; Heuristic algorithms; Riccati equations; Games; Reinforcement learning; Mathematical models; ADAPTIVE OPTIMAL-CONTROL; SYSTEMS; ITERATION;
DOI
10.1109/MSMC.2023.3282774
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
This article addresses the design of distributed minmax strategies for multiagent consensus tracking control problems with completely unknown dynamics in the presence of external disturbances or attacks. Each agent obtains its distributed minmax strategy by solving a multiagent zero-sum differential graphical game that involves both nonadversarial and adversarial behaviors. Solving such a game is equivalent to solving a game algebraic Riccati equation (GARE). Under mild assumptions on the performance matrices, L2 stability and asymptotic stability of the closed-loop consensus error systems are rigorously proven. Furthermore, inspired by data-driven off-policy reinforcement learning (RL), a model-free policy iteration (PI) algorithm is presented for each follower to generate the minmax strategy. Finally, simulations demonstrate the effectiveness of the proposed theoretical results.
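As a rough illustration of how policy iteration relates to a GARE, the sketch below treats a scalar, single-agent zero-sum game with known dynamics. It is not the article's model-free, distributed algorithm. The system dx/dt = a*x + b*u + d*w, the cost integrand q*x^2 + r*u^2 - gamma^2*w^2, the simultaneous gain update, and all function and parameter names are assumptions chosen for the example.

```python
import math


def gare_policy_iteration(a, b, d, q, r, gamma, iters=50, tol=1e-12):
    """Model-based policy iteration for a scalar zero-sum game (illustrative only).

    Assumed dynamics: dx/dt = a*x + b*u + d*w
    Assumed cost integrand: q*x^2 + r*u^2 - gamma^2*w^2
    Policies: u = -k_u * x (minimizer), w = k_w * x (maximizer).
    """
    # Assumption: a < 0, so the zero policy is an admissible starting point.
    k_u, k_w = 0.0, 0.0
    p = 0.0
    for _ in range(iters):
        a_cl = a - b * k_u + d * k_w  # closed-loop pole under current gains
        if a_cl >= 0:
            raise ValueError("closed loop is not stable; choose other initial gains")
        # Policy evaluation: scalar Lyapunov equation
        # 2*a_cl*p + q + r*k_u^2 - gamma^2*k_w^2 = 0
        p_new = -(q + r * k_u**2 - gamma**2 * k_w**2) / (2.0 * a_cl)
        # Policy improvement for both players
        k_u = b * p_new / r
        k_w = d * p_new / gamma**2
        converged = abs(p_new - p) < tol
        p = p_new
        if converged:
            break
    return p, k_u, k_w


def gare_residual(p, a, b, d, q, r, gamma):
    """Left-hand side of the scalar GARE: 2*a*p + q - p^2*(b^2/r - d^2/gamma^2)."""
    return 2.0 * a * p + q - p**2 * (b**2 / r - d**2 / gamma**2)


if __name__ == "__main__":
    params = dict(a=-1.0, b=1.0, d=1.0, q=1.0, r=1.0, gamma=2.0)
    p, k_u, k_w = gare_policy_iteration(**params)
    print("p =", p, "residual =", gare_residual(p, **params))
```

In this scalar setting the simultaneous update coincides with Newton's method applied to the GARE, so the iterate can be checked against the closed-form positive root (a + sqrt(a^2 + s*q)) / s with s = b^2/r - d^2/gamma^2. A data-driven variant, as described in the abstract, would replace the Lyapunov step with an estimate built from measured trajectories instead of using a, b, and d directly.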
Pages: 53-68
Number of pages: 16