Hierarchical Multiagent Formation Control Scheme via Actor-Critic Learning

被引:17
|
作者
Mu, Chaoxu [1 ]
Peng, Jiangwen [1 ]
Sun, Changyin [2 ]
机构
[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China
[2] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
基金
中国国家自然科学基金;
关键词
Convergence; Hybrid fiber coaxial cables; Heuristic algorithms; Games; Dynamic programming; Microgrids; Computational complexity; Adaptive dynamic programming (ADP); hierarchical formation control (HFC); multiagent system (MAS); multistep generalized policy iteration (MsGPI); neural networks (NNs); GROUP CONSENSUS; SYSTEMS SUBJECT;
D O I
10.1109/TNNLS.2022.3153028
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article presents a nearly optimal solution to the cooperative formation control problem for large-scale multiagent system (MAS). First, multigroup technique is widely used for the decomposition of the large-scale problem, but there is no consensus between different subgroups. Inspired by the hierarchical structure applied in the MAS, a hierarchical leader-following formation control structure with multigroup technique is constructed, where two layers and three types of agents are designed. Second, adaptive dynamic programming technique is conformed to the optimal formation control problem by the establishment of performance index function. Based on the traditional generalized policy iteration (PI) algorithm, the multistep generalized policy iteration (MsGPI) is developed with the modification of policy evaluation. The novel algorithm not only inherits the advantages of high convergence speed and low computational complexity in the generalized PI algorithm but also further accelerates the convergence speed and reduces run time. Besides, the stability analysis, convergence analysis, and optimality analysis are given for the proposed multistep PI algorithm. Afterward, a neural network-based actor-critic structure is built for approximating the iterative control policies and value functions. Finally, a large-scale formation control problem is provided to demonstrate the performance of our developed hierarchical leader-following formation control structure and MsGPI algorithm.
引用
收藏
页码:8764 / 8777
页数:14
相关论文
共 50 条
  • [21] Federated Multiagent Actor-Critic Learning for Age Sensitive Mobile-Edge Computing
    Zhu, Zheqi
    Wan, Shuo
    Fan, Pingyi
    Letaief, Khaled B.
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (02) : 1053 - 1067
  • [22] Actor-critic reinforcement learning for the feedback control of a swinging chain
    Dengler, C.
    Lohmann, B.
    IFAC PAPERSONLINE, 2018, 51 (13): : 378 - 383
  • [23] Actor-Critic Policy Optimization in Partially Observable Multiagent Environments
    Srinivasan, Sriram
    Lanctot, Marc
    Zambaldi, Vinicius
    Perolat, Julien
    Tuyls, Karl
    Munos, Remi
    Bowling, Michael
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [24] Actor-critic learning based PID control for robotic manipulators
    Nohooji, Hamed Rahimi
    Zaraki, Abolfazl
    Voos, Holger
    APPLIED SOFT COMPUTING, 2024, 151
  • [25] Decentralized Multiagent Actor-Critic Algorithm Based on Message Diffusion
    Ding, Siyuan
    Li, Shengxiang
    Liu, Guangyi
    Li, Ou
    Ke, Ke
    Bai, Yijie
    Chen, Weiye
    JOURNAL OF SENSORS, 2021, 2021
  • [26] Actor-Critic Model Predictive Control
    Romero, Angel
    Song, Yunlong
    Scaramuzza, Davide
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024), 2024, : 14777 - 14784
  • [27] Multi-Agent Hierarchical Graph Attention Actor-Critic Reinforcement Learning
    Li, Tongyue
    Shi, Dianxi
    Jin, Songchang
    Wang, Zhen
    Yang, Huanhuan
    Chen, Yang
    ENTROPY, 2025, 27 (01)
  • [28] Deep Reinforcement Learning in VizDoom via DQN and Actor-Critic Agents
    Bakhanova, Maria
    Makarov, Ilya
    ADVANCES IN COMPUTATIONAL INTELLIGENCE, IWANN 2021, PT I, 2021, 12861 : 138 - 150
  • [29] Actor-critic algorithms for hierarchical Markov decision processes
    Bhatnagar, S
    Panigrahi, JR
    AUTOMATICA, 2006, 42 (04) : 637 - 644
  • [30] Learning Locomotion for Quadruped Robots via Distributional Ensemble Actor-Critic
    Li, Sicen
    Pang, Yiming
    Bai, Panju
    Li, Jiawei
    Liu, Zhaojin
    Hu, Shihao
    Wang, Liquan
    Wang, Gang
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (02) : 1811 - 1818