Hierarchical Multiagent Formation Control Scheme via Actor-Critic Learning

Cited by: 17

Authors:

Mu, Chaoxu [1]

Peng, Jiangwen [1]

Sun, Changyin [2]

Affiliations:

[1] Tianjin Univ, Sch Elect & Informat Engn, Tianjin 300072, Peoples R China

[2] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS | 2023 / Vol. 34 / No. 11

Funding:

National Natural Science Foundation of China;

Keywords:

Convergence; Hybrid fiber coaxial cables; Heuristic algorithms; Games; Dynamic programming; Microgrids; Computational complexity; Adaptive dynamic programming (ADP); hierarchical formation control (HFC); multiagent system (MAS); multistep generalized policy iteration (MsGPI); neural networks (NNs); GROUP CONSENSUS; SYSTEMS SUBJECT;

DOI:

10.1109/TNNLS.2022.3153028

Chinese Library Classification:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This article presents a nearly optimal solution to the cooperative formation control problem for large-scale multiagent systems (MASs). First, the multigroup technique is widely used to decompose large-scale problems, but it provides no consensus between different subgroups. Inspired by the hierarchical structures applied in MASs, a hierarchical leader-following formation control structure based on the multigroup technique is constructed, in which two layers and three types of agents are designed. Second, the adaptive dynamic programming technique is applied to the optimal formation control problem by establishing a performance index function. Based on the traditional generalized policy iteration (PI) algorithm, the multistep generalized policy iteration (MsGPI) algorithm is developed by modifying the policy evaluation step. The new algorithm not only inherits the fast convergence and low computational complexity of the generalized PI algorithm but also further accelerates convergence and reduces run time. In addition, stability, convergence, and optimality analyses are given for the proposed MsGPI algorithm. A neural network-based actor-critic structure is then built to approximate the iterative control policies and value functions. Finally, a large-scale formation control example is provided to demonstrate the performance of the developed hierarchical leader-following formation control structure and the MsGPI algorithm.
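The abstract does not state the update equations, so the following is only a minimal sketch of generic generalized policy iteration in which the policy evaluation step runs several Bellman backups before the policy is improved again. It should not be read as the article's MsGPI algorithm. Everything concrete here is an assumption made for illustration: a discrete-time linear-quadratic problem, the matrices `A`, `B`, `Q`, `R`, the function names, and the choice to initialize the value function with the cost of the zero-input policy.

```python
import numpy as np

def improve_policy(A, B, Q, R, P):
    """Greedy feedback gain for V(x) = x^T P x, with u = -K x."""
    return np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)

def evaluate_multistep(A, B, Q, R, K, P, steps):
    """Apply `steps` Bellman backups of the fixed policy u = -K x to P."""
    A_cl = A - B @ K              # closed-loop dynamics under the policy
    stage = Q + K.T @ R @ K       # stage cost expressed as a quadratic form
    for _ in range(steps):
        P = stage + A_cl.T @ P @ A_cl
    return P

def zero_policy_cost(A, Q):
    """Cost matrix of u = 0 for a stable A: solves P = Q + A^T P A."""
    n = A.shape[0]
    vec_p = np.linalg.solve(np.eye(n * n) - np.kron(A.T, A.T), Q.flatten())
    return vec_p.reshape(n, n)

def generalized_policy_iteration(A, B, Q, R, steps=5, iters=500, tol=1e-10):
    """Alternate greedy improvement with a `steps`-backup evaluation."""
    P = zero_policy_cost(A, Q)    # assumed initial value function
    K = improve_policy(A, B, Q, R, P)
    for i in range(iters):
        K = improve_policy(A, B, Q, R, P)
        P_next = evaluate_multistep(A, B, Q, R, K, P, steps)
        if np.max(np.abs(P_next - P)) < tol:
            return P_next, K, i + 1
        P = P_next
    return P, K, iters

def riccati_residual(A, B, Q, R, P):
    """Residual of the discrete-time algebraic Riccati equation at P."""
    K = improve_policy(A, B, Q, R, P)
    return Q + A.T @ P @ A - A.T @ P @ B @ K - P

# Illustrative (hypothetical) stable two-state system with one input.
A = np.array([[0.9, 0.2], [0.0, 0.8]])
B = np.array([[0.0], [1.0]])
Q = np.eye(2)
R = np.array([[1.0]])

P_multi, K_multi, n_multi = generalized_policy_iteration(A, B, Q, R, steps=5)
P_single, K_single, n_single = generalized_policy_iteration(A, B, Q, R, steps=1)
```

With `steps=1` the loop reduces to value iteration, and as `steps` grows it approaches classical policy iteration. In this linear-quadratic toy case, both settings should reach the same Riccati solution; comparing `n_multi` with `n_single` gives a rough feel for how the evaluation depth trades outer iterations against work per iteration.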

Pages: 8764-8777

Page count: 14
