A Distributed Actor-Critic Learning Approach for Affine Formation Control of Multi-Robots With Unknown Dynamics

被引：0

作者：

Zhang, Ronghua ^{[1
,2
]}

Ma, Qingwen ^{[1
]}

Zhang, Xinglong ^{[1
]}

Xu, Xin ^{[1
]}

Liu, Daxue ^{[1
]}

机构：

[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha, Peoples R China

[2] Sichuan Univ Sci & Engn, Sch Mech Engn, Zigong, Peoples R China

来源：

INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING | 2025年

关键词：

affine formation control; data-driven; multi-robots; reinforcement learning; rollout; TIME NONLINEAR-SYSTEMS;

D O I：

10.1002/acs.3972

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Formation maneuverability is particularly important for multi-robots (MRs), especially when the robots are operating cooperatively in complex and dynamic environments. Although various methods have been developed for affine formation, it is still a difficult problem to design an affine formation controller for MRs with unknown dynamics. In this paper, a distributed actor-critic learning approach (DACL) in a look-ahead rollout manner is proposed for the affine formation of MRs under local communication, which improves the online learning efficiency. In the proposed approach, a distributed data-driven online optimization mechanism is designed via the sparse kernel technique to solve the near-optimal affine formation control issue of MRs with unknown dynamics as well as improve control performance. The unknown dynamics of MRs are learned offline based on precollected input-output datasets, and the sparse kernel-based approach is employed to increase the feature representation capability of the samples. Then, the proposed distributed online actor-critic algorithm for each robot in the formation includes two neural networks, which are utilized to approximate the costate functions and the near-optimal policies. Moreover, the convergence analysis of the proposed approach has been conducted. Finally, numerical simulation and KKSwarm-based experiment studies are performed to verify the effectiveness of the proposed approach.

引用

页数：15

共 50 条

[1] Distributed Multi-Agent Reinforcement Learning by Actor-Critic Method
Heredia, Paulo C.
Mou, Shaoshuai
IFAC PAPERSONLINE, 2019, 52 (20): : 363 - 368
[2] Adaptive actor-critic learning for the control of mobile robots by applying predictive models
Syam, R
Watanabe, K
Izumi, K
SOFT COMPUTING, 2005, 9 (11) : 835 - 845
[3] Adaptive actor-critic learning for the control of mobile robots by applying predictive models
Rafiuddin Syam
Keigo Watanabe
Kiyotaka Izumi
Soft Computing, 2005, 9 : 835 - 845
[4] Receding Horizon Actor-Critic Learning Control for Nonlinear Time-Delay Systems With Unknown Dynamics
Liu, Jiahang
Zhang, Xinglong
Xu, Xin
Xiong, Quan
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (08): : 4980 - 4993
[5] Multi-actor mechanism for actor-critic reinforcement learning
Li, Lin
Li, Yuze
Wei, Wei
Zhang, Yujia
Liang, Jiye
INFORMATION SCIENCES, 2023, 647
[6] Distributed Actor-Critic Learning Using Emphatic Weightings
Stankovic, Milos S.
Beko, Marko
Stankovic, Srdjan S.
2022 8TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT'22), 2022, : 1167 - 1172
[7] Hierarchical Multiagent Formation Control Scheme via Actor-Critic Learning
Mu, Chaoxu
Peng, Jiangwen
Sun, Changyin
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 8764 - 8777
[8] USING ACTOR-CRITIC REINFORCEMENT LEARNING FOR CONTROL AND FLIGHT FORMATION OF QUADROTORS
Torres, Edgar
Xu, Lei
Sardarmehni, Tohid
PROCEEDINGS OF ASME 2022 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2022, VOL 5, 2022,
[9] Deep Actor-Critic Learning for Distributed Power Control in Wireless Mobile Networks
Nasir, Yasar Sinan
Guo, Dongning
2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, : 398 - 402
[10] Adaptive actor-critic control of robots with integral invariant manifold
Pantoja-Garcia, Luis
Garcia-Rodriguez, Rodolfo
Parra-Vega, Vicente
2021 IEEE CHILEAN CONFERENCE ON ELECTRICAL, ELECTRONICS ENGINEERING, INFORMATION AND COMMUNICATION TECHNOLOGIES (IEEE CHILECON 2021), 2021, : 782 - 787

← 1 2 3 4 5 →