A Distributed Actor-Critic Learning Approach for Affine Formation Control of Multi-Robots With Unknown Dynamics

被引:0
|
作者
Zhang, Ronghua [1 ,2 ]
Ma, Qingwen [1 ]
Zhang, Xinglong [1 ]
Xu, Xin [1 ]
Liu, Daxue [1 ]
机构
[1] Natl Univ Def Technol, Coll Intelligence Sci & Technol, Changsha, Peoples R China
[2] Sichuan Univ Sci & Engn, Sch Mech Engn, Zigong, Peoples R China
关键词
affine formation control; data-driven; multi-robots; reinforcement learning; rollout; TIME NONLINEAR-SYSTEMS;
D O I
10.1002/acs.3972
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Formation maneuverability is particularly important for multi-robots (MRs), especially when the robots are operating cooperatively in complex and dynamic environments. Although various methods have been developed for affine formation, it is still a difficult problem to design an affine formation controller for MRs with unknown dynamics. In this paper, a distributed actor-critic learning approach (DACL) in a look-ahead rollout manner is proposed for the affine formation of MRs under local communication, which improves the online learning efficiency. In the proposed approach, a distributed data-driven online optimization mechanism is designed via the sparse kernel technique to solve the near-optimal affine formation control issue of MRs with unknown dynamics as well as improve control performance. The unknown dynamics of MRs are learned offline based on precollected input-output datasets, and the sparse kernel-based approach is employed to increase the feature representation capability of the samples. Then, the proposed distributed online actor-critic algorithm for each robot in the formation includes two neural networks, which are utilized to approximate the costate functions and the near-optimal policies. Moreover, the convergence analysis of the proposed approach has been conducted. Finally, numerical simulation and KKSwarm-based experiment studies are performed to verify the effectiveness of the proposed approach.
引用
收藏
页数:15
相关论文
共 50 条
  • [1] Distributed Multi-Agent Reinforcement Learning by Actor-Critic Method
    Heredia, Paulo C.
    Mou, Shaoshuai
    IFAC PAPERSONLINE, 2019, 52 (20): : 363 - 368
  • [2] Adaptive actor-critic learning for the control of mobile robots by applying predictive models
    Syam, R
    Watanabe, K
    Izumi, K
    SOFT COMPUTING, 2005, 9 (11) : 835 - 845
  • [3] Adaptive actor-critic learning for the control of mobile robots by applying predictive models
    Rafiuddin Syam
    Keigo Watanabe
    Kiyotaka Izumi
    Soft Computing, 2005, 9 : 835 - 845
  • [4] Receding Horizon Actor-Critic Learning Control for Nonlinear Time-Delay Systems With Unknown Dynamics
    Liu, Jiahang
    Zhang, Xinglong
    Xu, Xin
    Xiong, Quan
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (08): : 4980 - 4993
  • [5] Multi-actor mechanism for actor-critic reinforcement learning
    Li, Lin
    Li, Yuze
    Wei, Wei
    Zhang, Yujia
    Liang, Jiye
    INFORMATION SCIENCES, 2023, 647
  • [6] Distributed Actor-Critic Learning Using Emphatic Weightings
    Stankovic, Milos S.
    Beko, Marko
    Stankovic, Srdjan S.
    2022 8TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT'22), 2022, : 1167 - 1172
  • [7] Hierarchical Multiagent Formation Control Scheme via Actor-Critic Learning
    Mu, Chaoxu
    Peng, Jiangwen
    Sun, Changyin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (11) : 8764 - 8777
  • [8] USING ACTOR-CRITIC REINFORCEMENT LEARNING FOR CONTROL AND FLIGHT FORMATION OF QUADROTORS
    Torres, Edgar
    Xu, Lei
    Sardarmehni, Tohid
    PROCEEDINGS OF ASME 2022 INTERNATIONAL MECHANICAL ENGINEERING CONGRESS AND EXPOSITION, IMECE2022, VOL 5, 2022,
  • [9] Deep Actor-Critic Learning for Distributed Power Control in Wireless Mobile Networks
    Nasir, Yasar Sinan
    Guo, Dongning
    2020 54TH ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS, AND COMPUTERS, 2020, : 398 - 402
  • [10] Adaptive actor-critic control of robots with integral invariant manifold
    Pantoja-Garcia, Luis
    Garcia-Rodriguez, Rodolfo
    Parra-Vega, Vicente
    2021 IEEE CHILEAN CONFERENCE ON ELECTRICAL, ELECTRONICS ENGINEERING, INFORMATION AND COMMUNICATION TECHNOLOGIES (IEEE CHILECON 2021), 2021, : 782 - 787