Autonomous Swarm Robot Coordination via Mean-Field Control Embedding Multi-Agent Reinforcement Learning

被引:0
|
作者
Tang, Huaze [1 ]
Zhang, Hengxi [1 ]
Shi, Zhenpeng [1 ]
Chen, Xinlei [1 ]
Ding, Wenbo [1 ,2 ]
Zhang, Xiao-Ping [1 ,2 ,3 ]
机构
[1] Tsinghua Berkeley Shenzhen Inst, Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China
[2] RISC Int Open Source Lab, Shenzhen 518055, Peoples R China
[3] Ryerson Univ, Dept Elect Comp & Biomed Engn, Toronto, ON, Canada
关键词
D O I
10.1109/IROS55552.2023.10341749
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The learning approaches of designing a controller to guide the collective behavior of swarm robots have gained significant attention in recent years. However, the scalability of swarm robots and their inherent stochasticity complicate the control problem due to increasing complexity, unpredictability, and non-linearity. Despite considerable progress made in swarm robotics, addressing these challenges remains a significant issue. In this work, we model the stochastic dynamics of a swarm robot system and then propose a novel control framework based on a mean-field control (MFC) embedding multi-agent reinforcement learning (MARL) approach named MF-MARL to deal with these challenges. While MARL is able to deal with stochasticity statistically, we integrate MFC, allowing MF-MARL to cope with large-scale robots. Moreover, we apply statistical moments of robots' state and control action to discretize continuous input and enable MF-MARL to be applied in continuous scenarios. To demonstrate the effectiveness of MF-MARL, we evaluate the performance of the robots on a specific swarm simulation platform. The experimental results show that our algorithm outperforms the traditional algorithms both in navigation and manipulation tasks. Finally, we demonstrate the adaptability of the proposed algorithm through the component failure test.
引用
收藏
页码:8820 / 8826
页数:7
相关论文
共 50 条
  • [1] Graphon mean-field control for cooperative multi-agent reinforcement learning
    Hu, Yuanquan
    Wei, Xiaoli
    Yan, Junji
    Zhang, Hengxi
    JOURNAL OF THE FRANKLIN INSTITUTE, 2023, 360 (18) : 14783 - 14805
  • [2] Weighted Mean-Field Multi-Agent Reinforcement Learning via Reward Attribution Decomposition
    Wu, Tingyu
    Li, Wenhao
    Jin, Bo
    Zhang, Wei
    Wang, Xiangfeng
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS. DASFAA 2022 INTERNATIONAL WORKSHOPS, 2022, 13248 : 301 - 316
  • [3] Mean Field Multi-Agent Reinforcement Learning
    Yang, Yaodong
    Luo, Rui
    Li, Minne
    Zhou, Ming
    Zhang, Weinan
    Wang, Jun
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [4] Mean-Field Approximation of Cooperative Constrained Multi-Agent Reinforcement Learning (CMARL)
    Mondal, Washim Uddin
    Aggarwal, Vaneet
    Ukkusuri, Satish, V
    JOURNAL OF MACHINE LEARNING RESEARCH, 2024, 25
  • [5] 3M-RL: Multi-Resolution, Multi-Agent, Mean-Field Reinforcement Learning for Autonomous UAV Routing
    Wang, Weichang
    Liu, Yongming
    Srikant, Rayadurgam
    Ying, Lei
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 8985 - 8996
  • [6] Adaptive mean field multi-agent reinforcement learning
    Wang, Xiaoqiang
    Ke, Liangjun
    Zhang, Gewei
    Zhu, Dapeng
    INFORMATION SCIENCES, 2024, 669
  • [7] Causal Mean Field Multi-Agent Reinforcement Learning
    Ma, Hao
    Pu, Zhiqiang
    Pan, Yi
    Liu, Boyin
    Gao, Junlong
    Guo, Zhenyu
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [8] Mean-Field Multi-Agent Reinforcement Learning for Peer-to-Peer Multi-Energy Trading
    Qiu, Dawei
    Wang, Jianhong
    Dong, Zihang
    Wang, Yi
    Strbac, Goran
    IEEE TRANSACTIONS ON POWER SYSTEMS, 2023, 38 (05) : 4853 - 4866
  • [9] Coordinated Multi-Agent Reinforcement Learning for Swarm Battery Control
    Ebell, Niklas
    Pruckner, Marco
    2018 IEEE CANADIAN CONFERENCE ON ELECTRICAL & COMPUTER ENGINEERING (CCECE), 2018,
  • [10] Safe multi-agent reinforcement learning for multi-robot control
    Gu, Shangding
    Kuba, Jakub Grudzien
    Chen, Yuanpei
    Du, Yali
    Yang, Long
    Knoll, Alois
    Yang, Yaodong
    ARTIFICIAL INTELLIGENCE, 2023, 319