Autonomous Swarm Robot Coordination via Mean-Field Control Embedding Multi-Agent Reinforcement Learning

被引:0
|
作者
Tang, Huaze [1 ]
Zhang, Hengxi [1 ]
Shi, Zhenpeng [1 ]
Chen, Xinlei [1 ]
Ding, Wenbo [1 ,2 ]
Zhang, Xiao-Ping [1 ,2 ,3 ]
机构
[1] Tsinghua Berkeley Shenzhen Inst, Tsinghua Univ, Tsinghua Shenzhen Int Grad Sch, Shenzhen, Peoples R China
[2] RISC Int Open Source Lab, Shenzhen 518055, Peoples R China
[3] Ryerson Univ, Dept Elect Comp & Biomed Engn, Toronto, ON, Canada
关键词
D O I
10.1109/IROS55552.2023.10341749
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The learning approaches of designing a controller to guide the collective behavior of swarm robots have gained significant attention in recent years. However, the scalability of swarm robots and their inherent stochasticity complicate the control problem due to increasing complexity, unpredictability, and non-linearity. Despite considerable progress made in swarm robotics, addressing these challenges remains a significant issue. In this work, we model the stochastic dynamics of a swarm robot system and then propose a novel control framework based on a mean-field control (MFC) embedding multi-agent reinforcement learning (MARL) approach named MF-MARL to deal with these challenges. While MARL is able to deal with stochasticity statistically, we integrate MFC, allowing MF-MARL to cope with large-scale robots. Moreover, we apply statistical moments of robots' state and control action to discretize continuous input and enable MF-MARL to be applied in continuous scenarios. To demonstrate the effectiveness of MF-MARL, we evaluate the performance of the robots on a specific swarm simulation platform. The experimental results show that our algorithm outperforms the traditional algorithms both in navigation and manipulation tasks. Finally, we demonstrate the adaptability of the proposed algorithm through the component failure test.
引用
收藏
页码:8820 / 8826
页数:7
相关论文
共 50 条
  • [41] Multi-agent reinforcement learning for character control
    Cheng Li
    Levi Fussell
    Taku Komura
    The Visual Computer, 2021, 37 : 3115 - 3123
  • [42] Mean-field optimal control in a multi-agent interaction model for prevention of maritime crime
    Orlando, Gianluca
    ADVANCES IN CONTINUOUS AND DISCRETE MODELS, 2023, 2023 (01):
  • [43] Mean-field optimal control in a multi-agent interaction model for prevention of maritime crime
    Gianluca Orlando
    Advances in Continuous and Discrete Models, 2023
  • [44] Swarm Reinforcement Learning for traffic signal control based on cooperative multi-agent framework
    Tahifa, Mohammed
    Boumhidi, Jaouad
    Yahyaouy, Ali
    2015 INTELLIGENT SYSTEMS AND COMPUTER VISION (ISCV), 2015,
  • [45] Caching for Edge Inference at Scale: A Mean Field Multi-Agent Reinforcement Learning Approach
    Lu, Yanqing
    Zhang, Meng
    Tang, Ming
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 332 - 337
  • [46] Quantum Multi-Agent Reinforcement Learning for Autonomous Mobility Cooperation
    Park, Soohyun
    Kim, Jae Pyoung
    Park, Chanyoung
    Jung, Soyi
    Kim, Joongheon
    IEEE COMMUNICATIONS MAGAZINE, 2024, 62 (06) : 106 - 112
  • [47] A study on multi-agent reinforcement learning for autonomous distribution vehicles
    Serap Ergün
    Iran Journal of Computer Science, 2023, 6 (4) : 297 - 305
  • [48] Autonomous Separation Assurance with Deep Multi-Agent Reinforcement Learning
    Brittain, Marc W.
    Yang, Xuxi
    Wei, Peng
    JOURNAL OF AEROSPACE INFORMATION SYSTEMS, 2021, 18 (12): : 890 - 905
  • [49] DEEP REINFORCEMENT LEARNING FOR MULTI-AGENT AUTONOMOUS SATELLITE INSPECTION
    Lei, Henry H.
    Shubert, Matt
    Damron, Nathan
    Lang, Kendra
    Phillips, Sean
    PROCEEDINGS OF THE 44TH ANNUAL AMERICAN ASTRONAUTICAL SOCIETY GUIDANCE, NAVIGATION, AND CONTROL CONFERENCE, AAS 2022, 2024, : 1391 - 1412
  • [50] Enhanced consensus control architecture for autonomous platoon utilizing multi-agent reinforcement learning
    Guo, Xin
    Peng, Jiankun
    Pi, Dawei
    Zhang, Hailong
    Wu, Changcheng
    Ma, Chunye
    COMPUTER-AIDED CIVIL AND INFRASTRUCTURE ENGINEERING, 2025,