Crowd Perception Communication-Based Multi- Agent Path Finding With Imitation Learning

被引:0
|
作者
Xie, Jing [1 ,2 ]
Zhang, Yongjun [1 ]
Yang, Huanhuan [3 ]
Ouyang, Qianying [4 ]
Dong, Fang [3 ]
Guo, Xinyu [5 ]
Jin, Songchang [4 ]
Shi, Dianxi [4 ]
机构
[1] Natl Innovat Inst Def Technol, Beijing 100071, Peoples R China
[2] Tianjin Artificial Intelligence Innovat Ctr, Tianjin 300457, Peoples R China
[3] Natl Univ Def Technol, Coll Comp, Changsha 410073, Peoples R China
[4] Intelligent Game & Decis Lab, Beijing 100091, Peoples R China
[5] Beijing Inst Technol, Coll Comp, Beijing 100081, Peoples R China
来源
关键词
Imitation learning; Trajectory; Learning systems; Cloning; Training; Neural networks; Technological innovation; Multi-agent path finding; deep reinforcement learning; multi-agent communication; imitation learning; CONFLICT-BASED SEARCH; REINFORCEMENT;
D O I
10.1109/LRA.2024.3455948
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Deep reinforcement learning-based Multi-Agent Path Finding (MAPF) has gained significant attention due to its remarkable adaptability to environments. Existing methods primarily leverage multi-agent communication in a fully-decentralized framework to maintain scalability while enhancing information exchange among agents. However, as the number of agents and obstacles increases, the environment becomes more complex, making cooperation between agents becomes more difficult, and crowding occurs from time to time. To address these issues, we propose a decentralized planner C3PIL, which integrates a Controlled Communication mechanism for Crowd Perception and uses Imitation Learning to improve policy learning. C3PIL first introduces a crowd perception communication module that perceives environmental crowd information and incorporates it into the controlled communication. This effectively prevents and mitigates crowded situations. Furthermore, we employ generative adversarial imitation learning to learn a reward function from expert experiences. It reduces the possible misleading caused by the fixed reward function, improves the flexibility and diversity of agent behaviors, and ultimately enables agents to cooperate effectively. Finally, experimental results show that C3PIL not only outperforms previous learning-based MAPF methods, but also further enhances the cooperation of agents and significantly reduces crowding in complex environments.
引用
收藏
页码:8929 / 8936
页数:8
相关论文
共 50 条
  • [1] Multi-Agent Path Finding with Prioritized Communication Learning
    Li, Wenhao
    Chen, Hongjun
    Jin, Bo
    Tan, Wenzhe
    Zha, Hongyuan
    Wang, Xiangfeng
    2022 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2022, 2022, : 10695 - 10701
  • [2] Learning Selective Communication for Multi-Agent Path Finding
    Ma, Ziyuan
    Luo, Yudong
    Pan, Jia
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2022, 7 (02) : 1455 - 1462
  • [3] A Decentralized Multi-Agent Path Planning Approach Based on Imitation Learning and Selective Communication
    Feng, Bohan
    Bi, Youyi
    Li, Mian
    Lin, Liyong
    JOURNAL OF COMPUTING AND INFORMATION SCIENCE IN ENGINEERING, 2024, 24 (08)
  • [4] Perception field based imitation learning for unlabeled multi-agent pathfinding
    Wenjie CHU
    Ailun YU
    Wei ZHANG
    Haiyan ZHAO
    Zhi JIN
    Science China(Information Sciences), 2024, 67 (05) : 115 - 135
  • [5] Perception field based imitation learning for unlabeled multi-agent pathfinding
    Chu, Wenjie
    Yu, Ailun
    Zhang, Wei
    Zhao, Haiyan
    Jin, Zhi
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (05)
  • [6] Planning and Learning in Multi-Agent Path Finding
    Yakovlev, K. S.
    Andreychuk, A. A.
    Skrynnik, A. A.
    Panov, A. I.
    DOKLADY MATHEMATICS, 2022, 106 (SUPPL 1) : S79 - S84
  • [7] Planning and Learning in Multi-Agent Path Finding
    K. S. Yakovlev
    A. A. Andreychuk
    A. A. Skrynnik
    A. I. Panov
    Doklady Mathematics, 2022, 106 : S79 - S84
  • [8] Distributed Heuristic Multi-Agent Path Finding with Communication
    Ma, Ziyuan
    Luo, Yudong
    Ma, Hang
    2021 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2021), 2021, : 8699 - 8705
  • [9] Curriculum Learning Based Multi-Agent Path Finding for Complex Environments
    Zhao, Cheng
    Zhuang, Liansheng
    Huang, Yihong
    Liu, Haonan
    2023 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, IJCNN, 2023,
  • [10] Improved Multi-Agent Reinforcement Learning for Path Planning-Based Crowd Simulation
    Wang, Qingqing
    Liu, Hong
    Gao, Kaizhou
    Zhang, Le
    IEEE ACCESS, 2019, 7 : 73841 - 73855