Hierarchical Multi-Agent Training Based on Reinforcement Learning

被引:1
|
作者
Wang, Guanghua [1 ]
Li, Wenjie [2 ]
Wu, Zhanghua [3 ]
Guo, Xian [1 ]
机构
[1] Nankai Univ, Inst Robot & Automat Informat Syst, Tianjin, Peoples R China
[2] State Grid Tianjin Elect Power Co, Tianjin, Peoples R China
[3] Jiangsu Automat Res Inst, Lianyungang, Jiangsu, Peoples R China
关键词
Multi-Agent Systems; Reinforcement Learning; Multi-Agent Proximal Policy Optimization Algorithm; Formation Confrontation;
D O I
10.1109/ACIRS62330.2024.10684909
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
In the current multi-UAV adversarial games, issues exist such as the instability and difficulty in learning distributed strategies, as well as a lack of coordinated formation UAVs. In this paper, a hierarchical multi-agent training framework is proposed to solve these problems, which categorizes UAV formations into two types of intelligent agents: virtual centroid agents and UAVs within the formation. The centroid agents are responsible for controlling the overall movement of the formation. In contrast, the UAVs within the formation are capable of flexibly adjusting their speed and heading on this basis. By constructing a confrontation scenario involving multiple formations and types of UAVs, the effectiveness of the hierarchical training framework is experimentally validated. The average winning rate against UAVs controlled by strategy methods based on rule construction reaches 97%, enabling both formation variations and tactical evolutions.
引用
收藏
页码:11 / 18
页数:8
相关论文
共 50 条
  • [21] Multi-agent deep reinforcement learning with type-based hierarchical group communication
    Hao Jiang
    Dianxi Shi
    Chao Xue
    Yajie Wang
    Gongju Wang
    Yongjun Zhang
    Applied Intelligence, 2021, 51 : 5793 - 5808
  • [22] Reinforcement Learning Based Hierarchical Multi-Agent Robotic Search Team in Uncertain Environment
    Hamid, Shahzaib
    Nasir, Ali
    Saleem, Yasir
    MEHRAN UNIVERSITY RESEARCH JOURNAL OF ENGINEERING AND TECHNOLOGY, 2021, 40 (03) : 645 - 662
  • [23] A Study on Multi-Agent Reinforcement Learning Problem Based on Hierarchical Modular Fuzzy Model
    Watanabe, Toshihiko
    2009 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-3, 2009, : 2041 - 2046
  • [24] Hierarchical Policy Network with Multi-agent for Knowledge Graph Reasoning Based on Reinforcement Learning
    Zheng, Mingming
    Zhou, Yanquan
    Cui, Qingyao
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT I, 2021, 12815 : 445 - 457
  • [25] Multi-agent deep reinforcement learning with type-based hierarchical group communication
    Jiang, Hao
    Shi, Dianxi
    Xue, Chao
    Wang, Yajie
    Wang, Gongju
    Zhang, Yongjun
    APPLIED INTELLIGENCE, 2021, 51 (08) : 5793 - 5808
  • [26] GHGC: Goal-based Hierarchical Group Communication in Multi-Agent Reinforcement Learning
    Jiang, Hao
    Shi, Dianxi
    Xue, Chao
    Wang, Yajie
    Wang, Gongju
    Zhang, Yongjun
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 3507 - 3514
  • [27] Quantization-aware Training for Multi-Agent Reinforcement Learning
    Chandrinos, Nikolaos
    Amasialidis, Michalis
    Kirtas, Manos
    Tsampazis, Konstantinos
    Passalis, Nikolaos
    Tefas, Anastasios
    32ND EUROPEAN SIGNAL PROCESSING CONFERENCE, EUSIPCO 2024, 2024, : 1891 - 1895
  • [28] Hierarchical Reinforcement Learning with Opponent Modeling for Distributed Multi-agent Cooperation
    Liang, Zhixuan
    Cao, Jiannong
    Jiang, Shan
    Saxena, Divya
    Xu, Huafeng
    2022 IEEE 42ND INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS (ICDCS 2022), 2022, : 884 - 894
  • [29] Mitigating Bus Bunching via Hierarchical Multi-Agent Reinforcement Learning
    Yu, Mengdi
    Yang, Tao
    Li, Chunxiao
    Jin, Yaohui
    Xu, Yanyan
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (08) : 9675 - 9692
  • [30] Hierarchical Control of Multi-Agent Systems using Online Reinforcement Learning
    Bai, He
    George, Jemin
    Chakrabortty, Aranya
    2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 340 - 345