Joint UAV trajectory and communication design with heterogeneous multi-agent reinforcement learning

被引:0
|
作者
Xuanhan ZHOU [1 ]
Jun XIONG [1 ]
Haitao ZHAO [1 ]
Xiaoran LIU [1 ]
Baoquan REN [2 ]
Xiaochen ZHANG [1 ]
Jibo WEI [1 ]
Hao YIN [2 ]
机构
[1] College of Electronic Science and Technology,National University of Defense Technology
[2] Systems Engineering Institute,Academy of Military Sciences PLA
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TN929.5 [移动通信]; TP18 [人工智能理论]; V279 [无人驾驶飞机];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ; 1111 ;
摘要
Unmanned aerial vehicles(UAVs) are recognized as effective means for delivering emergency communication services when terrestrial infrastructures are unavailable. This paper investigates a multiUAV-assisted communication system, where we jointly optimize UAVs’ trajectories, user association, and ground users(GUs)’ transmit power to maximize a defined fairness-weighted throughput metric. Owing to the dynamic nature of UAVs, this problem has to be solved in real time. However, the problem’s non-convex and combinatorial attributes pose challenges for conventional optimization-based algorithms, particularly in scenarios without central controllers. To address this issue, we propose a multi-agent deep reinforcement learning(MADRL) approach to provide distributed and online solutions. In contrast to previous MADRLbased methods considering only UAV agents, we model UAVs and GUs as heterogeneous agents sharing a common objective. Specifically, UAVs are tasked with optimizing their trajectories, while GUs are responsible for selecting a UAV for association and determining a transmit power level. To learn policies for these heterogeneous agents, we design a heterogeneous coordinated QMIX(HC-QMIX) algorithm to train local Q-networks in a centralized manner. With these well-trained local Q-networks, UAVs and GUs can make individual decisions based on their local observations. Extensive simulation results demonstrate that the proposed algorithm outperforms state-of-the-art benchmarks in terms of total throughput and system fairness.
引用
收藏
页码:225 / 245
页数:21
相关论文
共 50 条
  • [31] Information Design in Multi-Agent Reinforcement Learning
    Lin, Yue
    Li, Wenhao
    Zha, Hongyuan
    Wang, Baoxian
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [32] Learning of Communication Codes in Multi-Agent Reinforcement Learning Problem
    Kasai, Tatsuya
    Tenmoto, Hiroshi
    Kamiya, Akimoto
    2008 IEEE CONFERENCE ON SOFT COMPUTING IN INDUSTRIAL APPLICATIONS SMCIA/08, 2009, : 1 - +
  • [33] AoI Minimization for UAV-to-Device Underlay Communication by Multi-agent Deep Reinforcement Learning
    Wu, Fanyi
    Zhang, Hongliang
    Wu, Jianjun
    Song, Lingyang
    Han, Zhu
    Poor, H. Vincent
    2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [34] Multi-agent reinforcement learning based on local communication
    Zhang, Wenxu
    Ma, Lei
    Li, Xiaonan
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (Suppl 6): : 15357 - 15366
  • [35] Multi-Agent Deep Reinforcement Learning with Emergent Communication
    Simoes, David
    Lau, Nuno
    Reis, Luis Paulo
    2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2019,
  • [36] Sparse communication in multi-agent deep reinforcement learning
    Han, Shuai
    Dastani, Mehdi
    Wang, Shihan
    NEUROCOMPUTING, 2025, 625
  • [37] Multi-Agent Few-Shot Meta Reinforcement Learning for Trajectory Design and Channel Selection in UAV-Assisted Networks
    Shiyang Zhou
    Yufan Cheng
    Xia Lei
    Huanhuan Duan
    ChinaCommunications, 2022, 19 (04) : 166 - 176
  • [38] Improving coordination with communication in multi-agent reinforcement learning
    Szer, D
    Charpillet, F
    ICTAI 2004: 16TH IEEE INTERNATIONALCONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2004, : 436 - 440
  • [39] Multi-Agent Reinforcement Learning for Coordinating Communication and Control
    Mason, Federico
    Chiariotti, Federico
    Zanella, Andrea
    Popovski, Petar
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2024, 10 (04) : 1566 - 1581
  • [40] Universally Expressive Communication in Multi-Agent Reinforcement Learning
    Morris, Matthew
    Barrett, Thomas D.
    Pretorius, Arnu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,