Joint UAV trajectory and communication design with heterogeneous multi-agent reinforcement learning

Authors:

Xuanhan ZHOU [1]

Jun XIONG [1]

Haitao ZHAO [1]

Xiaoran LIU [1]

Baoquan REN [2]

Xiaochen ZHANG [1]

Jibo WEI [1]

Hao YIN [2]

Affiliations:

[1] College of Electronic Science and Technology, National University of Defense Technology

[2] Systems Engineering Institute, Academy of Military Sciences PLA

Source:

Science China (Information Sciences) | 2024, Vol. 67, No. 03

Funding:

National Natural Science Foundation of China

DOI:

Not available

Chinese Library Classification (CLC) number:

TN929.5 [Mobile communication]; TP18 [Artificial intelligence theory]; V279 [Unmanned aircraft]

Discipline classification codes:

081104; 0812; 0835; 1405; 1111

Abstract:

Unmanned aerial vehicles (UAVs) are recognized as an effective means of delivering emergency communication services when terrestrial infrastructure is unavailable. This paper investigates a multi-UAV-assisted communication system in which we jointly optimize the UAVs’ trajectories, user association, and the transmit power of ground users (GUs) to maximize a defined fairness-weighted throughput metric. Owing to the dynamic nature of UAVs, this problem has to be solved in real time. However, the problem’s non-convex and combinatorial attributes pose challenges for conventional optimization-based algorithms, particularly in scenarios without central controllers. To address this issue, we propose a multi-agent deep reinforcement learning (MADRL) approach that provides distributed, online solutions. In contrast to previous MADRL-based methods that consider only UAV agents, we model UAVs and GUs as heterogeneous agents sharing a common objective. Specifically, UAVs are tasked with optimizing their trajectories, while GUs are responsible for selecting a UAV for association and determining a transmit power level. To learn policies for these heterogeneous agents, we design a heterogeneous coordinated QMIX (HC-QMIX) algorithm that trains local Q-networks in a centralized manner. With these well-trained local Q-networks, UAVs and GUs can make individual decisions based on their local observations. Extensive simulation results demonstrate that the proposed algorithm outperforms state-of-the-art benchmarks in terms of total throughput and system fairness.
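
The centralized-training, decentralized-execution pattern described in the abstract can be illustrated with a minimal sketch. It is not the paper's HC-QMIX implementation: the agent names, Q-values, mixing weights, and bias below are hypothetical, and the mixer is reduced to a single linear layer with fixed weights, whereas standard QMIX generates the weights from the global state with hypernetworks. The sketch only shows the two ideas the abstract relies on: each agent acts greedily on its own local Q-values, and a mixer with non-negative weights keeps the joint value monotonic in every local value.

```python
def greedy_action(q_values):
    """Decentralized execution: pick the argmax of an agent's own local Q-values."""
    return max(range(len(q_values)), key=lambda a: q_values[a])


def monotonic_mix(agent_qs, raw_weights, bias):
    """QMIX-style mixing, simplified to one linear layer (an assumption).

    Taking abs() of each weight guarantees dQ_tot/dQ_i >= 0, so raising any
    agent's local Q-value can never lower the joint value.
    """
    return sum(abs(w) * q for w, q in zip(raw_weights, agent_qs)) + bias


# Hypothetical local Q-values for two UAV agents (movement actions) and
# two GU agents (flattened association / power-level choices).
local_qs = {
    "uav_0": [0.2, 0.9, 0.1],
    "uav_1": [0.5, 0.4, 0.7],
    "gu_0": [0.3, 0.8],
    "gu_1": [0.6, 0.1],
}

# Each agent decides on its own, using only its local values.
actions = {name: greedy_action(q) for name, q in local_qs.items()}
chosen_qs = [local_qs[name][a] for name, a in actions.items()]

# Hypothetical unconstrained mixer weights; abs() is applied inside the mixer.
raw_weights = [0.5, -1.0, 0.25, -0.75]
q_tot = monotonic_mix(chosen_qs, raw_weights, bias=0.1)

print(actions)
print(q_tot)
```

Because the mixer is monotonic, the joint action formed by the per-agent greedy choices also maximizes the mixed value, which is what allows the local Q-networks to be used independently after training.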

Pages: 225-245

Page count: 21
