Joint UAV trajectory and communication design with heterogeneous multi-agent reinforcement learning

Authors
Xuanhan ZHOU [1]
Jun XIONG [1]
Haitao ZHAO [1]
Xiaoran LIU [1]
Baoquan REN [2]
Xiaochen ZHANG [1]
Jibo WEI [1]
Hao YIN [2]
Affiliations
[1] College of Electronic Science and Technology, National University of Defense Technology
[2] Systems Engineering Institute, Academy of Military Sciences PLA
Funding
National Natural Science Foundation of China
Keywords
DOI
Not available
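Chinese Library Classification (CLC) number
TN929.5 [Mobile communications]; TP18 [Artificial intelligence theory]; V279 [Unmanned aircraft]
Discipline classification codes
081104; 0812; 0835; 1405; 1111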
Abstract
Unmanned aerial vehicles (UAVs) are recognized as an effective means of delivering emergency communication services when terrestrial infrastructure is unavailable. This paper investigates a multi-UAV-assisted communication system in which we jointly optimize the UAVs’ trajectories, user association, and the transmit power of ground users (GUs) to maximize a defined fairness-weighted throughput metric. Owing to the dynamic nature of UAVs, this problem has to be solved in real time. However, the problem’s non-convex and combinatorial nature poses challenges for conventional optimization-based algorithms, particularly in scenarios without central controllers. To address this issue, we propose a multi-agent deep reinforcement learning (MADRL) approach that provides distributed and online solutions. In contrast to previous MADRL-based methods, which consider only UAV agents, we model UAVs and GUs as heterogeneous agents sharing a common objective. Specifically, UAVs are tasked with optimizing their trajectories, while GUs are responsible for selecting a UAV for association and determining a transmit power level. To learn policies for these heterogeneous agents, we design a heterogeneous coordinated QMIX (HC-QMIX) algorithm that trains local Q-networks in a centralized manner. With these well-trained local Q-networks, UAVs and GUs can make individual decisions based on their local observations. Extensive simulation results demonstrate that the proposed algorithm outperforms state-of-the-art benchmarks in terms of total throughput and system fairness.
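The centralized-training, decentralized-execution idea behind QMIX-style methods can be illustrated with a small sketch. This is not the HC-QMIX algorithm from the paper, whose details are not given in the abstract; it is a single-layer simplification of the general QMIX mixing principle, with fixed weights standing in for the state-conditioned hypernetwork outputs used in practice. All Q-values, action sets, and weights below are hypothetical. The sketch shows that when the mixing weights are non-negative, each agent's local greedy choice also maximizes the mixed joint value.

```python
from itertools import product
from typing import Sequence, Tuple


def greedy_action(q_values: Sequence[float]) -> int:
    """Decentralized execution: pick the action with the highest local Q-value."""
    return max(range(len(q_values)), key=lambda a: q_values[a])


def monotonic_mix(agent_qs: Sequence[float],
                  weights: Sequence[float],
                  bias: float = 0.0) -> float:
    """Mix per-agent Q-values into a joint value.

    Taking abs() of the weights keeps them non-negative, so the joint value
    never decreases when any single agent's Q-value increases.
    """
    return sum(abs(w) * q for w, q in zip(weights, agent_qs)) + bias


# Hypothetical local Q-values (assumptions for illustration only):
# one UAV agent choosing among 3 movement actions, and one GU agent
# choosing among 4 (UAV association, power level) pairs.
uav_q = [0.2, 0.9, 0.4]
gu_q = [0.1, 0.3, 0.8, 0.5]
local_qs = [uav_q, gu_q]

# Hypothetical raw mixing weights; the sign is discarded by abs().
weights = [-1.5, 0.7]

# Each agent acts greedily on its own Q-values.
greedy_joint: Tuple[int, ...] = tuple(greedy_action(q) for q in local_qs)

# Brute-force search over all joint actions for the best mixed value.
best_joint = max(
    product(*(range(len(q)) for q in local_qs)),
    key=lambda joint: monotonic_mix(
        [q[a] for q, a in zip(local_qs, joint)], weights
    ),
)

print("greedy joint action:", greedy_joint)
print("best joint action:  ", best_joint)
```

In this toy setting the two joint actions coincide, which is why agents trained under a monotonic mixer can act on local observations alone at execution time.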
Pages: 225-245
Number of pages: 21
Related papers
50 in total
  • [21] Multi-Agent Model-Based Reinforcement Learning for Trajectory Design and Power Control in UAV-Enabled Networks
    Zhou, Shiyang
    Cheng, Yufan
    Lei, Xia
    2022 3RD INFORMATION COMMUNICATION TECHNOLOGIES CONFERENCE (ICTC 2022), 2022, : 33 - 38
  • [22] Multi-Agent Reinforcement Learning Trajectory Design and Two-Stage Resource Management in CoMP UAV VLC Networks
    Maleki, Mohammad Reza
    Mili, Mohammad Robat
    Javan, Mohammad Reza
    Mokari, Nader
    Jorswieck, Eduard A. A.
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2022, 70 (11) : 7464 - 7476
  • [23] Multi-agent Deep Reinforcement Learning-based Trajectory Design for UAV-aided Edge Computing System
    Lu, Gengyuan
    Chang, Zheng
    2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,
  • [24] Joint UAV Trajectory and RadCom Task Schedule for IVNs: A Game-Embedding Multi-Agent Deep Reinforcement Learning Approach
    Cheng, Sike
    Lin, Xiangbo
    Li, Xuanheng
    Wang, Jingjing
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2025, 24 (01) : 181 - 196
  • [25] Trajectory Design and Bandwidth Assignment for UAVs-enabled Communication Network with Multi-Agent Deep Reinforcement Learning
    Wang, Weijian
    Lin, Yun
    2021 IEEE 94TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2021-FALL), 2021,
  • [26] Evaluating Multi-Agent Reinforcement Learning on Heterogeneous Platforms
    Wiggins, Samuel
    Meng, Yuan
    Kannan, Rajgopal
    Prasanna, Viktor
    ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING FOR MULTI-DOMAIN OPERATIONS APPLICATIONS V, 2023, 12538
  • [27] GAN-powered heterogeneous multi-agent reinforcement learning for UAV-assisted task
    Li, Yangyang
    Feng, Lei
    Yang, Yang
    Li, Wenjing
    AD HOC NETWORKS, 2024, 153
  • [28] Design of routing protocols for heterogeneous WSN based on multi-agent reinforcement learning
    George, Melbin
    Baskar, S.
    Roberts, Michaelraj Kingston
    2024 7TH INTERNATIONAL CONFERENCE ON DEVICES, CIRCUITS AND SYSTEMS, ICDCS 2024, 2024, : 72 - 76
  • [29] Multi-Agent Reinforcement Learning-Based Joint Caching and Routing in Heterogeneous Networks
    Yang, Meiyi
    Gao, Deyun
    Foh, Chuan Heng
    Quan, Wei
    Leung, Victor C. M.
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2024, 10 (05) : 1959 - 1974
  • [30] Multi-Agent Deep Reinforcement Learning for Secure UAV Communications
    Zhang, Yu
    Zhuang, Zirui
    Gao, Feifei
    Wang, Jingyu
    Han, Zhu
    2020 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2020,