Joint UAV trajectory and communication design with heterogeneous multi-agent reinforcement learning

被引:0
|
作者
Xuanhan ZHOU [1 ]
Jun XIONG [1 ]
Haitao ZHAO [1 ]
Xiaoran LIU [1 ]
Baoquan REN [2 ]
Xiaochen ZHANG [1 ]
Jibo WEI [1 ]
Hao YIN [2 ]
机构
[1] College of Electronic Science and Technology,National University of Defense Technology
[2] Systems Engineering Institute,Academy of Military Sciences PLA
基金
中国国家自然科学基金;
关键词
D O I
暂无
中图分类号
TN929.5 [移动通信]; TP18 [人工智能理论]; V279 [无人驾驶飞机];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ; 1111 ;
摘要
Unmanned aerial vehicles(UAVs) are recognized as effective means for delivering emergency communication services when terrestrial infrastructures are unavailable. This paper investigates a multiUAV-assisted communication system, where we jointly optimize UAVs’ trajectories, user association, and ground users(GUs)’ transmit power to maximize a defined fairness-weighted throughput metric. Owing to the dynamic nature of UAVs, this problem has to be solved in real time. However, the problem’s non-convex and combinatorial attributes pose challenges for conventional optimization-based algorithms, particularly in scenarios without central controllers. To address this issue, we propose a multi-agent deep reinforcement learning(MADRL) approach to provide distributed and online solutions. In contrast to previous MADRLbased methods considering only UAV agents, we model UAVs and GUs as heterogeneous agents sharing a common objective. Specifically, UAVs are tasked with optimizing their trajectories, while GUs are responsible for selecting a UAV for association and determining a transmit power level. To learn policies for these heterogeneous agents, we design a heterogeneous coordinated QMIX(HC-QMIX) algorithm to train local Q-networks in a centralized manner. With these well-trained local Q-networks, UAVs and GUs can make individual decisions based on their local observations. Extensive simulation results demonstrate that the proposed algorithm outperforms state-of-the-art benchmarks in terms of total throughput and system fairness.
引用
收藏
页码:225 / 245
页数:21
相关论文
共 50 条
  • [41] Low Entropy Communication in Multi-Agent Reinforcement Learning
    Yu, Lebin
    Qiu, Yunbo
    Wang, Qiexiang
    Zhang, Xudong
    Wang, Jian
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 5173 - 5178
  • [42] Multi-agent reinforcement learning based on local communication
    Wenxu Zhang
    Lei Ma
    Xiaonan Li
    Cluster Computing, 2019, 22 : 15357 - 15366
  • [43] Biases for Emergent Communication in Multi-agent Reinforcement Learning
    Eccles, Tom
    Bachrach, Yoram
    Lever, Guy
    Lazaridou, Angeliki
    Graepel, Thore
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [44] Multi-agent few-shot meta reinforcement learning for trajectory design and channel selection in UAV-assisted networks
    Zhou, Shiyang
    Cheng, Yufan
    Lei, Xia
    Duan, Huanhuan
    CHINA COMMUNICATIONS, 2022, 19 (04) : 166 - 176
  • [45] Cooperative Internet of UAVs: Distributed Trajectory Design by Multi-Agent Deep Reinforcement Learning
    Hu, Jingzhi
    Zhang, Hongliang
    Song, Lingyang
    Schober, Robert
    Poor, H. Vincent
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2020, 68 (11) : 6807 - 6821
  • [46] Multi-Target Pursuit by a Decentralized Heterogeneous UAV Swarm using Deep Multi-Agent Reinforcement Learning
    Kouzeghar, Maryam
    Song, Youngbin
    Meghjani, Malika
    Bouffanais, Roland
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 3289 - 3295
  • [47] Deep Reinforcement Learning Approach for Joint Trajectory Design in Multi-UAV IoT Networks
    Xu, Shu
    Zhan, Xiangyu
    Li, Chunguo
    Wang, Dongming
    Yang, Luxi
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (03) : 3389 - 3394
  • [48] Distributed UAV-BSs Trajectory Optimization for User-Level Fair Communication Service With Multi-Agent Deep Reinforcement Learning
    Qin, Zhenquan
    Liu, Zhonghao
    Han, Guangjie
    Lin, Chuan
    Guo, Linlin
    Xie, Ling
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (12) : 12290 - 12301
  • [49] Joint Optimization of Multi-UAV Target Assignment and Path Planning Based on Multi-Agent Reinforcement Learning
    Qie, Han
    Shi, Dianxi
    Shen, Tianlong
    Xu, Xinhai
    Li, Yuan
    Wang, Liujing
    IEEE ACCESS, 2019, 7 : 146264 - 146272
  • [50] Trajectory Design for Multi-UAV-Enabled Wireless Powered Communication Networks: A Multi-Agent DRL Approach
    Qi, Honghao
    Wu, Muqing
    Zhang, Zijian
    Zhao, Min
    2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024, 2024,