Joint Resource Allocation and Trajectory Design for Multi-UAV Systems With Moving Users: Pointer Network and Unfolding

Cited by: 10
Authors
Hou, Qiushuo [1, 2]
Cai, Yunlong [1, 2]
Hu, Qiyu [1, 2]
Lee, Mengyuan [1, 2]
Yu, Guanding [1, 2]
Affiliations
[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
[2] Zhejiang Prov Key Lab Informat Proc Commun & Netwo, Hangzhou 310027, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Artificial neural networks; Trajectory; Resource management; Optimization; Communication systems; Autonomous aerial vehicles; Trajectory optimization; Multi-UAV; resource optimization; trajectory design; pointer network; deep reinforcement learning; deep-unfolding; REINFORCEMENT LEARNING APPROACH; NEURAL-NETWORKS; DEEP; MIMO;
DOI
10.1109/TWC.2022.3217176
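Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code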
0808 ; 0809 ;
Abstract
As an important part of fifth-generation (5G) mobile networks, unmanned aerial vehicles (UAVs) have been applied in various communication scenarios owing to their high operability and low cost. In this paper, we investigate a multi-UAV communication system with moving users and account for the co-channel interference that each link suffers from the transmissions of all other UAVs. To ensure fairness, we maximize the minimum average user rate over the observed time by jointly optimizing the UAVs' trajectories, transmission power, and user association. Because a UAV can cover a large communication area, it does not need to move every time the users move. We therefore propose a two-timescale structure for the considered scenario: the UAVs' trajectories are optimized based on the channel state information (CSI) on a long timescale, while the transmission power and the user association are optimized based on the instantaneous CSI on a short timescale. To tackle this challenging non-convex problem, which has both discrete and continuous variables, we propose a joint neural network (NN) design in which a deep reinforcement learning based pointer network, named advantage pointer-critic (APC), optimizes the discrete variables and a deep-unfolding NN optimizes the continuous variables. Specifically, we first formulate a Markov decision process to model the user association, and then employ the APC network, trained by the advantage actor-critic algorithm, to address it. The APC network consists of a pointer network and a multilayer perceptron. For the deep-unfolding NN, we first develop a block coordinate descent based algorithm to optimize the UAVs' trajectories and transmission power, and then unfold the algorithm into a layer-wise NN with introduced trainable parameters. The two networks are jointly trained in an unsupervised fashion. Simulation results validate that the proposed joint NN significantly outperforms the optimization algorithm while requiring much lower complexity, and that it scales and generalizes well.
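The max-min objective described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the free-space path-loss channel, the fixed UAV altitude, the constants `beta0` and `noise`, and the function name `min_average_rate` are all assumptions made for illustration, and per-UAV scheduling constraints are ignored. It only shows how the minimum average user rate could be evaluated for given trajectories, transmit powers, and user association when every non-serving UAV acts as a co-channel interferer.

```python
import numpy as np

def min_average_rate(uav_pos, user_pos, power, assoc,
                     beta0=1e-3, noise=1e-10, height=100.0):
    """Evaluate the minimum time-averaged user rate (hypothetical helper).

    uav_pos : (T, M, 2) horizontal UAV positions per time slot
    user_pos: (T, K, 2) horizontal user positions per time slot
    power   : (T, M)    transmit power of each UAV per slot
    assoc   : (T, K)    index of the UAV serving each user per slot
    beta0, noise, height are assumed channel constants, not from the paper.
    """
    T, M, _ = uav_pos.shape
    K = user_pos.shape[1]

    # Squared 3D distance between every UAV and every user: shape (T, M, K).
    diff = uav_pos[:, :, None, :] - user_pos[:, None, :, :]
    dist_sq = np.sum(diff ** 2, axis=-1) + height ** 2

    # Assumed free-space path-loss gain and received power from each UAV.
    gain = beta0 / dist_sq
    received = power[:, :, None] * gain           # (T, M, K)

    # Pick the serving UAV's signal; everything else is interference.
    t_idx = np.arange(T)[:, None]
    k_idx = np.arange(K)[None, :]
    signal = received[t_idx, assoc, k_idx]        # (T, K)
    interference = received.sum(axis=1) - signal  # (T, K)

    # Shannon rate per slot, averaged over time, then the worst user.
    rate = np.log2(1.0 + signal / (interference + noise))
    avg_rate = rate.mean(axis=0)                  # (K,)
    return avg_rate.min(), avg_rate

if __name__ == "__main__":
    # Two UAVs hovering above two static users for three slots.
    T = 3
    uav = np.tile(np.array([[0.0, 0.0], [500.0, 0.0]]), (T, 1, 1))
    users = np.tile(np.array([[0.0, 0.0], [500.0, 0.0]]), (T, 1, 1))
    p = np.ones((T, 2))
    a = np.tile(np.array([0, 1]), (T, 1))
    worst, per_user = min_average_rate(uav, users, p, a)
    print("min average rate:", worst)
```

In this sketch the objective is only evaluated, not optimized; in the paper's setting the association `assoc` would come from the APC network and the positions and powers from the deep-unfolding NN.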
Pages: 3310 - 3323
Page count: 14
Related papers
50 in total
  • [41] Deep reinforcement learning based trajectory design and resource allocation for task-aware multi-UAV enabled MEC networks
    Li, Zewu
    Xu, Chen
    Zhang, Zhanpeng
    Wu, Runze
    COMPUTER COMMUNICATIONS, 2024, 213 : 88 - 98
  • [42] Joint Task Allocation and Resource Optimization Based on an Integrated Radar and Communication Multi-UAV System
    Zhang, Xun
    Wang, Kehao
    Li, Xiaobai
    Liu, Kezhong
    Cong, Yirui
    DRONES, 2023, 7 (08)
  • [43] Joint trajectory design and resource allocation for UAV-assisted mobile edge computing in power convergence network
    Cui, Junbin
    Wei, Yong
    Wang, Jianbo
    Shang, Li
    Lin, Peng
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2025, 2025 (01)
  • [44] Joint Multi-UAV Deployment and Resource Allocation based on Personalized Federated Deep Reinforcement Learning
    Xu, Xinyi
    Feng, Gang
    Qin, Shuang
    Liu, Yijing
    Sun, Yao
    ICC 2023-IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2023, : 5677 - 5682
  • [45] NOMA-based Resource Allocation for RIS-assisted Multi-UAV Systems
    Feng, Wanmei
    Tang, Jie
    Wu, Qingqing
    Zhang, Xiuyin
    Jin, Shi
    Tang, Boyi
    Wong, Kai-Kit
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022, : 4553 - 4558
  • [46] Joint Trajectory Design and Resource Allocation for UAV-assisted Jamming NOMA Cognitive UAV Networks
    Sun, Ruomei
    Wu, Yuhang
    Zhou, Fuhui
    Wu, Qihui
    2022 INTERNATIONAL SYMPOSIUM ON WIRELESS COMMUNICATION SYSTEMS, ISWCS, 2022,
  • [47] Joint Task Allocation and Trajectory Optimization for Multi-UAV Collaborative Air-Ground Edge Computing
    Qin, Peng
    Li, Jinghan
    Zhang, Jing
    Fu, Yang
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2024, 11 (06) : 6231 - 6243
  • [48] Joint Trajectory Design and Power Allocation for UAV Assisted Network With User Mobility
    Wang, Jing
    Zhou, Xiaotian
    Zhang, Haixia
    Yuan, Dongfeng
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (10) : 13173 - 13189
  • [49] Joint Resource Allocation and 3D Aerial Trajectory Design for Video Streaming in UAV Communication Systems
    Zhan, Cheng
    Hu, Han
    Sui, Xiufeng
    Liu, Zhi
    Wang, Jianan
    Wang, Honggang
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, 31 (08) : 3227 - 3241
  • [50] Multi-UAV cooperative trajectory optimisation for a multi-hop UAV relaying network
    Liu, Cuntao
    Guo, Yan
    Li, Ning
    Zhou, Bin
    ELECTRONICS LETTERS, 2021, 57 (21) : 819 - 822