Joint Resource Allocation and Trajectory Design for Multi-UAV Systems With Moving Users: Pointer Network and Unfolding

被引:10
|
作者
Hou, Qiushuo [1 ,2 ]
Cai, Yunlong [1 ,2 ]
Hu, Qiyu [1 ,2 ]
Lee, Mengyuan [1 ,2 ]
Yu, Guanding [1 ,2 ]
机构
[1] Zhejiang Univ, Coll Informat Sci & Elect Engn, Hangzhou 310027, Peoples R China
[2] Zhejiang Prov Key Lab Informat Proc Commun & Netwo, Hangzhou 310027, Peoples R China
基金
中国国家自然科学基金;
关键词
Artificial neural networks; Trajectory; Resource management; Optimization; Communication systems; Autonomous aerial vehicles; Trajectory optimization; Multi-UAV; resource optimization; trajectory design; pointer network; deep reinforcement learning; deep-unfolding; REINFORCEMENT LEARNING APPROACH; NEURAL-NETWORKS; DEEP; MIMO;
D O I
10.1109/TWC.2022.3217176
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
As an important part of the fifth generation (5G) mobile networks, unmanned aerial vehicles (UAVs) have been applied in various communication scenarios due to their high operability and low cost. In this paper, we investigate a multi-UAV communication system with moving users and consider the co-channel interference caused by the transmissions of all other UAVs. To ensure the fairness, we maximize the minimum average user rate during the observed time by jointly optimizing UAVs' trajectories, transmission power, and user association. Considering that UAVs can cover a large area for communications, UAVs do not need to move as soon as the users move. Therefore, a two-timescale structure is proposed for the considered scenario, where the UAVs' trajectories are optimized based on the channel state information (CSI) in a long timescale, while the transmission power and the user association are optimized based on the instantaneous CSI in a short timescale. To effectively tackle this challenging non-convex problem with both discrete and continuous variables, we propose a joint neural network (NN) design, where a deep reinforcement learning based Pointer Network named advantage pointer-critic (APC) is applied to optimize discrete variables and a deep-unfolding NN is used to optimize the continuous variables. Specifically, we first formulate a Markov decision process to model the user association, and then employ the APC network trained by the advantage actor-critic algorithm to address it. The APC network consists of a Pointer Network and a Multilayer Perceptron. As for the deep-unfolding NN, we first develop a block coordinate descent based algorithm to optimize the UAVs' trajectories and transmission power, and then unfold the algorithm into a layer-wise NN with introduced trainable parameters. These two networks are jointly trained in an unsupervised fashion. Simulation results validate that the proposed joint NN significantly outperforms the optimization algorithm with much lower complexity, and achieves good performances on scalability and generalization ability.
引用
收藏
页码:3310 / 3323
页数:14
相关论文
共 50 条
  • [31] Joint resource allocation and power control for radar interference mitigation in multi-UAV networks
    Xinyi WANG
    Zesong FEI
    Jingxuan HUANG
    J.Andrew ZHANG
    Jinhong YUAN
    ScienceChina(InformationSciences), 2021, 64 (08) : 104 - 116
  • [32] Resource Allocation for Multi-UAV Aided IoT NOMA Uplink Transmission Systems
    Duan, Ruiyang
    Wang, Jingjing
    Jiang, Chunxiao
    Yao, Haipeng
    Ren, Yong
    Qian, Yi
    IEEE INTERNET OF THINGS JOURNAL, 2019, 6 (04) : 7025 - 7037
  • [33] Multi-UAV Trajectory Control, Resource Allocation, and NOMA User Pairing for Uplink Energy Minimization
    Nguyen, Minh Tri
    Le, Long Bao
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (23) : 23728 - 23740
  • [34] Joint Trajectory and Resource Allocation Design for RIS-Assisted UAV-Enabled ISAC Systems
    Wu, Zhongqing
    Li, Xuehua
    Cai, Yuanxin
    Yuan, Weijie
    IEEE WIRELESS COMMUNICATIONS LETTERS, 2024, 13 (05) : 1384 - 1388
  • [35] Joint Optimal Placement and Dynamic Resource Allocation for multi-UAV Enhanced Reconfigurable Intelligent Surface Assisted Wireless Network
    Zhang, Yuzhu
    Qian, Lijun
    Xu, Hao
    2023 IEEE 20TH CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE, CCNC, 2023,
  • [36] Joint Trajectory and Communication Design for Buffer-Aided Multi-UAV Relaying Networks
    Cao, Dongju
    Yang, Wendong
    Xu, Gangyi
    APPLIED SCIENCES-BASEL, 2019, 9 (24):
  • [37] Multi-UAV Replacement and Trajectory Design for Coverage Continuity
    Gupta, Nishant
    Agarwal, Satyam
    Mishra, Deepak
    IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC 2022), 2022,
  • [38] Deep Reinforcement Learning Approach for Joint Trajectory Design in Multi-UAV IoT Networks
    Xu, Shu
    Zhan, Xiangyu
    Li, Chunguo
    Wang, Dongming
    Yang, Luxi
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (03) : 3389 - 3394
  • [39] Multi-Agent Deep Reinforcement Learning for Trajectory Design and Power Allocation in Multi-UAV Networks
    Zhao, Nan
    Liu, Zehua
    Cheng, Yiqiang
    IEEE ACCESS, 2020, 8 : 139670 - 139679
  • [40] Joint Node Assignment and Trajectory Optimization for Rechargeable Multi-UAV Aided IoT Systems
    Li, Xiaowei
    Yao, Haipeng
    Wang, Jingjing
    Liu, Yunjie
    2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,