Decentralized Trajectory and Power Control Based on Multi-Agent Deep Reinforcement Learning in UAV Networks

被引:9
|
作者
Chen, Binqiang [1 ]
Liu, Dong [1 ]
Hanzo, Lajos [2 ]
机构
[1] Beihang Univ, Beijing, Peoples R China
[2] Unveristy Southampton, Southampton, Hants, England
基金
中国国家自然科学基金;
关键词
UAV; multi-agent deep reinforcement learning; MADDPG; power allocation; trajectory planning; UNMANNED AERIAL VEHICLES;
D O I
10.1109/ICC45855.2022.9838637
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Unmanned aerial vehicles (UAVs) are capable of enhancing the coverage of existing cellular networks by acting as aerial base stations (ABSs). Due to the limited on-board battery capacity and dynamic topology of UAV networks, trajectory planning and interference coordination are crucial for providing satisfactory service, especially in emergency scenarios, where it is unrealistic to control all UAVs in a centralized manner by gathering global user information. Hence, we solve the decentralized joint trajectory and transmit power control problem of multi-UAV ABS networks. Our goal is to maximize the number of satisfied users, while minimizing the overall energy consumption of UAVs. To allow each UAV to adjust its position and transmit power solely based on local-rather the global-observations, a multi-agent reinforcement learning (MARL) framework is conceived. In order to overcome the non-stationarity issue of MARL and to endow the UAVs with distributed decision making capability, we resort to the centralized training in conjunction with decentralized execution paradigm. By judiciously designing the reward, we propose a decentralized joint trajectory and power control (DTPC) algorithm with significantly reduced complexity. Our simulation results show that the proposed DTPC algorithm outperforms the state-of-the-art deep reinforcement learning based methods, despite its low complexity.
引用
收藏
页码:3983 / 3988
页数:6
相关论文
共 50 条
  • [31] Value Propagation for Decentralized Networked Deep Multi-agent Reinforcement Learning
    Qu, Chao
    Mannor, Shie
    Xu, Huan
    Qi, Yuan
    Song, Le
    Xiong, Junwu
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [32] Deep Reinforcement Learning for Trajectory Design and Power Allocation in UAV Networks
    Zhao, Nan
    Cheng, Yiqiang
    Pei, Yiyang
    Liang, Ying-Chang
    Niyato, Dusit
    ICC 2020 - 2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2020,
  • [33] Joint UAV trajectory and communication design with heterogeneous multi-agent reinforcement learning
    Zhou, Xuanhan
    Xiong, Jun
    Zhao, Haitao
    Liu, Xiaoran
    Ren, Baoquan
    Zhang, Xiaochen
    Wei, Jibo
    Yin, Hao
    SCIENCE CHINA-INFORMATION SCIENCES, 2024, 67 (03)
  • [34] Joint UAV trajectory and communication design with heterogeneous multi-agent reinforcement learning
    Xuanhan ZHOU
    Jun XIONG
    Haitao ZHAO
    Xiaoran LIU
    Baoquan REN
    Xiaochen ZHANG
    Jibo WEI
    Hao YIN
    ScienceChina(InformationSciences), 2024, 67 (03) : 225 - 245
  • [35] Decentralized Deterministic Multi-Agent Reinforcement Learning
    Grosnit, Antoine
    Cai, Desmond
    Wynter, Laura
    2021 60TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2021, : 1548 - 1553
  • [36] Securing UAV Communication Based on Multi-Agent Deep Reinforcement Learning in the Presence of Smart UAV Eavesdropper
    Wen, Chaoyang
    Fang, Yuan
    Qiu, Ling
    2022 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2022, : 1164 - 1169
  • [37] Power Allocation and Energy Cooperation for UAV-Enabled MmWave Networks: A Multi-Agent Deep Reinforcement Learning Approach
    Domingo, Mari Carmen
    SENSORS, 2022, 22 (01)
  • [38] Distributed Power Control for Large Energy Harvesting Networks: A Multi-Agent Deep Reinforcement Learning Approach
    Sharma, Mohit K.
    Zappone, Alessio
    Assaad, Mohamad
    Debbah, Merouane
    Vassilaras, Spyridon
    IEEE TRANSACTIONS ON COGNITIVE COMMUNICATIONS AND NETWORKING, 2019, 5 (04) : 1140 - 1154
  • [39] Joint Optimization of Handover Control and Power Allocation Based on Multi-Agent Deep Reinforcement Learning
    Guo, Delin
    Tang, Lan
    Zhang, Xinggan
    Liang, Ying-Chang
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (11) : 13124 - 13138
  • [40] Multi-Agent Deep Reinforcement Learning for Joint Decoupled User Association and Trajectory Design in Full-Duplex Multi-UAV Networks
    Dai, Chen
    Zhu, Kun
    Hossain, Ekram
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2023, 22 (10) : 6056 - 6070