Optimal Frequency Reuse and Power Control in Multi-UAV Wireless Networks: Hierarchical Multi-Agent Reinforcement Learning Perspective

被引:8
|
作者
Lee, Seungmin [1 ,2 ]
Lim, Suhyeon [1 ,2 ]
Chae, Seong Ho [3 ]
Jung, Bang Chul [4 ]
Park, Chan Yi [5 ]
Lee, Howon [1 ,2 ]
机构
[1] Hankyong Natl Univ, Sch Elect & Elect Engn, Anseong 17579, South Korea
[2] Hankyong Natl Univ, Inst IT Convergence IITC, Anseong 17579, South Korea
[3] Tech Univ Korea, Dept Elect Engn, Siheung Si 15073, South Korea
[4] Chungnam Natl Univ, Dept Elect Engn, Daejeon 34134, South Korea
[5] Agcy Def Dev, Daejeon 34186, South Korea
关键词
Frequency conversion; Computer architecture; Time-frequency analysis; Microprocessors; Wireless networks; Q-learning; Autonomous aerial vehicles; Unmanned aerial vehicle; optimal frequency reuse; transmit power control; energy efficiency; hierarchical multi-agent Q-learning; multi-UAV wireless network; COVERAGE; ACCESS;
D O I
10.1109/ACCESS.2022.3166179
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To overcome the problems caused by the limited battery lifetime in multiple-unmanned aerial vehicle (UAV) wireless networks, we propose a hierarchical multi-agent reinforcement learning (RL) framework to maximize the energy efficiency (EE) of UAVs by finding the optimal frequency reuse factor and transmit power. The proposed algorithm consists of distributed inner-loop RL for transmit power control of the UAV terminal (UT) and centralized outer-loop RL for finding the optimal frequency reuse factor. Specifically, the proposed algorithm adjusts these two factors jointly to effectively mitigate intercell interference and reduce undesired transmit power consumption in multi-UAV wireless networks. We show that, for this reason, the proposed algorithm outperforms conventional algorithms, such as a random action algorithm with a fixed frequency reuse factor and a hierarchical multi-agent Q-learning algorithm with binary transmit power controls. Furthermore, even in the environment where UTs are continuously moving based on the mixed mobility model, we show that the proposed algorithm can find the best reward when compared to conventional algorithms.
引用
收藏
页码:39555 / 39565
页数:11
相关论文
共 50 条
  • [21] Joint Optimization of Multi-UAV Target Assignment and Path Planning Based on Multi-Agent Reinforcement Learning
    Qie, Han
    Shi, Dianxi
    Shen, Tianlong
    Xu, Xinhai
    Li, Yuan
    Wang, Liujing
    IEEE ACCESS, 2019, 7 : 146264 - 146272
  • [22] Multi-Agent Reinforcement Learning for Active Voltage Control on Power Distribution Networks
    Wang, Jianhong
    Xu, Wangkun
    Gu, Yunjie
    Song, Wenbin
    Green, Tim C.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [23] Optimal formation tracking control based on reinforcement learning for multi-UAV systems
    Wang, Weizhen
    Chen, Xin
    Jia, Jiangbo
    Wu, Kaili
    Xie, Mingyang
    CONTROL ENGINEERING PRACTICE, 2023, 141
  • [24] Multi-UAV Dynamic Wireless Networking With Deep Reinforcement Learning
    Wang, Qiang
    Zhang, Wenqi
    Liu, Yuanwei
    Liu, Ying
    IEEE COMMUNICATIONS LETTERS, 2019, 23 (12) : 2243 - 2246
  • [25] Trajectory Design and Power Control for Multi-UAV Assisted Wireless Networks: A Machine Learning Approach
    Liu, Xiao
    Liu, Yuanwei
    Chen, Yue
    Hanzo, Lajos
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2019, 68 (08) : 7957 - 7969
  • [26] Graph Convolutional Multi-Agent Reinforcement Learning for UAV Coverage Control
    Dai, Anna
    Li, Rongpeng
    Zhaot, Zhifeng
    Zhang, Honggang
    2020 12TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2020, : 1106 - 1111
  • [27] Autonomous and cooperative control of UAV cluster with multi-agent reinforcement learning
    Xu, D.
    Chen, G.
    AERONAUTICAL JOURNAL, 2022, 126 (1300): : 932 - 951
  • [28] Distributed Multi-agent Reinforcement Learning for Directional UAV Network Control
    He, Linsheng
    Zhao, Jiamiao
    Hu, Fei
    PROCEEDINGS OF THE 32ND INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE PARALLEL AND DISTRIBUTED COMPUTING, HPDC 2023, 2023, : 317 - 318
  • [29] A Nearly Optimal Multi-agent Formation Control with Reinforcement Learning
    Peng, Jiangwen
    Mu, Chaoxu
    Wang, Ke
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 5315 - 5320
  • [30] Optimal control in microgrid using multi-agent reinforcement learning
    Li, Fu-Dong
    Wu, Min
    He, Yong
    Chen, Xin
    ISA TRANSACTIONS, 2012, 51 (06) : 743 - 751