Optimal Frequency Reuse and Power Control in Multi-UAV Wireless Networks: Hierarchical Multi-Agent Reinforcement Learning Perspective

被引:8
|
作者
Lee, Seungmin [1 ,2 ]
Lim, Suhyeon [1 ,2 ]
Chae, Seong Ho [3 ]
Jung, Bang Chul [4 ]
Park, Chan Yi [5 ]
Lee, Howon [1 ,2 ]
机构
[1] Hankyong Natl Univ, Sch Elect & Elect Engn, Anseong 17579, South Korea
[2] Hankyong Natl Univ, Inst IT Convergence IITC, Anseong 17579, South Korea
[3] Tech Univ Korea, Dept Elect Engn, Siheung Si 15073, South Korea
[4] Chungnam Natl Univ, Dept Elect Engn, Daejeon 34134, South Korea
[5] Agcy Def Dev, Daejeon 34186, South Korea
关键词
Frequency conversion; Computer architecture; Time-frequency analysis; Microprocessors; Wireless networks; Q-learning; Autonomous aerial vehicles; Unmanned aerial vehicle; optimal frequency reuse; transmit power control; energy efficiency; hierarchical multi-agent Q-learning; multi-UAV wireless network; COVERAGE; ACCESS;
D O I
10.1109/ACCESS.2022.3166179
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
To overcome the problems caused by the limited battery lifetime in multiple-unmanned aerial vehicle (UAV) wireless networks, we propose a hierarchical multi-agent reinforcement learning (RL) framework to maximize the energy efficiency (EE) of UAVs by finding the optimal frequency reuse factor and transmit power. The proposed algorithm consists of distributed inner-loop RL for transmit power control of the UAV terminal (UT) and centralized outer-loop RL for finding the optimal frequency reuse factor. Specifically, the proposed algorithm adjusts these two factors jointly to effectively mitigate intercell interference and reduce undesired transmit power consumption in multi-UAV wireless networks. We show that, for this reason, the proposed algorithm outperforms conventional algorithms, such as a random action algorithm with a fixed frequency reuse factor and a hierarchical multi-agent Q-learning algorithm with binary transmit power controls. Furthermore, even in the environment where UTs are continuously moving based on the mixed mobility model, we show that the proposed algorithm can find the best reward when compared to conventional algorithms.
引用
收藏
页码:39555 / 39565
页数:11
相关论文
共 50 条
  • [41] Emergent Communication in Multi-Agent Reinforcement Learning for Future Wireless Networks
    Chafii M.
    Naoumi S.
    Alami R.
    Almazrouei E.
    Bennis M.
    Debbah M.
    IEEE Internet of Things Magazine, 2023, 6 (04): : 18 - 24
  • [42] Multi-Agent Model-Based Reinforcement Learning for Trajectory Design and Power Control in UAV-Enabled Networks
    Zhou, Shiyang
    Cheng, Yufan
    Lei, Xia
    2022 3RD INFORMATION COMMUNICATION TECHNOLOGIES CONFERENCE (ICTC 2022), 2022, : 33 - 38
  • [43] Hierarchical Control of Multi-Agent Systems using Online Reinforcement Learning
    Bai, He
    George, Jemin
    Chakrabortty, Aranya
    2020 AMERICAN CONTROL CONFERENCE (ACC), 2020, : 340 - 345
  • [44] Hierarchical graph multi-agent reinforcement learning for traffic signal control
    Yang, Shantian
    INFORMATION SCIENCES, 2023, 634 : 55 - 72
  • [45] Sustainable Task Offloading in UAV Networks via Multi-Agent Reinforcement Learning
    Sacco, Alessio
    Esposito, Flavio
    Marchetto, Guido
    Montuschi, Paolo
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2021, 70 (05) : 5003 - 5015
  • [46] Multi-Agent Reinforcement Learning-Based Resource Allocation for UAV Networks
    Cui, Jingjing
    Liu, Yuanwei
    Nallanathan, Arumugam
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2020, 19 (02) : 729 - 743
  • [47] Distributed reinforcement learning in multi-agent networks
    Kar, Soummya
    Moura, Jose M. F.
    Poor, H. Vincent
    2013 IEEE 5TH INTERNATIONAL WORKSHOP ON COMPUTATIONAL ADVANCES IN MULTI-SENSOR ADAPTIVE PROCESSING (CAMSAP 2013), 2013, : 296 - +
  • [48] Studies on hierarchical reinforcement learning in multi-agent environment
    Yu Lasheng
    Marin, Alonso
    Hong Fei
    Lin Jian
    PROCEEDINGS OF 2008 IEEE INTERNATIONAL CONFERENCE ON NETWORKING, SENSING AND CONTROL, VOLS 1 AND 2, 2008, : 1714 - 1720
  • [49] Multi-Agent Hierarchical Reinforcement Learning with Dynamic Termination
    Han, Dongge
    Boehmer, Wendelin
    Wooldridge, Michael
    Rogers, Alex
    AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2006 - 2008
  • [50] Multi-agent hierarchical reinforcement learning for energy management
    Jendoubi, Imen
    Bouffard, Francois
    APPLIED ENERGY, 2023, 332