Dense Multiagent Reinforcement Learning Aided Multi-UAV Information Coverage for Vehicular Networks

Cited by: 5
Authors
Fu, Hang [1 ,2 ]
Wang, Jingjing [1 ,2 ]
Chen, Jianrui [1 ,3 ]
Ren, Pengfei [1 ]
Zhang, Zheng [1 ]
Zhao, Guodong [4 ]
Affiliations
[1] Beihang Univ, Sch Cyber Sci & Technol, Beijing 100191, Peoples R China
[2] Xidian Univ, State Key Lab Integrated Serv Networks, Xian 710071, Peoples R China
[3] Peng Cheng Lab, Shenzhen 518000, Peoples R China
[4] Beihang Univ, Sch Aeronaut Sci & Engn, Beijing 100191, Peoples R China
Source
IEEE INTERNET OF THINGS JOURNAL | 2024 / Vol. 11 / Issue 12
Keywords
Heuristic algorithms; Autonomous aerial vehicles; Vehicle dynamics; Training; Internet of Things; Energy consumption; Decision making; Communication coverage; dense reinforcement learning; distributed multiunmanned aerial vehicle (UAV); multiagent reinforcement learning (MARL); vehicular networks; RESOURCE-ALLOCATION; COMMUNICATION; OPTIMIZATION; ALTITUDE; INTERNET;
DOI
10.1109/JIOT.2024.3367005
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology];
Discipline classification code
0812;
Abstract
With the rapid development of wireless communication networks, UAVs serving as base stations are increasingly being applied in various scenarios which not only include edge computation and task offloading, but also involve emergency communication, vehicular network enhancement, etc. In order to enhance the utility of UAV base stations' allocation and deployment, a series of algorithms have been proposed, utilizing heuristic methods, learning-based algorithms, or optimization approaches. However, it is intractable for current algorithms to handle the exponential computation increment with UAV base stations increasing, and complicated application scenarios with high dynamic demands. To solve the above issues, we formulate a decision problem with a long sequence to optimize the deployment of multi-UAV base stations for maximizing vehicular networks' communication coverage ratio, which needs to be subject to co-constraints consisting of moving velocity, energy consumption, and communication coverage radius. To solve this optimization problem, we creatively propose an algorithm named dense multiagent reinforcement learning (DMARL), which is under the dual-layer nested decision-making framework, centralized training with decentralized deployment, and accelerates training by only collecting critical states into the dense sampling buffer. To prove our proposed algorithm's effectiveness and generalization ability, we conduct experimental simulations in scenarios with different scales. Corresponding results have been provided to verify our algorithm's superiority in training efficiency and performance metrics, including coverage ratio and energy consumption, compared with other algorithms.
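As a rough illustration of the coverage-ratio objective mentioned in the abstract, the sketch below counts the fraction of vehicles that lie within the coverage radius of at least one UAV. It is only a simplified 2-D reading of the metric, not the paper's formulation: the function name `coverage_ratio`, the flat ground-plane coordinates, and the single shared `radius` are all assumptions made for this example.

```python
import math


def coverage_ratio(uavs, vehicles, radius):
    """Fraction of vehicles within `radius` of at least one UAV.

    Assumptions (not from the paper): positions are 2-D ground-plane
    (x, y) tuples and every UAV shares the same coverage radius.
    """
    if not vehicles:
        return 0.0
    covered = sum(
        1
        for vx, vy in vehicles
        if any(math.hypot(vx - ux, vy - uy) <= radius for ux, uy in uavs)
    )
    return covered / len(vehicles)


# Hypothetical positions: two UAVs and four vehicles.
uavs = [(0.0, 0.0), (10.0, 0.0)]
vehicles = [(1.0, 1.0), (9.0, 0.0), (5.0, 5.0), (20.0, 20.0)]
ratio = coverage_ratio(uavs, vehicles, radius=2.0)
```

In this toy setup two of the four vehicles fall inside a coverage disc, so `ratio` evaluates to 0.5. A deployment policy such as the one the abstract describes would aim to move the UAVs so that this value increases, subject to the velocity and energy constraints.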
Pages: 21274 - 21286
Page count: 13
Related papers
50 in total
  • [21] Federated Learning Assisted Multi-UAV Networks
    Zhang, Hongming
    Hanzo, Lajos
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (11) : 14104 - 14109
  • [22] Trajectory Design and Resource Allocation for Multi-UAV Networks: Deep Reinforcement Learning Approaches
    Chang, Zheng
    Deng, Hengwei
    You, Li
    Min, Geyong
    Garg, Sahil
    Kaddoum, Georges
    IEEE TRANSACTIONS ON NETWORK SCIENCE AND ENGINEERING, 2023, 10 (05) : 2940 - 2951
  • [23] Deep Reinforcement Learning Approach for Joint Trajectory Design in Multi-UAV IoT Networks
    Xu, Shu
    Zhan, Xiangyu
    Li, Chunguo
    Wang, Dongming
    Yang, Luxi
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (03) : 3389 - 3394
  • [24] Cooperative Multiagent Deep Reinforcement Learning Methods for UAV-Aided Mobile Edge Computing Networks
    Kim, Mintae
    Lee, Hoon
    Hwang, Sangwon
    Debbah, Merouane
    Lee, Inkyu
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (23) : 38040 - 38053
  • [25] Resource Allocation and Trajectory Design in UAV-Aided Cellular Networks Based on Multiagent Reinforcement Learning
    Yin, Sixing
    Yu, F. Richard
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (04) : 2933 - 2943
  • [26] On Designing Multi-UAV Aided Wireless Powered Dynamic Communication via Hierarchical Deep Reinforcement Learning
    Zhao, Ze Yu
    Che, Yue Ling
    Luo, Sheng
    Luo, Gege
    Wu, Kaishun
    Leung, Victor C. M.
    IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, 23 (12) : 13991 - 14004
  • [27] Resource Allocation in Vehicular Networks with Multi-UAV Served Edge Computing
    Wang, Yuhang
    He, Ying
    Dong, Minhui
    2021 IEEE 29TH INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (ICNP 2021), 2021
  • [28] A deep reinforcement learning based distributed multi-UAV dynamic area coverage algorithm for complex environment
    Xiao, Jian
    Yuan, Guohui
    Xue, Yuxi
    He, Jinhui
    Wang, Yaoting
    Zou, Yuanjiang
    Wang, Zhuoran
    NEUROCOMPUTING, 2024, 595
  • [29] Multi-Agent Deep Reinforcement Learning for Trajectory Design and Power Allocation in Multi-UAV Networks
    Zhao, Nan
    Liu, Zehua
    Cheng, Yiqiang
    IEEE ACCESS, 2020, 8 : 139670 - 139679
  • [30] Deep Reinforcement Learning-enabled Dynamic UAV Deployment and Power Control in Multi-UAV Wireless Networks
    Bai, Yu
    Chang, Zheng
    Jantti, Riku
    ICC 2024 - IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS, 2024 : 1286 - 1291