Exploiting Deep Reinforcement Learning for Stochastic AoI Minimization in Multi-UAV-assisted Wireless Networks

被引:0
|
作者
Long, Yusi [1 ,2 ]
Zhuang, Jialin [1 ]
Gong, Shimin [1 ,2 ]
Gu, Bo [1 ]
Xu, Jing [3 ]
Deng, Jing [4 ]
机构
[1] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Shenzhen Campus, Shenzhen, Peoples R China
[2] Guangdong Prov Key Lab Fire Sci & Intelligent Eme, Guangzhou, Peoples R China
[3] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan, Hubei, Peoples R China
[4] UNC Greensboro, Dept Comp Sci, Greensboro, NC USA
基金
中国国家自然科学基金;
关键词
UAV; backscatter; NOMA; DRL; trajectory planning; Lyapunov optimization; INFORMATION; AGE;
D O I
10.1109/WCNC57260.2024.10570857
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we consider a multiple unmanned aerial vehicles (UAVs)-assisted wireless sensing network, where low-power ground users (GUs) periodically sense the environmental information and upload the recent sensing information to a base station (BS). The GUs firstly backscatter their information to the UAVs and then the UAVs transmit the information to the BS by the non-orthogonal multiple access (NOMA) transmissions. Our goal is to minimize the long-term age-of-information (AoI) by jointly optimizing the UAV's sensing scheduling, transmission control, and trajectories. To solve this problem, we propose the Lyapunov-driven hierarchical proximal policy optimization framework, named Lya-HPPO, to decouple the multi-stage AoI minimization problem into several control subproblems. In each control subproblem, the UAVs' sensing scheduling and transmission control are firstly determined by the outer-loop deep reinforcement learning (DRL) approach, and then the inner-loop optimization module is to update the UAVs' trajectories. Simulation results verify that the proposed Lya-HPPO framework converges very fast to a stable value and can make online decisions in real time, while guaranteeing the long-term data buffer and AoI stability.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Power Minimization in Wireless Sensor Networks With Constrained AoI Using Stochastic Optimization
    Moltafet, Mohammad
    Leinonen, Markus
    Codreanu, Marian
    Pappas, Nikolaos
    CONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2019, : 406 - 410
  • [42] Position based Throughput Maximization of Multi-UAV-assisted Relay Networks
    Singh, Sandeep Kumar
    Agrawal, Kamal
    Singh, Keshav
    Li, Chih-Peng
    Huang, Wan-Jen
    2020 IEEE INTERNATIONAL CONFERENCE ON ADVANCED NETWORKS AND TELECOMMUNICATIONS SYSTEMS (IEEE ANTS), 2020,
  • [43] Resource Allocation in UAV-Assisted Wireless Networks Using Reinforcement Learning
    Luong, Phuong
    Gagnon, Francois
    Labeau, Fabrice
    2020 IEEE 92ND VEHICULAR TECHNOLOGY CONFERENCE (VTC2020-FALL), 2020,
  • [44] Graph-Attention-Based Reinforcement Learning for Trajectory Design and Resource Assignment in Multi-UAV-Assisted Communication
    Feng, Zikai
    Wu, Di
    Huang, Mengxing
    Yuen, Chau
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (16): : 27421 - 27434
  • [45] Trajectory Planning in UAV-Assisted Wireless Networks via Reinforcement Learning
    He, Simeng
    Zhang, Shangwei
    2022 IEEE 23RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE SWITCHING AND ROUTING (IEEE HPSR), 2022, : 232 - 237
  • [46] Deep-Reinforcement-Learning-Based Placement for Integrated Access Backhauling in UAV-Assisted Wireless Networks
    Wang, Yuhui
    Farooq, Junaid
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (08): : 14727 - 14738
  • [47] On the Peak AoI of UAV-Assisted IoT Networks: A Stochastic Geometry Approach
    Qin, Yujie
    Kishk, Mustafa A.
    Alouini, Mohamed-Slim
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (05): : 8676 - 8689
  • [48] AoI optimal UAV trajectory planning: A Deep Recurrent Reinforcement Learning Approach
    Wu, Mengjie
    Chi, Huijia
    Gan, Shuying
    Wang, Xijun
    Xu, Chao
    2021 IEEE 32ND ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2021,
  • [49] Deep Reinforcement Learning Assisted UAV Trajectory and Resource Optimization for NOMA Networks
    Chen, Peixin
    Zhao, Jian
    Shen, Furao
    2022 14TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING, WCSP, 2022, : 933 - 938
  • [50] Deep Reinforcement Learning Based AoI Minimization for NOMA-Enabled Integrated Satellite-Terrestrial Networks
    He, Xinyu
    Yang, Yang
    Lee, Jemin
    He, Gang
    Yan, Qing
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (02) : 3567 - 3572