Exploiting Deep Reinforcement Learning for Stochastic AoI Minimization in Multi-UAV-assisted Wireless Networks

Cited by: 0

Authors:

Long, Yusi ^{[1,2]}

Zhuang, Jialin ^{[1]}

Gong, Shimin ^{[1,2]}

Gu, Bo ^{[1]}

Xu, Jing ^{[3]}

Deng, Jing ^{[4]}

Affiliations:

[1] Sun Yat Sen Univ, Sch Intelligent Syst Engn, Shenzhen Campus, Shenzhen, Peoples R China

[2] Guangdong Prov Key Lab Fire Sci & Intelligent Eme, Guangzhou, Peoples R China

[3] Huazhong Univ Sci & Technol, Sch Elect Informat & Commun, Wuhan, Hubei, Peoples R China

[4] UNC Greensboro, Dept Comp Sci, Greensboro, NC USA

Source:

2024 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC 2024 | 2024

Funding:

National Natural Science Foundation of China;

Keywords:

UAV; backscatter; NOMA; DRL; trajectory planning; Lyapunov optimization; INFORMATION; AGE;

DOI:

10.1109/WCNC57260.2024.10570857

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology];

Discipline classification code:

0812;

Abstract:

In this paper, we consider a wireless sensing network assisted by multiple unmanned aerial vehicles (UAVs), where low-power ground users (GUs) periodically sense environmental information and upload their most recent sensing information to a base station (BS). The GUs first backscatter their information to the UAVs, and the UAVs then forward it to the BS using non-orthogonal multiple access (NOMA) transmissions. Our goal is to minimize the long-term age of information (AoI) by jointly optimizing the UAVs' sensing scheduling, transmission control, and trajectories. To solve this problem, we propose a Lyapunov-driven hierarchical proximal policy optimization framework, named Lya-HPPO, which decouples the multi-stage AoI minimization problem into several control subproblems. In each control subproblem, the UAVs' sensing scheduling and transmission control are first determined by an outer-loop deep reinforcement learning (DRL) approach, and an inner-loop optimization module then updates the UAVs' trajectories. Simulation results verify that the proposed Lya-HPPO framework converges quickly to a stable value and can make online decisions in real time, while guaranteeing long-term data buffer and AoI stability.

Pages: 6

50 items in total

[41] Power Minimization in Wireless Sensor Networks With Constrained AoI Using Stochastic Optimization
Moltafet, Mohammad
Leinonen, Markus
Codreanu, Marian
Pappas, Nikolaos
CONFERENCE RECORD OF THE 2019 FIFTY-THIRD ASILOMAR CONFERENCE ON SIGNALS, SYSTEMS & COMPUTERS, 2019: 406-410
[42] Position based Throughput Maximization of Multi-UAV-assisted Relay Networks
Singh, Sandeep Kumar
Agrawal, Kamal
Singh, Keshav
Li, Chih-Peng
Huang, Wan-Jen
2020 IEEE INTERNATIONAL CONFERENCE ON ADVANCED NETWORKS AND TELECOMMUNICATIONS SYSTEMS (IEEE ANTS), 2020
[43] Resource Allocation in UAV-Assisted Wireless Networks Using Reinforcement Learning
Luong, Phuong
Gagnon, Francois
Labeau, Fabrice
2020 IEEE 92ND VEHICULAR TECHNOLOGY CONFERENCE (VTC2020-FALL), 2020
[44] Graph-Attention-Based Reinforcement Learning for Trajectory Design and Resource Assignment in Multi-UAV-Assisted Communication
Feng, Zikai
Wu, Di
Huang, Mengxing
Yuen, Chau
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (16): 27421-27434
[45] Trajectory Planning in UAV-Assisted Wireless Networks via Reinforcement Learning
He, Simeng
Zhang, Shangwei
2022 IEEE 23RD INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE SWITCHING AND ROUTING (IEEE HPSR), 2022: 232-237
[46] Deep-Reinforcement-Learning-Based Placement for Integrated Access Backhauling in UAV-Assisted Wireless Networks
Wang, Yuhui
Farooq, Junaid
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (08): 14727-14738
[47] On the Peak AoI of UAV-Assisted IoT Networks: A Stochastic Geometry Approach
Qin, Yujie
Kishk, Mustafa A.
Alouini, Mohamed-Slim
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (05): 8676-8689
[48] AoI optimal UAV trajectory planning: A Deep Recurrent Reinforcement Learning Approach
Wu, Mengjie
Chi, Huijia
Gan, Shuying
Wang, Xijun
Xu, Chao
2021 IEEE 32ND ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2021
[49] Deep Reinforcement Learning Assisted UAV Trajectory and Resource Optimization for NOMA Networks
Chen, Peixin
Zhao, Jian
Shen, Furao
2022 14TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING, WCSP, 2022: 933-938
[50] Deep Reinforcement Learning Based AoI Minimization for NOMA-Enabled Integrated Satellite-Terrestrial Networks
He, Xinyu
Yang, Yang
Lee, Jemin
He, Gang
Yan, Qing
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2025, 74 (02): 3567-3572