Collaborative Deep Reinforcement Learning for Resource Optimization in Non-Terrestrial Networks

被引：1

作者：

Cao, Yang ^{[1
,2
]}

Lien, Shao-Yu ^{[3
]}

Liang, Ying-Chang ^{[1
,2
]}

Niyato, Dusit ^{[4
]}

Shen, Xuemin ^{[5
]}

机构：

[1] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Huzhou, Huzhou, Peoples R China

[2] Univ Elect Sci & Technol China, Chengdu, Peoples R China

[3] Natl Yang Ming Chiao Tung Univ, Tainan, Taiwan

[4] Nanyang Technol Univ, Singapore, Singapore

[5] Univ Waterloo, Waterloo, ON, Canada

来源：

2023 IEEE 34TH ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS, PIMRC | 2023年

基金：

新加坡国家研究基金会; 国家重点研发计划;

关键词：

Non-terrestrial networks (NTNs); earth-fixed cell; resource allocation; deep reinforcement learning (DRL); multi-time-scale; Markov decision process (MMDPs);

D O I：

10.1109/PIMRC56721.2023.10294047

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

Non-terrestrial networks (NTNs) with low-earth orbit (LEO) satellites have been regarded as promising remedies to support global ubiquitous wireless services. Due to the rapid mobility of LEO satellite, inter-beam/satellite handovers happen frequently for a specific user equipment (UE). To tackle this issue, earth-fixed cell scenarios have been under studied, in which the LEO satellite adjusts its beam direction towards a fixed area within its dwell duration, to maintain stable transmission performance for the UE. Therefore, it is required that the LEO satellite performs real-time resource allocation, which however is unaffordable by the LEO satellite with limited computing capability. To address this issue, in this paper, we propose a two-time-scale collaborative deep reinforcement learning (DRL) scheme for beam management and resource allocation in NTNs, in which LEO satellite and UE with different control cycles update their decision-making policies through a sequential manner. Specifically, UE updates its policy subject to improving the value functions of both the agents. Furthermore, the LEO satellite only makes decisions through finitestep rollouts with a reference decision trajectory received from the UE. Simulation results show that the proposed scheme can effectively balance the throughput performance and computational complexity over traditional greedy-searching schemes.

引用

页数：7

共 50 条

[21] Cooperative Resource Allocation in Integrated Terrestrial/Non-Terrestrial 5G and Beyond Networks
Rinaldi, F.
Pizzi, S.
Molinaro, A.
Iera, A.
Araniti, G.
2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
[22] Technical Analysis of Non-terrestrial Networks
Lin, Pingping
20TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC 2024, 2024, : 975 - 979
[23] Federated Learning-based Jamming Detection for Tactical Terrestrial and Non-Terrestrial Networks
Meftah, Aida
Kaddoum, Georges
Do, Tri Nhu
Talhi, Chamseddine
IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 2154 - 2159
[24] Federated Learning-Enabled Jamming Detection for Stochastic Terrestrial and Non-Terrestrial Networks
Meftah, Aida
Do, Tri Nhu
Kaddoum, Georges
Talhi, Chamseddine
IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2025, 9 (01): : 271 - 290
[25] AI-Based Optimization of Handover Strategy in Non-Terrestrial Networks
ZHANG Chenchen
ZHANG Nan
CAO Wei
TIAN Kaibo
YANG Zhen
ZTECommunications, 2021, 19 (04) : 98 - 104
[26] UAV Communications in Integrated Terrestrial and Non-terrestrial Networks
Benzaghta, Mohamed
Geraci, Giovanni
Nikbakht, Rasoul
Lopez-Perez, David
2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 3706 - 3711
[27] Integrating LEO Satellites and Multi-UAV Reinforcement Learning for Hybrid FSO/RF Non-Terrestrial Networks
Lee, Ju-Hyung
Park, Jihong
Bennis, Mehdi
Ko, Young-Chai
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (03) : 3647 - 3662
[28] Multi-agent reinforcement learning for cooperative trajectory design of UAV-BS fleets in terrestrial/non-terrestrial integrated networks
Hoang, Linh T.
Nguyen, Chuyen T.
Le, Hoang D.
Pham, Anh T.
IEICE COMMUNICATIONS EXPRESS, 2024, 13 (08): : 327 - 330
[29] Learning-Based FEC for Non-Terrestrial Networks With Delayed Feedback
Zhang, Feifan
Li, Ye
Wang, Jue
Quek, Tony Q. S.
IEEE COMMUNICATIONS LETTERS, 2022, 26 (02) : 306 - 310
[30] Distributed Learning Framework for Earth Observation on Multilayer Non-Terrestrial Networks
Naseh, David
Shinde, Swapnil Sadashiv
Tarchi, Daniele
2024 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING FOR COMMUNICATION AND NETWORKING, ICMLCN 2024, 2024, : 558 - 559

← 1 2 3 4 5 →