Collaborative Deep Reinforcement Learning for Resource Optimization in Non-Terrestrial Networks

被引:1
|
作者
Cao, Yang [1 ,2 ]
Lien, Shao-Yu [3 ]
Liang, Ying-Chang [1 ,2 ]
Niyato, Dusit [4 ]
Shen, Xuemin [5 ]
机构
[1] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Huzhou, Huzhou, Peoples R China
[2] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[3] Natl Yang Ming Chiao Tung Univ, Tainan, Taiwan
[4] Nanyang Technol Univ, Singapore, Singapore
[5] Univ Waterloo, Waterloo, ON, Canada
基金
新加坡国家研究基金会; 国家重点研发计划;
关键词
Non-terrestrial networks (NTNs); earth-fixed cell; resource allocation; deep reinforcement learning (DRL); multi-time-scale; Markov decision process (MMDPs);
D O I
10.1109/PIMRC56721.2023.10294047
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Non-terrestrial networks (NTNs) with low-earth orbit (LEO) satellites have been regarded as promising remedies to support global ubiquitous wireless services. Due to the rapid mobility of LEO satellite, inter-beam/satellite handovers happen frequently for a specific user equipment (UE). To tackle this issue, earth-fixed cell scenarios have been under studied, in which the LEO satellite adjusts its beam direction towards a fixed area within its dwell duration, to maintain stable transmission performance for the UE. Therefore, it is required that the LEO satellite performs real-time resource allocation, which however is unaffordable by the LEO satellite with limited computing capability. To address this issue, in this paper, we propose a two-time-scale collaborative deep reinforcement learning (DRL) scheme for beam management and resource allocation in NTNs, in which LEO satellite and UE with different control cycles update their decision-making policies through a sequential manner. Specifically, UE updates its policy subject to improving the value functions of both the agents. Furthermore, the LEO satellite only makes decisions through finitestep rollouts with a reference decision trajectory received from the UE. Simulation results show that the proposed scheme can effectively balance the throughput performance and computational complexity over traditional greedy-searching schemes.
引用
收藏
页数:7
相关论文
共 50 条
  • [21] Cooperative Resource Allocation in Integrated Terrestrial/Non-Terrestrial 5G and Beyond Networks
    Rinaldi, F.
    Pizzi, S.
    Molinaro, A.
    Iera, A.
    Araniti, G.
    2020 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), 2020,
  • [22] Technical Analysis of Non-terrestrial Networks
    Lin, Pingping
    20TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC 2024, 2024, : 975 - 979
  • [23] Federated Learning-based Jamming Detection for Tactical Terrestrial and Non-Terrestrial Networks
    Meftah, Aida
    Kaddoum, Georges
    Do, Tri Nhu
    Talhi, Chamseddine
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 2154 - 2159
  • [24] Federated Learning-Enabled Jamming Detection for Stochastic Terrestrial and Non-Terrestrial Networks
    Meftah, Aida
    Do, Tri Nhu
    Kaddoum, Georges
    Talhi, Chamseddine
    IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2025, 9 (01): : 271 - 290
  • [25] AI-Based Optimization of Handover Strategy in Non-Terrestrial Networks
    ZHANG Chenchen
    ZHANG Nan
    CAO Wei
    TIAN Kaibo
    YANG Zhen
    ZTECommunications, 2021, 19 (04) : 98 - 104
  • [26] UAV Communications in Integrated Terrestrial and Non-terrestrial Networks
    Benzaghta, Mohamed
    Geraci, Giovanni
    Nikbakht, Rasoul
    Lopez-Perez, David
    2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 3706 - 3711
  • [27] Integrating LEO Satellites and Multi-UAV Reinforcement Learning for Hybrid FSO/RF Non-Terrestrial Networks
    Lee, Ju-Hyung
    Park, Jihong
    Bennis, Mehdi
    Ko, Young-Chai
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (03) : 3647 - 3662
  • [28] Multi-agent reinforcement learning for cooperative trajectory design of UAV-BS fleets in terrestrial/non-terrestrial integrated networks
    Hoang, Linh T.
    Nguyen, Chuyen T.
    Le, Hoang D.
    Pham, Anh T.
    IEICE COMMUNICATIONS EXPRESS, 2024, 13 (08): : 327 - 330
  • [29] Learning-Based FEC for Non-Terrestrial Networks With Delayed Feedback
    Zhang, Feifan
    Li, Ye
    Wang, Jue
    Quek, Tony Q. S.
    IEEE COMMUNICATIONS LETTERS, 2022, 26 (02) : 306 - 310
  • [30] Distributed Learning Framework for Earth Observation on Multilayer Non-Terrestrial Networks
    Naseh, David
    Shinde, Swapnil Sadashiv
    Tarchi, Daniele
    2024 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING FOR COMMUNICATION AND NETWORKING, ICMLCN 2024, 2024, : 558 - 559