Collaborative Deep Reinforcement Learning for Resource Optimization in Non-Terrestrial Networks

Cited by: 1
Authors
Cao, Yang [1 ,2 ]
Lien, Shao-Yu [3 ]
Liang, Ying-Chang [1 ,2 ]
Niyato, Dusit [4 ]
Shen, Xuemin [5 ]
Affiliations
[1] Univ Elect Sci & Technol China, Yangtze Delta Reg Inst Huzhou, Huzhou, Peoples R China
[2] Univ Elect Sci & Technol China, Chengdu, Peoples R China
[3] Natl Yang Ming Chiao Tung Univ, Tainan, Taiwan
[4] Nanyang Technol Univ, Singapore, Singapore
[5] Univ Waterloo, Waterloo, ON, Canada
Funding
National Research Foundation, Singapore; National Key R&D Program of China
Keywords
Non-terrestrial networks (NTNs); earth-fixed cell; resource allocation; deep reinforcement learning (DRL); multi-time-scale Markov decision processes (MMDPs)
DOI
10.1109/PIMRC56721.2023.10294047
Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract
Non-terrestrial networks (NTNs) with low-earth orbit (LEO) satellites are regarded as a promising means of supporting globally ubiquitous wireless services. Because LEO satellites move rapidly, inter-beam and inter-satellite handovers occur frequently for a given user equipment (UE). To tackle this issue, earth-fixed cell scenarios have been studied, in which the LEO satellite steers its beam towards a fixed area throughout its dwell duration to maintain stable transmission performance for the UE. This requires the LEO satellite to perform real-time resource allocation, which is unaffordable given its limited computing capability. To address this issue, we propose a two-time-scale collaborative deep reinforcement learning (DRL) scheme for beam management and resource allocation in NTNs, in which the LEO satellite and the UE, operating with different control cycles, update their decision-making policies sequentially. Specifically, the UE updates its policy subject to improving the value functions of both agents. Furthermore, the LEO satellite makes decisions only through finite-step rollouts with a reference decision trajectory received from the UE. Simulation results show that the proposed scheme balances throughput performance and computational complexity more effectively than traditional greedy-search schemes.
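The finite-step rollout idea mentioned in the abstract can be illustrated with a toy sketch. This is not the authors' algorithm: the integer state, the reward, the candidate action set, the fixed "+1" reference trajectory, and the names rollout_decision, toy_step and run_two_time_scale are all assumptions made for illustration. The sketch also assumes that the agent supplying the reference trajectory has the longer control cycle, which the abstract does not state.

```python
def rollout_decision(state, candidates, reference, step, horizon):
    """Return (action, return) for the candidate with the best finite-step rollout.

    Each rollout applies the candidate action first and then follows the
    reference trajectory for the remaining horizon - 1 steps.
    """
    best_action, best_return = None, float("-inf")
    for action in candidates:
        s, total = state, 0.0
        for act in [action] + list(reference[: horizon - 1]):
            s, reward = step(s, act)
            total += reward
        if total > best_return:  # strict comparison: the first best candidate wins ties
            best_action, best_return = action, total
    return best_action, best_return


def toy_step(state, action):
    # Toy dynamics (assumption): integer state, reward penalises distance from 3.
    next_state = state + action
    return next_state, -abs(next_state - 3)


def run_two_time_scale(num_slow_cycles, fast_steps_per_cycle, horizon=3):
    """Slow agent refreshes the reference once per cycle; fast agent acts every step."""
    state, visited = 0, []
    for _ in range(num_slow_cycles):
        # Hypothetical slow-agent policy: always suggest moving by +1.
        reference = [1] * horizon
        for _ in range(fast_steps_per_cycle):
            action, _ = rollout_decision(state, (-1, 0, 1), reference, toy_step, horizon)
            state, _ = toy_step(state, action)
            visited.append(state)
    return visited


if __name__ == "__main__":
    print(run_two_time_scale(num_slow_cycles=2, fast_steps_per_cycle=3))
```

The fast agent evaluates only a handful of short rollouts per step instead of searching the full decision space, which is the kind of computation saving the abstract attributes to the scheme.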
Pages: 7
Related papers
50 in total
  • [1] Multi-Tier Deep Reinforcement Learning for Non-Terrestrial Networks
    Cao, Yang
    Lien, Shao-Yu
    Liang, Ying-Chang
    Niyato, Dusit
    IEEE WIRELESS COMMUNICATIONS, 2024, 31 (03) : 194 - 201
  • [2] Collaborative Computing in Non-Terrestrial Networks: A Multi-Time-Scale Deep Reinforcement Learning Approach
    Cao, Yang
    Lien, Shao-Yu
    Liang, Ying-Chang
    Niyato, Dusit
    Shen, Xuemin
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2024, 23 (05) : 4932 - 4949
  • [3] Toward Intelligent Non-Terrestrial Networks Through Symbiotic Radio: A Collaborative Deep Reinforcement Learning Scheme
    Cao, Yang
    Lien, Shao-Yu
    Liang, Ying-Chang
    Niyato, Dusit
    IEEE NETWORK, 2025, 39 (01) : 211 - 219
  • [4] Multi-agent deep reinforcement learning for user association and resource allocation in integrated terrestrial and non-terrestrial networks
    Birabwa, Denise Joanitah
    Ramotsoela, Daniel
    Ventura, Neco
    COMPUTER NETWORKS, 2023, 231
  • [5] Autonomous Non-Terrestrial Base Station Deployment for Non-Terrestrial Networks: A Reinforcement Learning Approach
    Lien, Shao-Yu
    Deng, Der-Jiunn
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2022, 71 (10) : 10894 - 10909
  • [6] Deep Reinforcement Learning For Multi-User Access Control in Non-Terrestrial Networks
    Cao, Yang
    Lien, Shao-Yu
    Liang, Ying-Chang
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2021, 69 (03) : 1605 - 1619
  • [7] Fair resource optimization for cooperative non-terrestrial vehicular networks
    Dutta, Ashit Kumar
    Alruwais, Nuha
    Alabdulkreem, Eatedal
    Negm, Noha
    Darem, Abdulbasit A.
    Al Duhayyim, Mesfer
    Khan, Wali Ullah
    Nauman, Ali
    COMPUTER NETWORKS, 2024, 251
  • [8] Multi-tier Collaborative Deep Reinforcement Learning for Non-terrestrial Network Empowered Vehicular Connections
    Cao, Yang
    Lien, Shao-Yu
    Liang, Ying-Chang
    2021 IEEE 29TH INTERNATIONAL CONFERENCE ON NETWORK PROTOCOLS (ICNP 2021), 2021
  • [9] Multi-Agent Deep Reinforcement Learning for Interference-Aware Channel Allocation in Non-Terrestrial Networks
    Cho, Yeongi
    Yang, Wooyeol
    Oh, Daesub
    Jo, Han-Shin
    IEEE COMMUNICATIONS LETTERS, 2023, 27 (03) : 936 - 940
  • [10] Deep Reinforcement Learning for UAV Placement over Mixed FSO/RF-Based Non-terrestrial Networks
    Nguyen, Tinh V.
    Le, Hoang D.
    Mai, Vuong
    Swaminathan, R.
    Pham, Anh T.
    2024 IEEE VTS ASIA PACIFIC WIRELESS COMMUNICATIONS SYMPOSIUM, APWCS 2024, 2024