Collaborative Computing in Non-Terrestrial Networks: A Multi-Time-Scale Deep Reinforcement Learning Approach

被引:2
|
作者
Cao, Yang [1 ]
Lien, Shao-Yu [2 ]
Liang, Ying-Chang [3 ]
Niyato, Dusit [4 ]
Shen, Xuemin [5 ]
机构
[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu 611756, Peoples R China
[2] Natl Yang Ming Chiao Tung Univ, Inst Intelligent Syst, Tainan 711, Taiwan
[3] Univ Elect Sci & Technol China, Ctr Intelligent Networking & Commun CINC, Chengdu 611731, Peoples R China
[4] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
[5] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON, Canada
关键词
Low earth orbit satellites; Satellite broadcasting; Satellites; Optimization; Convergence; Resource management; 3GPP; Non-terrestrial networks (NTNs); earth-fixed cell; beam management; resource allocation; deep reinforcement learning (DRL); multi-time-scale Markov decision process (MMDPs);
D O I
10.1109/TWC.2023.3323554
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Constructing earth-fixed cells with low-earth orbit (LEO) satellites in non-terrestrial networks (NTNs) has been the most promising paradigm to enable global coverage. The limited computing capabilities on LEO satellites however render tackling resource optimization within a short duration a critical challenge. Although the sufficient computing capabilities of the ground infrastructures can be utilized to assist the LEO satellite, different time-scale control cycles and coupling decisions between the space- and ground-segments still obstruct the joint optimization design for computing agents at different segments. To address the above challenges, in this paper, a multi-time-scale deep reinforcement learning (DRL) scheme is developed for achieving the radio resource optimization in NTNs, in which the LEO satellite and user equipment (UE) collaborate with each other to perform individual decision-making tasks with different control cycles. Specifically, the UE updates its policy toward improving value functions of both the satellite and UE, while the LEO satellite only performs finite-step rollout for decision-makings based on the reference decision trajectory provided by the UE. Most importantly, rigorous analysis to guarantee the performance convergence of the proposed scheme is provided. Comprehensive simulations are conducted to justify the effectiveness of the proposed scheme in balancing the transmission performance and computational complexity.
引用
收藏
页码:4932 / 4949
页数:18
相关论文
共 50 条
  • [31] An auction approach to aircraft bandwidth scheduling in non-terrestrial networks
    Li, Xianglong
    Mo, Kaiwei
    Hou, Yeqiao
    Li, Zongpeng
    Xu, Hong
    Xue, Chun Jason
    COMPUTER NETWORKS, 2024, 247
  • [32] Federated Learning-based Jamming Detection for Tactical Terrestrial and Non-Terrestrial Networks
    Meftah, Aida
    Kaddoum, Georges
    Do, Tri Nhu
    Talhi, Chamseddine
    IEEE CONFERENCE ON GLOBAL COMMUNICATIONS, GLOBECOM, 2023, : 2154 - 2159
  • [33] Federated Learning-Enabled Jamming Detection for Stochastic Terrestrial and Non-Terrestrial Networks
    Meftah, Aida
    Do, Tri Nhu
    Kaddoum, Georges
    Talhi, Chamseddine
    IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING, 2025, 9 (01): : 271 - 290
  • [34] Uplink Time Synchronization Based on Time Drift Measurements in Non-Terrestrial Networks
    Yeo, Woon-Young
    Lee, Dong-Jun
    IEEE ACCESS, 2024, 12 : 168877 - 168893
  • [35] Scaling Collaborative Space Networks with Deep Multi-Agent Reinforcement Learning
    Ma, Ricky
    Hernandez, Gabe
    Hernandez, Carrie
    2023 IEEE COGNITIVE COMMUNICATIONS FOR AEROSPACE APPLICATIONS WORKSHOP, CCAAW, 2023,
  • [36] Learning-Based FEC for Non-Terrestrial Networks With Delayed Feedback
    Zhang, Feifan
    Li, Ye
    Wang, Jue
    Quek, Tony Q. S.
    IEEE COMMUNICATIONS LETTERS, 2022, 26 (02) : 306 - 310
  • [37] Distributed Learning Framework for Earth Observation on Multilayer Non-Terrestrial Networks
    Naseh, David
    Shinde, Swapnil Sadashiv
    Tarchi, Daniele
    2024 IEEE INTERNATIONAL CONFERENCE ON MACHINE LEARNING FOR COMMUNICATION AND NETWORKING, ICMLCN 2024, 2024, : 558 - 559
  • [38] The Multi-Time-Scale Management Optimization Method for Park Integrated Energy System Based on the Bi-Layer Deep Reinforcement Learning
    Chen M.
    Sun Y.
    Xie Z.
    Diangong Jishu Xuebao/Transactions of China Electrotechnical Society, 2023, 38 (07): : 1864 - 1881
  • [39] Task Assignment in Mobile Edge Computing Networks: A Deep Reinforcement Learning Approach
    Feng, Mingjie
    Zhao, Qi
    Sullivan, Nichole
    Chen, Genshe
    Pham, Khanh
    Blasch, Erik
    SENSORS AND SYSTEMS FOR SPACE APPLICATIONS XIV, 2021, 11755
  • [40] A Multiagent Deep Reinforcement Learning Approach for Multi-UAV Cooperative Search in Multilayered Aerial Computing Networks
    Wu, Jiaqi
    Luo, Jingjing
    Jiang, Changkun
    Gao, Lin
    IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (05): : 5807 - 5821