Collaborative Computing in Non-Terrestrial Networks: A Multi-Time-Scale Deep Reinforcement Learning Approach

被引:2
|
作者
Cao, Yang [1 ]
Lien, Shao-Yu [2 ]
Liang, Ying-Chang [3 ]
Niyato, Dusit [4 ]
Shen, Xuemin [5 ]
机构
[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu 611756, Peoples R China
[2] Natl Yang Ming Chiao Tung Univ, Inst Intelligent Syst, Tainan 711, Taiwan
[3] Univ Elect Sci & Technol China, Ctr Intelligent Networking & Commun CINC, Chengdu 611731, Peoples R China
[4] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
[5] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON, Canada
关键词
Low earth orbit satellites; Satellite broadcasting; Satellites; Optimization; Convergence; Resource management; 3GPP; Non-terrestrial networks (NTNs); earth-fixed cell; beam management; resource allocation; deep reinforcement learning (DRL); multi-time-scale Markov decision process (MMDPs);
D O I
10.1109/TWC.2023.3323554
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Constructing earth-fixed cells with low-earth orbit (LEO) satellites in non-terrestrial networks (NTNs) has been the most promising paradigm to enable global coverage. The limited computing capabilities on LEO satellites however render tackling resource optimization within a short duration a critical challenge. Although the sufficient computing capabilities of the ground infrastructures can be utilized to assist the LEO satellite, different time-scale control cycles and coupling decisions between the space- and ground-segments still obstruct the joint optimization design for computing agents at different segments. To address the above challenges, in this paper, a multi-time-scale deep reinforcement learning (DRL) scheme is developed for achieving the radio resource optimization in NTNs, in which the LEO satellite and user equipment (UE) collaborate with each other to perform individual decision-making tasks with different control cycles. Specifically, the UE updates its policy toward improving value functions of both the satellite and UE, while the LEO satellite only performs finite-step rollout for decision-makings based on the reference decision trajectory provided by the UE. Most importantly, rigorous analysis to guarantee the performance convergence of the proposed scheme is provided. Comprehensive simulations are conducted to justify the effectiveness of the proposed scheme in balancing the transmission performance and computational complexity.
引用
收藏
页码:4932 / 4949
页数:18
相关论文
共 50 条
  • [41] Machine Learning-Based Solutions for Handover Decisions in Non-Terrestrial Networks
    Dahouda, Mwamba Kasongo
    Jin, Sihwa
    Joe, Inwhee
    ELECTRONICS, 2023, 12 (08)
  • [42] Multi-Connectivity in 5G and Beyond Non-Terrestrial Networks
    Majamaa, Mikko
    Martikainen, Henrik
    Sormunen, Lauri
    Puttonen, Jani
    JOURNAL OF COMMUNICATIONS SOFTWARE AND SYSTEMS, 2022, 18 (04) : 350 - 358
  • [43] Deep Reinforcement Learning-based Task Offloading in Satellite-Terrestrial Edge Computing Networks
    Zhu, Dali
    Liu, Haitao
    Li, Ting
    Sun, Jiyan
    Liang, Jie
    Zhang, Hangsheng
    Geng, Liru
    Liu, Yudong
    2021 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2021,
  • [44] Analysis and Control of Multi-Time-Scale Modular Directed Heterogeneous Networks
    Lazri, Anes
    Panteley, Elena
    Loria, Antonio
    IEEE TRANSACTIONS ON CONTROL OF NETWORK SYSTEMS, 2025, 12 (01): : 661 - 672
  • [45] Collaborative path penetration in 5G-IoT networks: A multi-agent deep reinforcement learning approach
    Shen, Hang
    Li, Xiang
    Wang, Yan
    Wang, Tianjing
    Bai, Guangwei
    PEER-TO-PEER NETWORKING AND APPLICATIONS, 2025, 18 (03)
  • [46] Collaborative Computation Offloading and Resource Allocation in Multi-UAV-Assisted IoT Networks: A Deep Reinforcement Learning Approach
    Seid, Abegaz Mohammed
    Boateng, Gordon Owusu
    Anokye, Stephen
    Kwantwi, Thomas
    Sun, Guolin
    Liu, Guisong
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (15) : 12203 - 12218
  • [47] Multi-time-scale synchronization and information processing in bursting neuron networks
    T. Pereira
    M. S. Baptista
    J. Kurths
    The European Physical Journal Special Topics, 2007, 146 : 155 - 168
  • [48] Collaborative Caching in Edge Computing via Federated Learning and Deep Reinforcement Learning
    Wang, Yali
    Chen, Jiachao
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [49] Cooperative Control of Multi-Time-Scale Agent Networks Under Digraphs
    Wei, Li-Mei
    Yang, Wu
    Hua, Tong
    IEEE ACCESS, 2023, 11 : 30796 - 30806
  • [50] Fair collaborative vehicle routing: A deep multi-agent reinforcement learning approach
    Mak, Stephen
    Xu, Liming
    Pearce, Tim
    Ostroumov, Michael
    Brintrup, Alexandra
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2023, 157