Collaborative Computing in Non-Terrestrial Networks: A Multi-Time-Scale Deep Reinforcement Learning Approach

Cited by: 2
Authors
Cao, Yang [1]
Lien, Shao-Yu [2]
Liang, Ying-Chang [3]
Niyato, Dusit [4]
Shen, Xuemin [5]
Affiliations
[1] Southwest Jiaotong Univ, Sch Informat Sci & Technol, Chengdu 611756, Peoples R China
[2] Natl Yang Ming Chiao Tung Univ, Inst Intelligent Syst, Tainan 711, Taiwan
[3] Univ Elect Sci & Technol China, Ctr Intelligent Networking & Commun CINC, Chengdu 611731, Peoples R China
[4] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore 639798, Singapore
[5] Univ Waterloo, Dept Elect & Comp Engn, Waterloo, ON, Canada
Keywords
Low earth orbit satellites; Satellite broadcasting; Satellites; Optimization; Convergence; Resource management; 3GPP; Non-terrestrial networks (NTNs); earth-fixed cell; beam management; resource allocation; deep reinforcement learning (DRL); multi-time-scale Markov decision processes (MMDPs)
DOI
10.1109/TWC.2023.3323554
Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809
Abstract
Constructing earth-fixed cells with low-earth orbit (LEO) satellites in non-terrestrial networks (NTNs) has been the most promising paradigm to enable global coverage. The limited computing capabilities of LEO satellites, however, make resource optimization within a short duration a critical challenge. Although the ample computing capabilities of ground infrastructure can be used to assist the LEO satellite, the different time-scale control cycles and the coupled decisions between the space and ground segments still obstruct a joint optimization design for computing agents in different segments. To address these challenges, this paper develops a multi-time-scale deep reinforcement learning (DRL) scheme for radio resource optimization in NTNs, in which the LEO satellite and the user equipment (UE) collaborate to perform individual decision-making tasks with different control cycles. Specifically, the UE updates its policy to improve the value functions of both the satellite and the UE, while the LEO satellite performs only a finite-step rollout for decision-making based on the reference decision trajectory provided by the UE. Most importantly, a rigorous analysis guaranteeing the performance convergence of the proposed scheme is provided. Comprehensive simulations demonstrate the effectiveness of the proposed scheme in balancing transmission performance and computational complexity.
Pages: 4932-4949
Number of pages: 18