Multi-Task Deep Reinforcement Learning for Terahertz NOMA Resource Allocation With Hybrid Discrete and Continuous Actions

被引:2
|
作者
Hu, Zhifeng [1 ]
Han, Chong [1 ,2 ,3 ]
Deng, Yansha [4 ]
Wang, Xudong [5 ]
机构
[1] Shanghai Jiao Tong Univ, Terahertz Wireless Commun TWC Lab, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Dept Engn, Shanghai 200240, Peoples R China
[3] Shanghai Jiao Tong Univ, Cooperat Medianet Innovat Ctr CMIC, Shanghai 200240, Peoples R China
[4] Kings Coll London, Dept Engn, London WC2R 2LS, England
[5] Shanghai Jiao Tong Univ, Univ Michigan Shanghai Jiao Tong Univ UM SJTU Join, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
Resource management; NOMA; Throughput; Terahertz communications; Wireless communication; Multitasking; Hybrid power systems; Deep reinforcement learning (DRL); non-orthogonal multiple access (NOMA); Terahertz (THz) networks; NONORTHOGONAL MULTIPLE-ACCESS; POWER ALLOCATION; JOINT POWER; MIMO-NOMA; SYSTEMS; NETWORKS; COMMUNICATION; INTERFERENCE; CHALLENGES; CAPACITY;
D O I
10.1109/TVT.2024.3381238
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Terahertz (THz) non-orthogonal multiple access (NOMA) networks have great potential for next-generation wireless communications, by providing promising ultra-high data rates and user fairness. In THz-NOMA networks, efficient and effective long-term beamforming-bandwidth-power (BBP) allocation is yet an open problem due to its non-deterministic polynomial-time hard (NP-hard) nature. In this article, the continuous property of power and sub-arrays ratios assignment and the discrete property of sub-bands allocation are carefully treated. In light of these attributes, an offline hybrid discrete and continuous actions (DISCO) multi-task deep reinforcement learning (DRL) algorithm is proposed to maximize the long-term throughput. Specifically, the deployment of multi-task learning enables the actor of DISCO to smartly integrate two state-of-the-art DRL algorithms, e.g., actor-critic (AC) that only selects discrete actions and deep deterministic policy gradient (DDPG) that only generates continuous actions. Rigorous theoretical derivations for the neural network design and backpropagation process are provided to tailor our proposed DISCO for the BBP problem. Compared to the benchmark no-learning and conventional DRL algorithms, DISCO enhances the network throughput, while achieving good fairness among users. Furthermore, DISCO consumes hundred-of-millisecond computational time, revealing the practicability of DISCO.
引用
收藏
页码:11647 / 11663
页数:17
相关论文
共 50 条
  • [31] Sparse Multi-Task Reinforcement Learning
    Calandriello, Daniele
    Lazaric, Alessandro
    Restelli, Marcello
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27
  • [32] Sparse multi-task reinforcement learning
    Calandriello, Daniele
    Lazaric, Alessandro
    Restelli, Marcello
    INTELLIGENZA ARTIFICIALE, 2015, 9 (01) : 5 - 20
  • [33] Multi-task Learning with Modular Reinforcement Learning
    Xue, Jianyong
    Alexandre, Frederic
    FROM ANIMALS TO ANIMATS 16, 2022, 13499 : 127 - 138
  • [34] Energy-Efficient Resource Allocation in Uplink NOMA Systems with Deep Reinforcement Learning
    Zhang, Yuhan
    Wang, Xiaoming
    Xu, Youyun
    2019 11TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS AND SIGNAL PROCESSING (WCSP), 2019,
  • [35] Computation Offloading and Resource Allocation in NOMA-MEC: A Deep Reinforcement Learning Approach
    Shang, Ce
    Sun, Yan
    Luo, Hong
    Guizani, Mohsen
    IEEE INTERNET OF THINGS JOURNAL, 2023, 10 (17) : 15464 - 15476
  • [36] Study on deep reinforcement learning for multi-task scheduling in cloud manufacturing
    Xiao, Jiuhong
    Cai, Yishuai
    Chen, Yong
    INTERNATIONAL JOURNAL OF COMPUTER INTEGRATED MANUFACTURING, 2025,
  • [37] NOMA Assisted Multi-Task Multi-Access Mobile Edge Computing via Deep Reinforcement Learning for Industrial Internet of Things
    Qian, Liping
    Wu, Yuan
    Jiang, Fuli
    Yu, Ningning
    Lu, Weidang
    Lin, Bin
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 17 (08) : 5688 - 5698
  • [38] Decision making on robot with multi-task using deep reinforcement learning for each task
    Shimoguchi, Yuya
    Kurashige, Kentarou
    2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2019, : 3460 - 3465
  • [39] Computational task offloading algorithm based on deep reinforcement learning and multi-task dependency
    Zhang, Xiaoqi
    Lin, Tengxiang
    Lin, Cheng-Kuan
    Chen, Zhen
    Cheng, Hongju
    THEORETICAL COMPUTER SCIENCE, 2024, 993
  • [40] Unsupervised Task Clustering for Multi-task Reinforcement Learning
    Ackermann, Johannes
    Richter, Oliver
    Wattenhofer, Roger
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, 2021, 12975 : 222 - 237