Multi-Task Deep Reinforcement Learning for Terahertz NOMA Resource Allocation With Hybrid Discrete and Continuous Actions

被引:2
|
作者
Hu, Zhifeng [1 ]
Han, Chong [1 ,2 ,3 ]
Deng, Yansha [4 ]
Wang, Xudong [5 ]
机构
[1] Shanghai Jiao Tong Univ, Terahertz Wireless Commun TWC Lab, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Dept Engn, Shanghai 200240, Peoples R China
[3] Shanghai Jiao Tong Univ, Cooperat Medianet Innovat Ctr CMIC, Shanghai 200240, Peoples R China
[4] Kings Coll London, Dept Engn, London WC2R 2LS, England
[5] Shanghai Jiao Tong Univ, Univ Michigan Shanghai Jiao Tong Univ UM SJTU Join, Shanghai 200240, Peoples R China
基金
中国国家自然科学基金;
关键词
Resource management; NOMA; Throughput; Terahertz communications; Wireless communication; Multitasking; Hybrid power systems; Deep reinforcement learning (DRL); non-orthogonal multiple access (NOMA); Terahertz (THz) networks; NONORTHOGONAL MULTIPLE-ACCESS; POWER ALLOCATION; JOINT POWER; MIMO-NOMA; SYSTEMS; NETWORKS; COMMUNICATION; INTERFERENCE; CHALLENGES; CAPACITY;
D O I
10.1109/TVT.2024.3381238
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Terahertz (THz) non-orthogonal multiple access (NOMA) networks have great potential for next-generation wireless communications, by providing promising ultra-high data rates and user fairness. In THz-NOMA networks, efficient and effective long-term beamforming-bandwidth-power (BBP) allocation is yet an open problem due to its non-deterministic polynomial-time hard (NP-hard) nature. In this article, the continuous property of power and sub-arrays ratios assignment and the discrete property of sub-bands allocation are carefully treated. In light of these attributes, an offline hybrid discrete and continuous actions (DISCO) multi-task deep reinforcement learning (DRL) algorithm is proposed to maximize the long-term throughput. Specifically, the deployment of multi-task learning enables the actor of DISCO to smartly integrate two state-of-the-art DRL algorithms, e.g., actor-critic (AC) that only selects discrete actions and deep deterministic policy gradient (DDPG) that only generates continuous actions. Rigorous theoretical derivations for the neural network design and backpropagation process are provided to tailor our proposed DISCO for the BBP problem. Compared to the benchmark no-learning and conventional DRL algorithms, DISCO enhances the network throughput, while achieving good fairness among users. Furthermore, DISCO consumes hundred-of-millisecond computational time, revealing the practicability of DISCO.
引用
收藏
页码:11647 / 11663
页数:17
相关论文
共 50 条
  • [1] Optimization of Deep Reinforcement Learning with Hybrid Multi-Task Learning
    Varghese, Nelson Vithayathil
    Mahmoud, Qusay H.
    2021 15TH ANNUAL IEEE INTERNATIONAL SYSTEMS CONFERENCE (SYSCON 2021), 2021,
  • [2] Multi-Task Deep Reinforcement Learning for Continuous Action Control
    Yang, Zhaoyang
    Merrick, Kathryn
    Abbass, Hussein
    Jin, Lianwen
    PROCEEDINGS OF THE TWENTY-SIXTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 3301 - 3307
  • [3] Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous Control
    Xu, Zhiyuan
    Wu, Kun
    Che, Zhengping
    Tang, Jian
    Ye, Jieping
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [4] A Hybrid Multi-Task Learning Approach for Optimizing Deep Reinforcement Learning Agents
    Varghese, Nelson Vithayathil
    Mahmoud, Qusay H.
    IEEE ACCESS, 2021, 9 : 44681 - 44703
  • [5] Deep Reinforcement Learning for Resource Allocation in Multi-Band and Hybrid OMA-NOMA Wireless Networks
    Chaieb, Cirine
    Abdelkefi, Fatma
    Ajib, Wessam
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2023, 71 (01) : 187 - 198
  • [6] Multi-Task Resource Allocation and Task Offloading via Multi-Agent Deep Reinforcement Learning in Edge-Cloud system
    Tian, Guoqing
    Wang, Xilong
    Li, Xin
    Qin, Xiaolin
    2024 5TH INTERNATIONAL CONFERENCE ON COMPUTING, NETWORKS AND INTERNET OF THINGS, CNIOT 2024, 2024, : 370 - 377
  • [7] A Survey of Multi-Task Deep Reinforcement Learning
    Vithayathil Varghese, Nelson
    Mahmoud, Qusay H.
    ELECTRONICS, 2020, 9 (09) : 1 - 21
  • [8] Multi-Task Deep Reinforcement Learning with PopArt
    Hessel, Matteo
    Soyer, Hubert
    Espeholt, Lasse
    Czarnecki, Wojciech
    Schmitt, Simon
    van Hasselt, Hado
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 3796 - 3803
  • [9] Attentive Multi-task Deep Reinforcement Learning
    Bram, Timo
    Brunner, Gino
    Richter, Oliver
    Wattenhofer, Roger
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2019, PT III, 2020, 11908 : 134 - 149
  • [10] Resource Allocation for Multi-service NOMA System Based on Deep Reinforcement Learning
    Zhang, Zhenyu
    Zheng, Weijun
    Shao, Weiping
    Zhang, Yong
    Guo, Da
    HUMAN CENTERED COMPUTING, HCC 2021, 2022, 13795 : 219 - 231