Deep Reinforcement Learning-Based Dynamic Spectrum Access for D2D Communication Underlay Cellular Networks

Cited by: 21
Authors
Huang, Jingfei [1]
Yang, Yang [1]
He, Gang [1]
Xiao, Yang [1]
Liu, Jun [1]
Affiliations
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
Funding
Beijing Natural Science Foundation; National Natural Science Foundation of China
Keywords
Device-to-device communication; Interference; Throughput; Resource management; Cellular networks; Heuristic algorithms; Training; dynamic spectrum access; time slots; deep reinforcement learning; double deep Q-network; RESOURCE-ALLOCATION;
DOI
10.1109/LCOMM.2021.3079920
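Chinese Library Classification (CLC) number
TN [Electronic technology, communication technology]
Discipline classification code
0809
Abstract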
This letter investigates a deep reinforcement learning (DRL)-based spectrum access scheme for device-to-device (D2D) communication underlay cellular networks. Specifically, cellular users (CUEs) and D2D pairs attempt to access the time slots (TSs) of a shared spectrum, and TSs are dynamically scheduled to CUEs in different frames. Based on the DRL theory, D2D pairs can be seen as a centralized agent which aims to learn an optimal spectrum access strategy to maximize the sum throughput without any prior information. In particular, with different locations of CUEs, the spectrum access manners for D2D communication are changed to ensure the communication quality of CUEs at the cell edge. Then, a double deep Q-network (DDQN) based D2D spectrum access (D⁴SA) algorithm is proposed, which makes D2D pairs learn to decide whether to access the spectrum in different TSs. Moreover, to ensure the fairness of resource allocation among D2D pairs, we improve the proposed algorithm and incorporate fairness into the objective function. Simulation results show that our proposed algorithm can achieve an optimal sum throughput close to the theoretical upper bound, where the performance is significantly improved compared to the scheme based on base station cooperation.
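The abstract names a double deep Q-network (DDQN) as the learning method but gives no implementation detail. As a rough illustration of the generic double DQN update only, and not the letter's actual algorithm, the sketch below computes the training targets: the online network picks the best next action and the target network evaluates it. The function name `ddqn_targets`, the discount factor 0.9, the two-action encoding (0 = stay silent, 1 = access the time slot), and all numbers are assumptions chosen for illustration.

```python
def ddqn_targets(rewards, next_q_online, next_q_target, dones, gamma=0.9):
    """Generic double DQN targets (hypothetical helper, not from the letter).

    The online network's Q-values select the next action; the target
    network's Q-values score that selected action.
    """
    targets = []
    for r, q_on, q_tg, done in zip(rewards, next_q_online, next_q_target, dones):
        if done:
            # Terminal transition: no bootstrapped future value.
            targets.append(r)
            continue
        # Action selection uses the online network's estimates.
        best = max(range(len(q_on)), key=q_on.__getitem__)
        # Action evaluation uses the target network's estimates.
        targets.append(r + gamma * q_tg[best])
    return targets


# Toy batch of two transitions (all values are made up).
# Assumed action encoding: 0 = stay silent, 1 = access the time slot.
rewards = [1.0, 0.0]
next_q_online = [[0.2, 0.8], [0.5, 0.1]]
next_q_target = [[0.3, 0.4], [0.6, 0.9]]
dones = [False, True]

targets = ddqn_targets(rewards, next_q_online, next_q_target, dones)
print(targets)
```

Separating selection from evaluation is what distinguishes double DQN from plain DQN, which would take the maximum of the target network's values directly and tends to overestimate them.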
Pages: 2614-2618
Page count: 5
Related papers
50 in total
  • [1] Deep Reinforcement Learning Based Resource Allocation for D2D Communications Underlay Cellular Networks
    Yu, Seoyoung
    Lee, Jeong Woo
    SENSORS, 2022, 22 (23)
  • [2] A deep reinforcement learning-based D2D spectrum allocation underlaying a cellular network
    Liang, Yao-Jen
    Tseng, Yu-Chan
    Hsieh, Chi-Wen
    WIRELESS NETWORKS, 2025, 31 (01) : 435 - 441
  • [3] Deep Reinforcement Learning-Based Optimization Method for D2D Communication Energy Efficiency in Heterogeneous Cellular Networks
    Pan, Ziyu
    Yang, Jie
    IEEE ACCESS, 2024, 12 : 140439 - 140455
  • [4] Deep reinforcement learning-based resource allocation for D2D communications in heterogeneous cellular networks
    Zhi, Yuan
    Tian, Jie
    Deng, Xiaofang
    Qiao, Jingping
    Lu, Dianjie
    DIGITAL COMMUNICATIONS AND NETWORKS, 2022, 8 (05) : 834 - 842
  • [6] Multi-Agent Deep Reinforcement Learning Based Spectrum Allocation for D2D Underlay Communications
    Li, Zheng
    Guo, Caili
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (02) : 1828 - 1840
  • [7] Adaptive spectrum-shared association for controlled underlay D2D communication in cellular networks
    Radaydeh, Redha M.
    Al-Qahtani, Fawaz S.
    Celik, Abdulkadir
    Alouini, Mohamed-Slim
    Tayem, Nizar
    IET COMMUNICATIONS, 2019, 13 (18) : 3075 - 3087
  • [8] Deep-Reinforcement-Learning-Based Proportional Fair Scheduling Control Scheme for Underlay D2D Communication
    Budhiraja, Ishan
    Kumar, Neeraj
    Tyagi, Sudhanshu
    IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (05) : 3143 - 3156
  • [9] D2D Communication Underlay in Uplink Cellular Networks with Distance Based Power Control
    Zhang, Zekun
    Hu, Rose Qingyang
    Qian, Yi
    2016 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS (ICC), 2016,
  • [10] Deep Reinforcement Learning-based Data Transmission for D2D Communications
    Moussaid, Achraf
    Jaafar, Wael
    Ajib, Wessam
    Elbiaze, Halima
    2018 14TH INTERNATIONAL CONFERENCE ON WIRELESS AND MOBILE COMPUTING, NETWORKING AND COMMUNICATIONS (WIMOB 2018), 2018,