Deep Reinforcement Learning-Based Dynamic Spectrum Access for D2D Communication Underlay Cellular Networks

被引:21
|
作者
Huang, Jingfei [1 ]
Yang, Yang [1 ]
He, Gang [1 ]
Xiao, Yang [1 ]
Liu, Jun [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
基金
北京市自然科学基金; 中国国家自然科学基金;
关键词
Device-to-device communication; Interference; Throughput; Resource management; Cellular networks; Heuristic algorithms; Training; dynamic spectrum access; time slots; deep reinforcement learning; double deep Q-network; RESOURCE-ALLOCATION;
D O I
10.1109/LCOMM.2021.3079920
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
This letter investigates a deep reinforcement learning (DRL)-based spectrum access scheme for device-to-device (D2D) communication underlay cellular networks. Specifically, cellular users (CUEs) and D2D pairs attempt to access the time slots (TSs) of a shared spectrum, and TSs are dynamically scheduled to CUEs in different frames. Based on the DRL theory, D2D pairs can be seen as a centralized agent which aims to learn an optimal spectrum access strategy to maximize the sum throughput without any prior information. In particular, with different locations of CUEs, the spectrum access manners for D2D communication are changed to ensure the communication quality of CUEs at the cell edge. Then, a double deep Q-network (DDQN) based D2D spectrum access (D(4)SA) algorithm is proposed, which makes D2D pairs learn to decide whether to access the spectrum in different TSs. Moreover, to ensure the fairness of resource allocation among D2D pairs, we improve the proposed algorithm and incorporate fairness into the objective function. Simulation results show that our proposed algorithm can achieve an optimal sum throughput close to the theoretical upper bound, where the performance is significantly improved compared to the scheme based on base station cooperation.
引用
收藏
页码:2614 / 2618
页数:5
相关论文
共 50 条
  • [31] Simultaneous wireless information and power transfer in heterogeneous cellular networks with underlay D2D communication
    Sreelakshmy, K. R.
    Jacob, Lillykutty
    WIRELESS NETWORKS, 2020, 26 (05) : 3315 - 3330
  • [32] Resource Allocation Using Particle Swarm Optimization for D2D Communication Underlay of Cellular Networks
    Su, Lin
    Ji, Yusheng
    Wang, Ping
    Liu, Fuqiang
    2013 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE (WCNC), 2013, : 129 - 133
  • [33] Analytical Modeling of Mode Selection and Power Control for Underlay D2D Communication in Cellular Networks
    ElSawy, Hesham
    Hossain, Ekram
    Alouini, Mohamed-Slim
    IEEE TRANSACTIONS ON COMMUNICATIONS, 2014, 62 (11) : 4147 - 4161
  • [34] Power Allocation Approach for Underlay D2D Communication in Cellular Network
    Pawar, Praveen
    Trivedi, Aditya
    2017 CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGY (CICT), 2017,
  • [35] Performance Improvement for Device-to-Device (D2D) Users in Underlay Cellular Communication Networks
    Zhong, Bin
    Lin, Hehong
    Chen, Liang
    Zhang, Zhongshan
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2024, 18 (09): : 2805 - 2817
  • [36] Simultaneous wireless information and power transfer in heterogeneous cellular networks with underlay D2D communication
    K. R. Sreelakshmy
    Lillykutty Jacob
    Wireless Networks, 2020, 26 : 3315 - 3330
  • [37] Underlay D2D Communication in a Finite Cellular Network with Exclusion Zone
    Guo, Jing
    Durrani, Salman
    Zhou, Xiangyun
    Yanikomeroglu, Halim
    2017 IEEE 86TH VEHICULAR TECHNOLOGY CONFERENCE (VTC-FALL), 2017,
  • [38] Dynamic Spectrum Access for D2D Networks: A Hypergraph Game Approach
    Zhu, Xucheng
    Liu, Xin
    Xu, Yuhua
    Zhang, Yuli
    Ruan, Lang
    Yang, Yang
    2017 17TH IEEE INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY (ICCT 2017), 2017, : 861 - 866
  • [39] Dynamic Power Control Based on FFR for D2D Communication Underlaying Cellular Networks
    Jiang, Fan
    Wang, Xian-Chao
    Li, Chen-Bi
    Shen, Bin-Yan
    2016 8TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS & SIGNAL PROCESSING (WCSP), 2016,
  • [40] Sparse CNN and Deep Reinforcement Learning-Based D2D Scheduling in UAV-Assisted Industrial IoT Networks
    Tuong, Van Dat
    Noh, Wonjong
    Cho, Sungrae
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 20 (01) : 213 - 223