Deep Reinforcement Learning for User Association and Resource Allocation in Heterogeneous Cellular Networks

被引:323
|
作者
Zhao, Nan [1 ,2 ]
Liang, Ying-Chang [2 ]
Niyato, Dusit [3 ]
Pei, Yiyang [4 ]
Wu, Minghu [5 ]
Jiang, Yunhao [5 ]
机构
[1] Hubei Univ Technol, Hubei Collaborat Innovat Ctr High Efficiency Util, Wuhan 430068, Hubei, Peoples R China
[2] Univ Elect Sci & Technol China, CINC, Chengdu 611731, Sichuan, Peoples R China
[3] Nanyang Technol Univ, Sch Comp Sci & Engn, Singapore, Singapore
[4] Singapore Inst Technol, Infocomm Technol Cluster, Singapore, Singapore
[5] Hubei Univ Technol, Hubei Key Lab High Efficiency Utilizat Solar Ener, Wuhan 430068, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
Heterogeneous cellular networks; user association; resource allocation; multi-agent deep reinforcement learning; ACCESS; MANAGEMENT; SELECTION; HETNETS;
D O I
10.1109/TWC.2019.2933417
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Heterogeneous cellular networks can offload the mobile traffic and reduce the deployment costs, which have been considered to be a promising technique in the next-generation wireless network. Due to the non-convex and combinatorial characteristics, it is challenging to obtain an optimal strategy for the joint user association and resource allocation issue. In this paper, a reinforcement learning (RL) approach is proposed to achieve the maximum long-term overall network utility while guaranteeing the quality of service requirements of user equipments (UEs) in the downlink of heterogeneous cellular networks. A distributed optimization method based on multi-agent RL is developed. Moreover, to solve the computationally expensive problem with the large action space, multi-agent deep RL method is proposed. Specifically, the state, action and reward function are defined for UEs, and dueling double deep Q-network (D3QN) strategy is introduced to obtain the nearly optimal policy. Through message passing, the distributed UEs can obtain the global state space with a small communication overhead. With the double-Q strategy and dueling architecture, D3QN can rapidly converge to a subgame perfect Nash equilibrium. Simulation results demonstrate that D3QN achieves the better performance than other RL approaches in solving large-scale learning problems.
引用
收藏
页码:5141 / 5152
页数:12
相关论文
共 50 条
  • [31] Parallel Deep Reinforcement Learning based Online User Association Optimization in Heterogeneous Networks
    Li, Zhiyang
    Chen, Ming
    Wang, Kezhi
    Pan, Cunhua
    Huang, Nuo
    Hu, Yuntao
    2020 IEEE INTERNATIONAL CONFERENCE ON COMMUNICATIONS WORKSHOPS (ICC WORKSHOPS), 2020,
  • [32] Distributed User Association with Resource Partitioning in Heterogeneous Cellular Networks
    Tian-Qing Zhou
    Yong-Ming Huang
    Yuan Sun
    Lu-Xi Yang
    Wireless Personal Communications, 2017, 95 : 4131 - 4148
  • [33] Distributed User Association with Resource Partitioning in Heterogeneous Cellular Networks
    Zhou, Tian-Qing
    Huang, Yong-Ming
    Sun, Yuan
    Yang, Lu-Xi
    WIRELESS PERSONAL COMMUNICATIONS, 2017, 95 (04) : 4131 - 4148
  • [34] Deep Reinforcement Learning Based Caching Placement and User Association for Dynamic Cellular Networks
    Wang, Yue
    Feng, Chunyan
    Zhang, Tiankui
    2021 IEEE 32ND ANNUAL INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2021,
  • [35] BiLSTM Based Reinforcement Learning for Resource Allocation and User Association in LTE-U Networks
    Luo, Zhikun
    Yu, Guanding
    WIRELESS PERSONAL COMMUNICATIONS, 2020, 114 (03) : 2629 - 2641
  • [36] BiLSTM Based Reinforcement Learning for Resource Allocation and User Association in LTE-U Networks
    Zhikun Luo
    Guanding Yu
    Wireless Personal Communications, 2020, 114 : 2629 - 2641
  • [37] Resource Allocation for Heterogeneous Service in Green Mobile Edge Networks Using Deep Reinforcement Learning
    Sun, Si-yuan
    Zheng, Ying
    Zhou, Jun-hua
    Weng, Jiu-xing
    Wei, Yi-fei
    Wang, Xiao-jun
    KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS, 2021, 15 (07): : 2496 - 2512
  • [38] Combined Learning for Resource Allocation in Autonomous Heterogeneous Cellular Networks
    Chen, Xianfu
    Zhang, Honggang
    Chen, Tao
    Palicot, Jacques
    2013 IEEE 24TH INTERNATIONAL SYMPOSIUM ON PERSONAL, INDOOR, AND MOBILE RADIO COMMUNICATIONS (PIMRC), 2013, : 1061 - 1065
  • [39] Multi-agent deep reinforcement learning for user association and resource allocation in integrated terrestrial and non-terrestrial networks
    Birabwa, Denise Joanitah
    Ramotsoela, Daniel
    Ventura, Neco
    COMPUTER NETWORKS, 2023, 231
  • [40] Deep Reinforcement Learning Based Resource Allocation with Heterogeneous QoS for Cellular V2X
    Tian, Jin
    Shi, Yan
    Tong, Xiaolu
    Chen, Shanzhi
    Zhao, Rui
    2023 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE, WCNC, 2023,