Decentralized Federated Reinforcement Learning for User-Centric Dynamic TFDD Control

被引:7
|
作者
Yin, Ziyan [1 ]
Wang, Zhe [2 ]
Li, Jun [1 ]
Ding, Ming [3 ]
Chen, Wen [4 ]
Jin, Shi [5 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Elect & Opt Engn, Nanjing 210094, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[3] CSIRO, Data61, Sydney, NSW 2015, Australia
[4] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[5] Southeast Univ, Natl Mobile Commun Res Lab, Nanjing 210096, Peoples R China
基金
中国国家自然科学基金;
关键词
Heuristic algorithms; Resource management; Quality of service; Time-frequency analysis; Interference; Fading channels; Dynamic scheduling; Dynamic TFDD; decentralized partially observable Markov decision process; federated learning; multi-agent reinforcement learning; resource allocation; NETWORKS; OPTIMIZATION; MANAGEMENT; ALLOCATION; SYSTEMS; 5G;
D O I
10.1109/JSTSP.2022.3221671
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The explosive growth of dynamic and heterogeneous data traffic brings great challenges for 5G and beyond mobile networks. To enhance the network capacity and reliability, we propose a learning-based dynamic time-frequency division duplexing (D-TFDD) scheme that adaptively allocates the uplink and downlink time-frequency resources of base stations (BSs) to meet the asymmetric and heterogeneous traffic demands while alleviating the inter-cell interference. We formulate the problem as a decentralized partially observable Markov decision process (Dec-POMDP) that maximizes the long-term expected sum rate under the users' packet dropping ratio constraints. In order to jointly optimize the global resources in a decentralized manner, we propose a federated reinforcement learning (RL) algorithm named federated Wolpertinger deep deterministic policy gradient (FWDDPG) algorithm. The BSs decide their local time-frequency configurations through RL algorithms and achieve global training via exchanging local RL models with their neighbors under a decentralized federated learning framework. Specifically, to deal with the large-scale discrete action space of each BS, we adopt a DDPG-based algorithm to generate actions in a continuous space, and then utilize Wolpertinger policy to reduce the mapping errors from continuous action space back to discrete action space. Simulation results demonstrate the superiority of our proposed algorithm to the benchmark algorithms with respect to system sum rate.
引用
收藏
页码:40 / 53
页数:14
相关论文
共 50 条
  • [31] Dynamic and user-centric network selection in heterogeneous networks
    Cai, Xuejun
    Chen, Ling
    Sofia, Rute
    Wu, Yanqi
    2007 IEEE INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE, VOLS 1 AND 2, 2007, : 538 - 544
  • [32] User-centric AP Clustering with Deep Reinforcement Learning for Cell-Free Massive MIMO
    Tsukamoto, Yu
    Ikami, Akio
    Aihara, Naoki
    Murakami, Takahide
    Shinbo, Hiroyuki
    Amano, Yoshiaki
    PROCEEDINGS OF THE INT'L ACM SYMPOSIUM ON MOBILITY MANAGEMENT AND WIRELESS ACCESS, MOBIWAC 2023, 2023, : 17 - 24
  • [33] Dynamic User-Centric Clustered Workplaces for COVID-19 Control Measures Based on Geofencing and Deep Learning
    Abd El-Haleem, Ahmed M.
    Mohamed, Noor El-Deen M.
    Abdelhakam, Mostafa M.
    Elmesalawy, Mahmoud M.
    PROCEEDINGS OF 2022 7TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING TECHNOLOGIES, ICMLT 2022, 2022, : 230 - 236
  • [34] User-Centric Spectrum Sharing in Dynamic Network Architecture
    Shafigh, Alireza Shams
    Glisic, Savo
    2016 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2016,
  • [35] A dynamic QoE routing system for user-centric applications
    Hai Anh Tran
    Mellouk, Abdelhamid
    Hoceini, Said
    Augustin, Brice
    Zeadally, Sherali
    TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2013, 24 (03): : 266 - 279
  • [36] MMS: A user-centric portal for e-learning
    Allison, C
    Bain, A
    Ling, B
    Nicoll, R
    14TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2003, : 292 - 296
  • [37] User-Centric Learning and Evaluation of Interactive Segmentation Systems
    Pushmeet Kohli
    Hannes Nickisch
    Carsten Rother
    Christoph Rhemann
    International Journal of Computer Vision, 2012, 100 : 261 - 274
  • [38] Developing a Mobile Learning App: A User-Centric Approach
    Adamu, Muhammad Sadi
    PROCEEDINGS OF THE FIRST AFRICAN CONFERENCE FOR HUMAN COMPUTER INTERACTION (AFRICHI'16), 2016, : 139 - 143
  • [39] User-Centric Learning and Evaluation of Interactive Segmentation Systems
    Kohli, Pushmeet
    Nickisch, Hannes
    Rother, Carsten
    Rhemann, Christoph
    INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 100 (03) : 261 - 274
  • [40] Telesuit: An Immersive User-Centric Telepresence Control Suit
    Cardenas, Irvin Steve
    Vitullo, Kelsey A.
    Park, Michelle
    Kim, Jong-Hoon
    Benitez, Margarita
    Chen, Chanjuan
    Ohrn-McDaniels, Linda
    HRI '19: 2019 14TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2019, : 654 - 655