Decentralized Federated Reinforcement Learning for User-Centric Dynamic TFDD Control

被引:7
|
作者
Yin, Ziyan [1 ]
Wang, Zhe [2 ]
Li, Jun [1 ]
Ding, Ming [3 ]
Chen, Wen [4 ]
Jin, Shi [5 ]
机构
[1] Nanjing Univ Sci & Technol, Sch Elect & Opt Engn, Nanjing 210094, Peoples R China
[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China
[3] CSIRO, Data61, Sydney, NSW 2015, Australia
[4] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China
[5] Southeast Univ, Natl Mobile Commun Res Lab, Nanjing 210096, Peoples R China
基金
中国国家自然科学基金;
关键词
Heuristic algorithms; Resource management; Quality of service; Time-frequency analysis; Interference; Fading channels; Dynamic scheduling; Dynamic TFDD; decentralized partially observable Markov decision process; federated learning; multi-agent reinforcement learning; resource allocation; NETWORKS; OPTIMIZATION; MANAGEMENT; ALLOCATION; SYSTEMS; 5G;
D O I
10.1109/JSTSP.2022.3221671
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The explosive growth of dynamic and heterogeneous data traffic brings great challenges for 5G and beyond mobile networks. To enhance the network capacity and reliability, we propose a learning-based dynamic time-frequency division duplexing (D-TFDD) scheme that adaptively allocates the uplink and downlink time-frequency resources of base stations (BSs) to meet the asymmetric and heterogeneous traffic demands while alleviating the inter-cell interference. We formulate the problem as a decentralized partially observable Markov decision process (Dec-POMDP) that maximizes the long-term expected sum rate under the users' packet dropping ratio constraints. In order to jointly optimize the global resources in a decentralized manner, we propose a federated reinforcement learning (RL) algorithm named federated Wolpertinger deep deterministic policy gradient (FWDDPG) algorithm. The BSs decide their local time-frequency configurations through RL algorithms and achieve global training via exchanging local RL models with their neighbors under a decentralized federated learning framework. Specifically, to deal with the large-scale discrete action space of each BS, we adopt a DDPG-based algorithm to generate actions in a continuous space, and then utilize Wolpertinger policy to reduce the mapping errors from continuous action space back to discrete action space. Simulation results demonstrate the superiority of our proposed algorithm to the benchmark algorithms with respect to system sum rate.
引用
收藏
页码:40 / 53
页数:14
相关论文
共 50 条
  • [1] FeLebrities: A User-Centric Assessment of Federated Learning Frameworks
    Riviera, Walter
    Galazzo, Ilaria Boscolo
    Menegaz, Gloria
    IEEE ACCESS, 2023, 11 : 96865 - 96878
  • [2] A User-Centric Privacy Control Framework for Decentralized IoT Platforms
    Tao, Xu
    Zhao, Rui
    Chehida, Salim
    Conzon, Davide
    Ferrera, Enrico
    Bensalem, Saddek
    QUALITY OF INFORMATION AND COMMUNICATIONS TECHNOLOGY, QUATIC 2024, 2024, 2178 : 408 - 415
  • [3] User-Centric Federated Learning: Trading off Wireless Resources for Personalization
    Mestoukirdi, Mohamad
    Zecchin, Matteo
    Gesbert, David
    Li, Qianrui
    IEEE Transactions on Machine Learning in Communications and Networking, 2023, 1 : 346 - 359
  • [4] A USER-CENTRIC APPROACH FOR FEDERATED IDENTITY MANAGEMENT
    Bergadano, Francesco
    Accornero, Renato
    Lucisano, Giovanna
    Rispoli, Daniele
    INTERNATIONAL JOURNAL ON INFORMATION TECHNOLOGIES AND SECURITY, 2013, 5 (01): : 3 - 18
  • [5] User-Centric Green Light Optimized Speed Advisory with Reinforcement Learning
    Schlamp, Anna-Lena
    Gerner, Jeremias
    Bogenberger, Klaus
    Schmidtner, Stefanie
    2023 IEEE 26TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS, ITSC, 2023, : 3463 - 3470
  • [6] User-centric privacy management for federated identity management
    Ahn, Gail-Joon
    Ko, Moonam
    2007 INTERNATIONAL CONFERENCE ON COLLABORATIVE COMPUTING: NETWORKING, APPLICATIONS AND WORKSHARING, 2008, : 187 - 195
  • [7] A user-centric federated single sign-on system
    Suriadi, Suriadi
    Foo, Ernest
    Josang, Audun
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2009, 32 (02) : 388 - 401
  • [8] A User-centric Federated Single Sign-on System
    Suriadi, Suriadi
    Foo, Ernest
    Josang, Audun
    2007 IFIP INTERNATIONAL CONFERENCE ON NETWORK AND PARALLEL COMPUTING WORKSHOPS, PROCEEDINGS, 2007, : 99 - 106
  • [9] Dynamic user-centric access control for detection of ransomware attacks
    McIntosh, Timothy
    Kayes, A. S. M.
    Chen, Yi-Ping Phoebe
    Ng, Alex
    Watters, Paul
    COMPUTERS & SECURITY, 2021, 111
  • [10] User-centric Security and The Dynamic Enterprise
    Cosquer, F.
    ENRICHING COMMUNICATIONS, 2009, 3 (01): : 53 - 56