Decentralized Federated Reinforcement Learning for User-Centric Dynamic TFDD Control

被引：7

作者：

Yin, Ziyan ^{[1
]}

Wang, Zhe ^{[2
]}

Li, Jun ^{[1
]}

Ding, Ming ^{[3
]}

Chen, Wen ^{[4
]}

Jin, Shi ^{[5
]}

机构：

[1] Nanjing Univ Sci & Technol, Sch Elect & Opt Engn, Nanjing 210094, Peoples R China

[2] Nanjing Univ Sci & Technol, Sch Comp Sci & Engn, Nanjing 210094, Peoples R China

[3] CSIRO, Data61, Sydney, NSW 2015, Australia

[4] Shanghai Jiao Tong Univ, Dept Elect Engn, Shanghai 200240, Peoples R China

[5] Southeast Univ, Natl Mobile Commun Res Lab, Nanjing 210096, Peoples R China

来源：

IEEE JOURNAL OF SELECTED TOPICS IN SIGNAL PROCESSING | 2023年 / 17卷 / 01期

基金：

中国国家自然科学基金;

关键词：

Heuristic algorithms; Resource management; Quality of service; Time-frequency analysis; Interference; Fading channels; Dynamic scheduling; Dynamic TFDD; decentralized partially observable Markov decision process; federated learning; multi-agent reinforcement learning; resource allocation; NETWORKS; OPTIMIZATION; MANAGEMENT; ALLOCATION; SYSTEMS; 5G;

D O I：

10.1109/JSTSP.2022.3221671

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

The explosive growth of dynamic and heterogeneous data traffic brings great challenges for 5G and beyond mobile networks. To enhance the network capacity and reliability, we propose a learning-based dynamic time-frequency division duplexing (D-TFDD) scheme that adaptively allocates the uplink and downlink time-frequency resources of base stations (BSs) to meet the asymmetric and heterogeneous traffic demands while alleviating the inter-cell interference. We formulate the problem as a decentralized partially observable Markov decision process (Dec-POMDP) that maximizes the long-term expected sum rate under the users' packet dropping ratio constraints. In order to jointly optimize the global resources in a decentralized manner, we propose a federated reinforcement learning (RL) algorithm named federated Wolpertinger deep deterministic policy gradient (FWDDPG) algorithm. The BSs decide their local time-frequency configurations through RL algorithms and achieve global training via exchanging local RL models with their neighbors under a decentralized federated learning framework. Specifically, to deal with the large-scale discrete action space of each BS, we adopt a DDPG-based algorithm to generate actions in a continuous space, and then utilize Wolpertinger policy to reduce the mapping errors from continuous action space back to discrete action space. Simulation results demonstrate the superiority of our proposed algorithm to the benchmark algorithms with respect to system sum rate.

引用

页码：40 / 53

页数：14

共 50 条

[31] Dynamic and user-centric network selection in heterogeneous networks
Cai, Xuejun
Chen, Ling
Sofia, Rute
Wu, Yanqi
2007 IEEE INTERNATIONAL PERFORMANCE COMPUTING AND COMMUNICATIONS CONFERENCE, VOLS 1 AND 2, 2007, : 538 - 544
[32] User-centric AP Clustering with Deep Reinforcement Learning for Cell-Free Massive MIMO
Tsukamoto, Yu
Ikami, Akio
Aihara, Naoki
Murakami, Takahide
Shinbo, Hiroyuki
Amano, Yoshiaki
PROCEEDINGS OF THE INT'L ACM SYMPOSIUM ON MOBILITY MANAGEMENT AND WIRELESS ACCESS, MOBIWAC 2023, 2023, : 17 - 24
[33] Dynamic User-Centric Clustered Workplaces for COVID-19 Control Measures Based on Geofencing and Deep Learning
Abd El-Haleem, Ahmed M.
Mohamed, Noor El-Deen M.
Abdelhakam, Mostafa M.
Elmesalawy, Mahmoud M.
PROCEEDINGS OF 2022 7TH INTERNATIONAL CONFERENCE ON MACHINE LEARNING TECHNOLOGIES, ICMLT 2022, 2022, : 230 - 236
[34] User-Centric Spectrum Sharing in Dynamic Network Architecture
Shafigh, Alireza Shams
Glisic, Savo
2016 IEEE GLOBECOM WORKSHOPS (GC WKSHPS), 2016,
[35] A dynamic QoE routing system for user-centric applications
Hai Anh Tran
Mellouk, Abdelhamid
Hoceini, Said
Augustin, Brice
Zeadally, Sherali
TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2013, 24 (03): : 266 - 279
[36] MMS: A user-centric portal for e-learning
Allison, C
Bain, A
Ling, B
Nicoll, R
14TH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2003, : 292 - 296
[37] User-Centric Learning and Evaluation of Interactive Segmentation Systems
Pushmeet Kohli
Hannes Nickisch
Carsten Rother
Christoph Rhemann
International Journal of Computer Vision, 2012, 100 : 261 - 274
[38] Developing a Mobile Learning App: A User-Centric Approach
Adamu, Muhammad Sadi
PROCEEDINGS OF THE FIRST AFRICAN CONFERENCE FOR HUMAN COMPUTER INTERACTION (AFRICHI'16), 2016, : 139 - 143
[39] User-Centric Learning and Evaluation of Interactive Segmentation Systems
Kohli, Pushmeet
Nickisch, Hannes
Rother, Carsten
Rhemann, Christoph
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2012, 100 (03) : 261 - 274
[40] Telesuit: An Immersive User-Centric Telepresence Control Suit
Cardenas, Irvin Steve
Vitullo, Kelsey A.
Park, Michelle
Kim, Jong-Hoon
Benitez, Margarita
Chen, Chanjuan
Ohrn-McDaniels, Linda
HRI '19: 2019 14TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2019, : 654 - 655

← 1 2 3 4 5 →