Dynamic Spectrum Sharing Based on Federated Learning and Multi-Agent Actor-Critic Reinforcement Learning

Cited by: 3

Authors:

Yang, Tongtong [1]

Zhang, Wensheng [1]

Bo, Yulian [1]

Sun, Jian [1]

Wang, Cheng-Xiang [2,3]

Affiliations:

[1] Shandong Univ, Shandong Prov Key Lab Wireless Commun, Sch Informat Sci & Engn, Qingdao 266237, Peoples R China

[2] Southeast Univ, Sch Informat Sci & Engn, Natl Mobile Commun Res Lab, Nanjing 210096, Peoples R China

[3] Purple Mt Labs, Nanjing 211111, Peoples R China

Source:

2023 INTERNATIONAL WIRELESS COMMUNICATIONS AND MOBILE COMPUTING, IWCMC | 2023

Funding:

National Natural Science Foundation of China; National Key Research and Development Program of China;

Keywords:

Dynamic spectrum sharing; federated learning; deep reinforcement learning; multi-agent actor-critic algorithm; CRNs;

DOI:

10.1109/IWCMC58020.2023.10182572

Chinese Library Classification (CLC) number:

TP301 [Theory and methods];

Discipline classification code:

081202 ;

Abstract:

To improve spectrum efficiency in emergency communications, a dynamic spectrum sharing (DSS) scheme based on federated learning (FL) and deep reinforcement learning (DRL) is proposed. The operation model follows the paradigm of cognitive radio networks (CRNs), in which multiple secondary users (SUs) with different bandwidth requirements and different spectrum sensing and access capabilities randomly access idle frequency bands that primary users (PUs) do not occupy. Users in emergency communications are treated as SUs or PUs according to their communication priorities. A maximum entropy based multi-agent actor-critic (ME-MAAC) algorithm is used to realize an optimal spectrum sharing strategy by updating varying rewards to SUs. During the learning process, the FL algorithm is used to assign appropriate weights to SUs. Simulation results show that the proposed scheme performs better in terms of reward value, access rate, and convergence speed.
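The abstract states only that FL assigns weights to SUs and does not give the aggregation rule. As a rough illustration of what weighted aggregation in federated learning commonly looks like, the following is a minimal FedAvg-style sketch. The function name `federated_average`, the dictionary layout of the parameters, and the example weights are all hypothetical and are not taken from the paper.

```python
from typing import Dict, List


def federated_average(
    local_params: List[Dict[str, List[float]]],
    weights: List[float],
) -> Dict[str, List[float]]:
    """Weighted average of per-agent parameter vectors (FedAvg-style).

    Assumption: every agent (SU) reports the same parameter names with
    vectors of equal length; the weights are non-negative and are
    normalized here so that they sum to one.
    """
    if not local_params or len(local_params) != len(weights):
        raise ValueError("need one weight per agent and at least one agent")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    normalized = [w / total for w in weights]

    aggregated: Dict[str, List[float]] = {}
    for name, reference in local_params[0].items():
        # Element-wise weighted sum across all agents for this parameter.
        aggregated[name] = [
            sum(nw * params[name][i] for nw, params in zip(normalized, local_params))
            for i in range(len(reference))
        ]
    return aggregated


# Hypothetical usage: two SUs, the second weighted three times as heavily.
global_params = federated_average(
    [{"actor": [1.0, 2.0]}, {"actor": [3.0, 4.0]}],
    [1.0, 3.0],
)
```

In this sketch the aggregated vector is pulled toward the more heavily weighted agent; how the weights themselves are chosen is the part the paper addresses and is not modeled here.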

Citation

Pages: 947-952

Page count: 6

50 items in total

[41] Deployment Algorithm of Service Function Chain Based on Multi-Agent Soft Actor-Critic Learning
Tang, Lun
Li, Shirui
Du, Yucong
Chen, Qianbin
JOURNAL OF ELECTRONICS & INFORMATION TECHNOLOGY, 2023, 45 (08) : 2893 - 2901
[42] Dynamic Charging Scheme Problem With Actor-Critic Reinforcement Learning
Yang, Meiyi
Liu, Nianbo
Zuo, Lin
Feng, Yong
Liu, Minghui
Gong, Haigang
Liu, Ming
IEEE INTERNET OF THINGS JOURNAL, 2021, 8 (01) : 370 - 380
[43] Dynamic Content Caching Based on Actor-Critic Reinforcement Learning for IoT Systems
Lai, Lifeng
Zheng, Fu-Chun
Wen, Wanli
Luo, Jingjing
Li, Ge
2022 IEEE 96TH VEHICULAR TECHNOLOGY CONFERENCE (VTC2022-FALL), 2022,
[44] Spectrum Sharing in Vehicular Networks Based on Multi-Agent Reinforcement Learning
Liang, Le
Ye, Hao
Li, Geoffrey Ye
IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 2019, 37 (10) : 2282 - 2292
[45] A World Model for Actor-Critic in Reinforcement Learning
Panov, A. I.
Ugadiarov, L. A.
PATTERN RECOGNITION AND IMAGE ANALYSIS, 2023, 33 (03) : 467 - 477
[46] Curious Hierarchical Actor-Critic Reinforcement Learning
Roeder, Frank
Eppe, Manfred
Nguyen, Phuong D. H.
Wermter, Stefan
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2020, PT II, 2020, 12397 : 408 - 419
[47] Integrated Actor-Critic for Deep Reinforcement Learning
Zheng, Jiaohao
Kurt, Mehmet Necip
Wang, Xiaodong
ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2021, PT IV, 2021, 12894 : 505 - 518
[48] A fuzzy Actor-Critic reinforcement learning network
Wang, Xue-Song
Cheng, Yu-Hu
Yi, Jian-Qiang
INFORMATION SCIENCES, 2007, 177 (18) : 3764 - 3781
[49] A modified actor-critic reinforcement learning algorithm
Mustapha, SM
Lachiver, G
2000 CANADIAN CONFERENCE ON ELECTRICAL AND COMPUTER ENGINEERING, CONFERENCE PROCEEDINGS, VOLS 1 AND 2: NAVIGATING TO A NEW ERA, 2000, : 605 - 609
[50] Research on actor-critic reinforcement learning in RoboCup
Guo, He
Liu, Tianying
Wang, Yuxin
Chen, Feng
Fan, Jianming
WCICA 2006: SIXTH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-12, CONFERENCE PROCEEDINGS, 2006, : 205 - 205
