Attention-Weighted Federated Deep Reinforcement Learning for Device-to-Device Assisted Heterogeneous Collaborative Edge Caching

被引：107

作者：

Wang, Xiaofei ^{[1
]}

Li, Ruibin ^{[1
]}

Wang, Chenyang ^{[1
]}

Li, Xiuhua ^{[2
,3
]}

Taleb, Tarik ^{[4
,5
,6
]}

Leung, Victor C. M. ^{[7
,8
]}

机构：

[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300072, Peoples R China

[2] Chongqing Univ, Minist Educ, Key Lab Dependable Serv Comp Cyber Phys Soc, Chongqing, Peoples R China

[3] Chongqing Univ, Sch Big Data & Software Engn, Chongqing 401331, Peoples R China

[4] Aalto Univ, Sch Elect Engn, Dept Commun & Networking, Espoo 02150, Finland

[5] Oulu Univ, Dept Informat Technol & Elect Engn, Oulu 90570, Finland

[6] Sejong Univ, Dept Comp & Informat Secur, Seoul 05006, South Korea

[7] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China

[8] Univ British Columbia, Dept Elect & Comp Engn, Vancouver, BC V6T 1Z4, Canada

来源：

IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS | 2021年 / 39卷 / 01期

基金：

芬兰科学院; 加拿大自然科学与工程研究理事会;

关键词：

Device-to-device communication; Data models; Collaboration; Servers; Delays; Computational modeling; Training; Edge caching; device to device; attention-weighted federated learning; deep reinforcement learning; INTERNET;

D O I：

10.1109/JSAC.2020.3036946

中图分类号：

TM [电工技术]; TN [电子技术、通信技术];

学科分类号：

0808 ; 0809 ;

摘要：

In order to meet the growing demands for multimedia service access and release the pressure of the core network, edge caching and device-to-device (D2D) communication have been regarded as two promising techniques in next generation mobile networks and beyond. However, most existing related studies lack consideration of effective cooperation and adaptability to the dynamic network environments. In this article, based on the flexible trilateral cooperation among user equipment, edge base stations and a cloud server, we propose a D2D-assisted heterogeneous collaborative edge caching framework by jointly optimizing the node selection and cache replacement in mobile networks. We formulate the joint optimization problem as a Markov decision process, and use a deep Q-learning network to solve the long-term mixed integer linear programming problem. We further design an attention-weighted federated deep reinforcement learning (AWFDRL) model that uses federated learning to improve the training efficiency of the Q-learning network by considering the limited computing and storage capacity, and incorporates an attention mechanism to optimize the aggregation weights to avoid the imbalance of local model quality. We prove the convergence of the corresponding algorithm, and present simulation results to show the effectiveness of the proposed AWFDRL framework in reducing average delay of content access, improving hit rate and offloading traffic.

引用

页码：154 / 169

页数：16

共 50 条

[41] Federated Reinforcement Learning Based on Multi-head Attention Mechanism for Vehicle Edge Caching
Li, XinRan
Wei, ZhenChun
Lyu, ZengWei
Yuan, XiaoHui
Xu, Juan
Zhang, ZeYu
WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, PT III, 2022, 13473 : 648 - 656
[42] On-Device Indoor Positioning: A Federated Reinforcement Learning Approach With Heterogeneous Devices
Dou, Fei
Lu, Jin
Zhu, Tan
Bi, Jinbo
IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (03) : 3909 - 3926
[43] An efficient Deep reinforcement learning with extended Kalman filter for device-to-device communication underlaying cellular network
Khuntia, Pratap
Hazra, Ranjay
TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2019, 30 (09)
[44] Collaborative Edge Caching and Transcoding for 360° Video Streaming Based on Deep Reinforcement Learning
Yang, Taoyu
Tan, Zengjie
Xu, Yuanyuan
Cai, Shuwen
IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (24) : 25551 - 25564
[45] Deep Reinforcement Learning Based Collaborative Mobile Edge Caching for Omnidirectional Video Streaming
Tan, Zengjie
Xu, Yuanyuan
WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2021, PT I, 2021, 12937 : 453 - 464
[46] Device Association for RAN Slicing Based on Hybrid Federated Deep Reinforcement Learning
Liu, Yi-Jing
Feng, Gang
Sun, Yao
Qin, Shuang
Liang, Ying-Chang
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (12) : 15731 - 15745
[47] Antenna Beamwidth Optimization in Directional Device-to-Device Communication Using Multi-Agent Deep Reinforcement Learning
Bahadori, Niloofar
Nabil, Mahmoud
Homaifar, Abdollah
IEEE ACCESS, 2021, 9 : 110601 - 110613
[48] Deadline-Aware Cache Placement Scheme Using Fuzzy Reinforcement Learning in Device-to-Device Mobile Edge Networks
Manoj Kumar Somesula
Anusha Kotte
Sudarshan Chakravarthy Annadanam
Sai Krishna Mothku
Mobile Networks and Applications, 2022, 27 : 2100 - 2117
[49] Deadline-Aware Cache Placement Scheme Using Fuzzy Reinforcement Learning in Device-to-Device Mobile Edge Networks
Somesula, Manoj Kumar
Kotte, Anusha
Annadanam, Sudarshan Chakravarthy
Mothku, Sai Krishna
MOBILE NETWORKS & APPLICATIONS, 2022, 27 (05): : 2100 - 2117
[50] Collaborative Task Offloading Based on Deep Reinforcement Learning in Heterogeneous Edge Networks
Du, Yupeng
Huang, Zhenglei
Yang, Shujie
Xiao, Han
20TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC 2024, 2024, : 375 - 380

← 1 2 3 4 5 →