Attention-Weighted Federated Deep Reinforcement Learning for Device-to-Device Assisted Heterogeneous Collaborative Edge Caching

被引:107
|
作者
Wang, Xiaofei [1 ]
Li, Ruibin [1 ]
Wang, Chenyang [1 ]
Li, Xiuhua [2 ,3 ]
Taleb, Tarik [4 ,5 ,6 ]
Leung, Victor C. M. [7 ,8 ]
机构
[1] Tianjin Univ, Coll Intelligence & Comp, Tianjin 300072, Peoples R China
[2] Chongqing Univ, Minist Educ, Key Lab Dependable Serv Comp Cyber Phys Soc, Chongqing, Peoples R China
[3] Chongqing Univ, Sch Big Data & Software Engn, Chongqing 401331, Peoples R China
[4] Aalto Univ, Sch Elect Engn, Dept Commun & Networking, Espoo 02150, Finland
[5] Oulu Univ, Dept Informat Technol & Elect Engn, Oulu 90570, Finland
[6] Sejong Univ, Dept Comp & Informat Secur, Seoul 05006, South Korea
[7] Shenzhen Univ, Coll Comp Sci & Software Engn, Shenzhen 518060, Peoples R China
[8] Univ British Columbia, Dept Elect & Comp Engn, Vancouver, BC V6T 1Z4, Canada
基金
芬兰科学院; 加拿大自然科学与工程研究理事会;
关键词
Device-to-device communication; Data models; Collaboration; Servers; Delays; Computational modeling; Training; Edge caching; device to device; attention-weighted federated learning; deep reinforcement learning; INTERNET;
D O I
10.1109/JSAC.2020.3036946
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
In order to meet the growing demands for multimedia service access and release the pressure of the core network, edge caching and device-to-device (D2D) communication have been regarded as two promising techniques in next generation mobile networks and beyond. However, most existing related studies lack consideration of effective cooperation and adaptability to the dynamic network environments. In this article, based on the flexible trilateral cooperation among user equipment, edge base stations and a cloud server, we propose a D2D-assisted heterogeneous collaborative edge caching framework by jointly optimizing the node selection and cache replacement in mobile networks. We formulate the joint optimization problem as a Markov decision process, and use a deep Q-learning network to solve the long-term mixed integer linear programming problem. We further design an attention-weighted federated deep reinforcement learning (AWFDRL) model that uses federated learning to improve the training efficiency of the Q-learning network by considering the limited computing and storage capacity, and incorporates an attention mechanism to optimize the aggregation weights to avoid the imbalance of local model quality. We prove the convergence of the corresponding algorithm, and present simulation results to show the effectiveness of the proposed AWFDRL framework in reducing average delay of content access, improving hit rate and offloading traffic.
引用
收藏
页码:154 / 169
页数:16
相关论文
共 50 条
  • [41] Federated Reinforcement Learning Based on Multi-head Attention Mechanism for Vehicle Edge Caching
    Li, XinRan
    Wei, ZhenChun
    Lyu, ZengWei
    Yuan, XiaoHui
    Xu, Juan
    Zhang, ZeYu
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, PT III, 2022, 13473 : 648 - 656
  • [42] On-Device Indoor Positioning: A Federated Reinforcement Learning Approach With Heterogeneous Devices
    Dou, Fei
    Lu, Jin
    Zhu, Tan
    Bi, Jinbo
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (03) : 3909 - 3926
  • [44] Collaborative Edge Caching and Transcoding for 360° Video Streaming Based on Deep Reinforcement Learning
    Yang, Taoyu
    Tan, Zengjie
    Xu, Yuanyuan
    Cai, Shuwen
    IEEE INTERNET OF THINGS JOURNAL, 2022, 9 (24) : 25551 - 25564
  • [45] Deep Reinforcement Learning Based Collaborative Mobile Edge Caching for Omnidirectional Video Streaming
    Tan, Zengjie
    Xu, Yuanyuan
    WIRELESS ALGORITHMS, SYSTEMS, AND APPLICATIONS, WASA 2021, PT I, 2021, 12937 : 453 - 464
  • [46] Device Association for RAN Slicing Based on Hybrid Federated Deep Reinforcement Learning
    Liu, Yi-Jing
    Feng, Gang
    Sun, Yao
    Qin, Shuang
    Liang, Ying-Chang
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (12) : 15731 - 15745
  • [47] Antenna Beamwidth Optimization in Directional Device-to-Device Communication Using Multi-Agent Deep Reinforcement Learning
    Bahadori, Niloofar
    Nabil, Mahmoud
    Homaifar, Abdollah
    IEEE ACCESS, 2021, 9 : 110601 - 110613
  • [48] Deadline-Aware Cache Placement Scheme Using Fuzzy Reinforcement Learning in Device-to-Device Mobile Edge Networks
    Manoj Kumar Somesula
    Anusha Kotte
    Sudarshan Chakravarthy Annadanam
    Sai Krishna Mothku
    Mobile Networks and Applications, 2022, 27 : 2100 - 2117
  • [49] Deadline-Aware Cache Placement Scheme Using Fuzzy Reinforcement Learning in Device-to-Device Mobile Edge Networks
    Somesula, Manoj Kumar
    Kotte, Anusha
    Annadanam, Sudarshan Chakravarthy
    Mothku, Sai Krishna
    MOBILE NETWORKS & APPLICATIONS, 2022, 27 (05): : 2100 - 2117
  • [50] Collaborative Task Offloading Based on Deep Reinforcement Learning in Heterogeneous Edge Networks
    Du, Yupeng
    Huang, Zhenglei
    Yang, Shujie
    Xiao, Han
    20TH INTERNATIONAL WIRELESS COMMUNICATIONS & MOBILE COMPUTING CONFERENCE, IWCMC 2024, 2024, : 375 - 380