Federated Reinforcement Learning with Environment Heterogeneity

被引:0
|
作者
Jin, Hao [1 ]
Peng, Yang [1 ]
Yang, Wenhao [1 ]
Wang, Shusen [2 ]
Zhang, Zhihua [1 ]
机构
[1] Peking Univ, Beijing, Peoples R China
[2] Xiaohongshu Inc, Shanghai, Peoples R China
来源
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151 | 2022年 / 151卷
基金
北京市自然科学基金;
关键词
GAME; GO;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We study a Federated Reinforcement Learning (FedRL) problem in which n agents collaboratively learn a single policy without sharing the trajectories they collected during agent-environment interaction. We stress the constraint of environment heterogeneity, which means n environments corresponding to these n agents have different state transitions. To obtain a value function or a policy function which optimizes the overall performance in all environments, we propose two federated RL algorithms, QAvg and PAvg. We theoretically prove that these algorithms converge to suboptimal solutions, while such sub-optimality depends on how heterogeneous these n environments are. Moreover, we propose a heuristic that achieves personalization by embedding the n environments into n vectors. The personalization heuristic not only improves the training but also allows for better generalization to new environments.
引用
收藏
页码:18 / 37
页数:20
相关论文
共 50 条
  • [31] Byzantine-Robust Aggregation for Federated Learning with Reinforcement Learning
    Yan, Sizheng
    Du, Junping
    Xue, Zhe
    Li, Ang
    WEB AND BIG DATA, APWEB-WAIM 2024, PT IV, 2024, 14964 : 152 - 166
  • [32] Local Learning Matters: Rethinking Data Heterogeneity in Federated Learning
    Mendieta, Matias
    Yang, Taojiannan
    Wang, Pu
    Lee, Minwoo
    Ding, Zhengming
    Chen, Chen
    2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8387 - 8396
  • [33] Jointly optimizing resource and heterogeneity in IoT networks using a Three-Stage Asynchronous Federated Reinforcement Learning
    Sagar, A. S. M. Sharifuzzaman
    Chen, Yu
    Rob, Md. Abdur
    Kim, Hyung Seok
    INTERNET OF THINGS, 2024, 27
  • [34] Clustered Federated Learning in Heterogeneous Environment
    Yan, Yihan
    Tong, Xiaojun
    Wang, Shen
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (09) : 12796 - 12809
  • [35] Assisted driving system based on federated reinforcement learning
    Tang, Xiaolan
    Liang, Yuting
    Wang, Guan
    Chen, Wenlong
    DISPLAYS, 2023, 80
  • [36] A Selective Federated Reinforcement Learning Strategy for Autonomous Driving
    Fu, Yuchuan
    Li, Changle
    Yu, F. Richard
    Luan, Tom H.
    Zhang, Yao
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (02) : 1655 - 1668
  • [37] FedRLChain: Secure Federated Deep Reinforcement Learning With Blockchain
    Chowdhury, Sujit
    Mukherjee, Arnab
    Halder, Raju
    IEEE TRANSACTIONS ON SERVICES COMPUTING, 2023, 16 (06) : 3865 - 3878
  • [38] Federated Offline Reinforcement Learning with Proximal Policy Evaluation
    Sheng YUE
    Yongheng DENG
    Guanbo WANG
    Ju REN
    Yaoxue ZHANG
    Chinese Journal of Electronics, 2024, 33 (06) : 1360 - 1372
  • [39] Federated Reinforcement Learning for Collective Navigation of Robotic Swarms
    Na, Seongin
    Roucek, Tomas
    Ulrich, Jiri
    Pikman, Jan
    Krajnik, Tomas
    Lennox, Barry
    Arvin, Farshad
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (04) : 2122 - 2131
  • [40] Momentum-Based Contextual Federated Reinforcement Learning
    Yue, Sheng
    Hua, Xingyuan
    Deng, Yongheng
    Chen, Lili
    Ren, Ju
    Zhang, Yaoxue
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2024,