Federated Reinforcement Learning with Environment Heterogeneity

被引：0

作者：

Jin, Hao ^{[1
]}

Peng, Yang ^{[1
]}

Yang, Wenhao ^{[1
]}

Wang, Shusen ^{[2
]}

Zhang, Zhihua ^{[1
]}

机构：

[1] Peking Univ, Beijing, Peoples R China

[2] Xiaohongshu Inc, Shanghai, Peoples R China

来源：

INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151 | 2022年 / 151卷

基金：

北京市自然科学基金;

关键词：

GAME; GO;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We study a Federated Reinforcement Learning (FedRL) problem in which n agents collaboratively learn a single policy without sharing the trajectories they collected during agent-environment interaction. We stress the constraint of environment heterogeneity, which means n environments corresponding to these n agents have different state transitions. To obtain a value function or a policy function which optimizes the overall performance in all environments, we propose two federated RL algorithms, QAvg and PAvg. We theoretically prove that these algorithms converge to suboptimal solutions, while such sub-optimality depends on how heterogeneous these n environments are. Moreover, we propose a heuristic that achieves personalization by embedding the n environments into n vectors. The personalization heuristic not only improves the training but also allows for better generalization to new environments.

引用

页码：18 / 37

页数：20

共 50 条

[31] Byzantine-Robust Aggregation for Federated Learning with Reinforcement Learning
Yan, Sizheng
Du, Junping
Xue, Zhe
Li, Ang
WEB AND BIG DATA, APWEB-WAIM 2024, PT IV, 2024, 14964 : 152 - 166
[32] Local Learning Matters: Rethinking Data Heterogeneity in Federated Learning
Mendieta, Matias
Yang, Taojiannan
Wang, Pu
Lee, Minwoo
Ding, Zhengming
Chen, Chen
2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8387 - 8396
[33] Jointly optimizing resource and heterogeneity in IoT networks using a Three-Stage Asynchronous Federated Reinforcement Learning
Sagar, A. S. M. Sharifuzzaman
Chen, Yu
Rob, Md. Abdur
Kim, Hyung Seok
INTERNET OF THINGS, 2024, 27
[34] Clustered Federated Learning in Heterogeneous Environment
Yan, Yihan
Tong, Xiaojun
Wang, Shen
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (09) : 12796 - 12809
[35] Assisted driving system based on federated reinforcement learning
Tang, Xiaolan
Liang, Yuting
Wang, Guan
Chen, Wenlong
DISPLAYS, 2023, 80
[36] A Selective Federated Reinforcement Learning Strategy for Autonomous Driving
Fu, Yuchuan
Li, Changle
Yu, F. Richard
Luan, Tom H.
Zhang, Yao
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (02) : 1655 - 1668
[37] FedRLChain: Secure Federated Deep Reinforcement Learning With Blockchain
Chowdhury, Sujit
Mukherjee, Arnab
Halder, Raju
IEEE TRANSACTIONS ON SERVICES COMPUTING, 2023, 16 (06) : 3865 - 3878
[38] Federated Offline Reinforcement Learning with Proximal Policy Evaluation
Sheng YUE
Yongheng DENG
Guanbo WANG
Ju REN
Yaoxue ZHANG
Chinese Journal of Electronics, 2024, 33 (06) : 1360 - 1372
[39] Federated Reinforcement Learning for Collective Navigation of Robotic Swarms
Na, Seongin
Roucek, Tomas
Ulrich, Jiri
Pikman, Jan
Krajnik, Tomas
Lennox, Barry
Arvin, Farshad
IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2023, 15 (04) : 2122 - 2131
[40] Momentum-Based Contextual Federated Reinforcement Learning
Yue, Sheng
Hua, Xingyuan
Deng, Yongheng
Chen, Lili
Ren, Ju
Zhang, Yaoxue
IEEE-ACM TRANSACTIONS ON NETWORKING, 2024,

← 1 2 3 4 5 →