Byzantine-Robust Online and Offline Distributed Reinforcement Learning

被引：0

作者：

Chen, Yiding ^{[1
]}

Zhang, Xuezhou ^{[2
]}

Zhang, Kaiqing ^{[3
]}

Wang, Mengdi ^{[2
]}

Zhu, Xiaojin ^{[1
]}

机构：

[1] Univ Wisconsin Madison, Madison, WI 53707 USA

[2] Princeton Univ, Princeton, NJ USA

[3] Univ Maryland College Pk, College Pk, MD USA

来源：

INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206 | 2023年 / 206卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We consider a distributed reinforcement learning setting where multiple agents separately explore the environment and communicate their experiences through a central server. However, afraction of agents are adversarial and can report arbitrary fake information. Critically, these adversarial agents can collude and their fake data can be of any sizes. We desire to robustly identify a near-optimal policy for the underlying Markov decision process in the presence of these adversarial agents. Our main technical contribution is COW, a novel algorithm for the robust mean estimation from batches problem, that can handle arbitrary batch sizes. Building upon this new estimator, in the offline setting, we design a Byzantine-robust distributed pessimistic value iteration algorithm; in the online setting, we design a Byzantine-robust distributed optimistic value iteration algorithm. Both algorithms obtain near-optimal sample complexities and achieve superior robustness guarantee than prior works.

引用

页数：40

共 50 条

[1] Byzantine-Robust Distributed Learning With Compression
Zhu, Heng
Ling, Qing
IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2023, 9 : 280 - 294
[2] Byzantine-Robust Aggregation for Federated Learning with Reinforcement Learning
Yan, Sizheng
Du, Junping
Xue, Zhe
Li, Ang
WEB AND BIG DATA, APWEB-WAIM 2024, PT IV, 2024, 14964 : 152 - 166
[3] STOCHASTIC ADMM FOR BYZANTINE-ROBUST DISTRIBUTED LEARNING
Lin, Feng
Ling, Qing
Li, Weiyu
Xiong, Zhiwei
2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3172 - 3176
[4] Byzantine-Robust Distributed Online Learning: Taming Adversarial Participants in An Adversarial Environment
Dong, Xingrong
Wu, Zhaoxian
Ling, Qing
Tian, Zhi
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2024, 72 : 235 - 248
[5] Communication-Efficient and Byzantine-Robust Distributed Learning
Ghosh, Avishek
Maity, Raj Kumar
Kadhe, Swanand
Mazumdar, Arya
Ramchandran, Kannan
2020 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), 2020,
[6] Byzantine-Robust Distributed Learning: Towards Optimal Statistical Rates
Yin, Dong
Chen, Yudong
Ramchandran, Kannan
Bartlett, Peter
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
[7] Byzantine-robust distributed sparse learning for M-estimation
Jiyuan Tu
Weidong Liu
Xiaojun Mao
Machine Learning, 2023, 112 : 3773 - 3804
[8] Byzantine-robust distributed sparse learning for M-estimation
Tu, Jiyuan
Liu, Weidong
Mao, Xiaojun
MACHINE LEARNING, 2023, 112 (10) : 3773 - 3804
[9] Stochastic alternating direction method of multipliers for Byzantine-robust distributed learning
Lin, Feng
Li, Weiyu
Ling, Qing
SIGNAL PROCESSING, 2022, 195
[10] Communication-Efficient and Byzantine-Robust Distributed Learning with Error Feedback
Ghosh A.
Maity R.K.
Kadhe S.
Mazumdar A.
Ramchandran K.
IEEE Journal on Selected Areas in Information Theory, 2021, 2 (03): : 942 - 953

← 1 2 3 4 5 →