Byzantine-Robust Online and Offline Distributed Reinforcement Learning

Cited by: 0

Authors:

Chen, Yiding [1]

Zhang, Xuezhou [2]

Zhang, Kaiqing [3]

Wang, Mengdi [2]

Zhu, Xiaojin [1]

Affiliations:

[1] Univ Wisconsin Madison, Madison, WI 53707 USA

[2] Princeton Univ, Princeton, NJ USA

[3] Univ Maryland College Pk, College Pk, MD USA

Source:

INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206 | 2023 / Vol. 206

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

We consider a distributed reinforcement learning setting where multiple agents separately explore the environment and communicate their experiences through a central server. However, a fraction of the agents are adversarial and can report arbitrary fake information. Critically, these adversarial agents can collude, and their fake data can be of any size. We aim to robustly identify a near-optimal policy for the underlying Markov decision process in the presence of these adversarial agents. Our main technical contribution is COW, a novel algorithm for the robust mean estimation from batches problem that can handle arbitrary batch sizes. Building upon this new estimator, in the offline setting, we design a Byzantine-robust distributed pessimistic value iteration algorithm; in the online setting, we design a Byzantine-robust distributed optimistic value iteration algorithm. Both algorithms obtain near-optimal sample complexities and achieve stronger robustness guarantees than prior work.
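
The problem setting in the abstract (mean estimation from batches when some batches are adversarial and of arbitrary size) can be illustrated with a small sketch. This is not the COW algorithm, whose details the abstract does not give. It only contrasts a naive pooled mean with a textbook median-of-batch-means baseline. All function names, batch sizes, and the Gaussian data model are assumptions made for illustration.

```python
import random
import statistics


def pooled_mean(batches):
    """Naive estimate: average every reported sample, trusting all agents."""
    total = sum(sum(batch) for batch in batches)
    count = sum(len(batch) for batch in batches)
    return total / count


def median_of_batch_means(batches):
    """Simple robust baseline (NOT COW): median of the per-batch means.

    Each batch counts once regardless of its size, so an adversary cannot
    gain influence by reporting a very large fake batch.
    """
    return statistics.median(sum(batch) / len(batch) for batch in batches)


rng = random.Random(0)
true_mean = 1.0  # hypothetical quantity the honest agents are sampling

# Eight honest agents, each reporting a batch of a different size.
honest = [
    [rng.gauss(true_mean, 1.0) for _ in range(rng.randint(50, 200))]
    for _ in range(8)
]
# Two colluding agents report huge batches of identical fake values.
byzantine = [[100.0] * 10_000 for _ in range(2)]

batches = honest + byzantine
naive = pooled_mean(batches)
robust = median_of_batch_means(batches)
print(f"pooled mean: {naive:.2f}, median of batch means: {robust:.2f}")
```

In this sketch the pooled mean is dominated by the two oversized fake batches, while the median of batch means stays near the honest value. The baseline ignores batch sizes entirely, so it treats small and large honest batches as equally informative.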

Pages: 40
