Byzantine-Robust Online and Offline Distributed Reinforcement Learning

Cited by: 0
Authors
Chen, Yiding [1]
Zhang, Xuezhou [2]
Zhang, Kaiqing [3]
Wang, Mengdi [2]
Zhu, Xiaojin [1]
Affiliations
[1] Univ Wisconsin Madison, Madison, WI 53707 USA
[2] Princeton Univ, Princeton, NJ USA
[3] Univ Maryland College Pk, College Pk, MD USA
Keywords
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We consider a distributed reinforcement learning setting where multiple agents separately explore the environment and communicate their experiences through a central server. However, a fraction of the agents are adversarial and can report arbitrary fake information. Critically, these adversarial agents can collude, and their fake data can be of any size. We aim to robustly identify a near-optimal policy for the underlying Markov decision process in the presence of these adversarial agents. Our main technical contribution is COW, a novel algorithm for the robust mean estimation from batches problem that can handle arbitrary batch sizes. Building upon this new estimator, in the offline setting, we design a Byzantine-robust distributed pessimistic value iteration algorithm; in the online setting, we design a Byzantine-robust distributed optimistic value iteration algorithm. Both algorithms obtain near-optimal sample complexities and achieve stronger robustness guarantees than prior work.
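To make the "robust mean estimation from batches" problem concrete, the following minimal sketch contrasts a naive pooled mean with a simple median of batch means when one adversarial agent reports a very large fake batch. This is not the COW algorithm from the abstract, whose details are not given in this record; the function names, the data, and the choice of the median as a stand-in robust estimator are all assumptions made for illustration.

```python
from statistics import median


def pooled_mean(batches):
    """Naive estimator: average every reported sample, so large batches dominate."""
    total = sum(sum(batch) for batch in batches)
    count = sum(len(batch) for batch in batches)
    return total / count


def median_of_batch_means(batches):
    """Illustrative robust baseline (NOT COW): median of the per-batch means.

    Each batch gets one vote regardless of its size, so a single huge
    fake batch cannot drag the estimate arbitrarily far.
    """
    return median(sum(batch) / len(batch) for batch in batches)


# Hypothetical data: four honest agents sampling around a true mean of 1.0,
# with different batch sizes.
honest = [
    [0.9, 1.1, 1.0],
    [1.2, 0.8],
    [1.0, 1.0, 1.1, 0.9],
    [0.95, 1.05],
]
# One adversarial agent reports a large batch of fake values.
adversarial = [[100.0] * 1000]

batches = honest + adversarial
naive = pooled_mean(batches)
robust = median_of_batch_means(batches)

print(f"pooled mean:           {naive:.2f}")
print(f"median of batch means: {robust:.2f}")
```

The pooled mean is pulled close to the fake value because the adversarial batch outnumbers all honest samples, while the median of batch means stays near the honest mean. The median baseline discards batch-size information entirely, which is one reason an estimator designed for arbitrary batch sizes, as the abstract describes, may be preferable.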
Pages: 40
Related papers
50 in total
  • [21] Byzantine-robust and efficient distributed sparsity learning: a surrogate composite quantile regression approach
    Chen, Canyi
    Zhu, Zhengtian
    STATISTICS AND COMPUTING, 2024, 34 (05)
  • [22] RSA: Byzantine-Robust Stochastic Aggregation Methods for Distributed Learning from Heterogeneous Datasets
    Li, Liping
    Xu, Wei
    Chen, Tianyi
    Giannakis, Georgios B.
    Ling, Qing
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 1544 - 1551
  • [23] FedSuper: A Byzantine-Robust Federated Learning Under Supervision
    Zhao, Ping
    Jiang, Jin
    Zhang, Guanglin
    ACM TRANSACTIONS ON SENSOR NETWORKS, 2024, 20 (02)
  • [24] Byzantine-robust federated learning with ensemble incentive mechanism
    Zhao, Shihai
    Pu, Juncheng
    Fu, Xiaodong
    Liu, Li
    Dai, Fei
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2024, 159 : 272 - 283
  • [25] CareFL: Contribution Guided Byzantine-Robust Federated Learning
    Dong, Qihao
    Yang, Shengyuan
    Dai, Zhiyang
    Gao, Yansong
    Wang, Shang
    Cao, Yuan
    Fu, Anmin
    Susilo, Willy
    IEEE Transactions on Information Forensics and Security, 2024, 19 : 9714 - 9729
  • [26] Privacy-preserving Byzantine-robust federated learning
    Ma, Xu
    Zhou, Yuqing
    Wang, Laihua
    Miao, Meixia
    COMPUTER STANDARDS & INTERFACES, 2022, 80
  • [27] Towards Federated Learning with Byzantine-Robust Client Weighting
    Portnoy, Amit
    Tirosh, Yoav
    Hendler, Danny
    APPLIED SCIENCES-BASEL, 2022, 12 (17)
  • [28] Privacy-Preserving and Byzantine-Robust Federated Learning
    Dong, Caiqin
    Weng, Jian
    Li, Ming
    Liu, Jia-Nan
    Liu, Zhiquan
    Cheng, Yudan
    Yu, Shui
    IEEE TRANSACTIONS ON DEPENDABLE AND SECURE COMPUTING, 2024, 21 (02) : 889 - 904
  • [29] BOBA: Byzantine-Robust Federated Learning with Label Skewness
    Bao, Wenxuan
    Wu, Jun
    He, Jingrui
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [30] Byzantine-Robust Federated Learning through Dynamic Clustering
    Wang, Hanyu
    Wang, Liming
    Li, Hongjia
    2023 IEEE 22ND INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS, TRUSTCOM, BIGDATASE, CSE, EUC, ISCI 2023, 2024, : 222 - 230