Byzantine-Robust Online and Offline Distributed Reinforcement Learning

被引:0
|
作者
Chen, Yiding [1 ]
Zhang, Xuezhou [2 ]
Zhang, Kaiqing [3 ]
Wang, Mengdi [2 ]
Zhu, Xiaojin [1 ]
机构
[1] Univ Wisconsin Madison, Madison, WI 53707 USA
[2] Princeton Univ, Princeton, NJ USA
[3] Univ Maryland College Pk, College Pk, MD USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We consider a distributed reinforcement learning setting where multiple agents separately explore the environment and communicate their experiences through a central server. However, afraction of agents are adversarial and can report arbitrary fake information. Critically, these adversarial agents can collude and their fake data can be of any sizes. We desire to robustly identify a near-optimal policy for the underlying Markov decision process in the presence of these adversarial agents. Our main technical contribution is COW, a novel algorithm for the robust mean estimation from batches problem, that can handle arbitrary batch sizes. Building upon this new estimator, in the offline setting, we design a Byzantine-robust distributed pessimistic value iteration algorithm; in the online setting, we design a Byzantine-robust distributed optimistic value iteration algorithm. Both algorithms obtain near-optimal sample complexities and achieve superior robustness guarantee than prior works.
引用
收藏
页数:40
相关论文
共 50 条
  • [1] Byzantine-Robust Distributed Learning With Compression
    Zhu, Heng
    Ling, Qing
    IEEE TRANSACTIONS ON SIGNAL AND INFORMATION PROCESSING OVER NETWORKS, 2023, 9 : 280 - 294
  • [2] Byzantine-Robust Aggregation for Federated Learning with Reinforcement Learning
    Yan, Sizheng
    Du, Junping
    Xue, Zhe
    Li, Ang
    WEB AND BIG DATA, APWEB-WAIM 2024, PT IV, 2024, 14964 : 152 - 166
  • [3] STOCHASTIC ADMM FOR BYZANTINE-ROBUST DISTRIBUTED LEARNING
    Lin, Feng
    Ling, Qing
    Li, Weiyu
    Xiong, Zhiwei
    2020 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, 2020, : 3172 - 3176
  • [4] Byzantine-Robust Distributed Online Learning: Taming Adversarial Participants in An Adversarial Environment
    Dong, Xingrong
    Wu, Zhaoxian
    Ling, Qing
    Tian, Zhi
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2024, 72 : 235 - 248
  • [5] Communication-Efficient and Byzantine-Robust Distributed Learning
    Ghosh, Avishek
    Maity, Raj Kumar
    Kadhe, Swanand
    Mazumdar, Arya
    Ramchandran, Kannan
    2020 INFORMATION THEORY AND APPLICATIONS WORKSHOP (ITA), 2020,
  • [6] Byzantine-Robust Distributed Learning: Towards Optimal Statistical Rates
    Yin, Dong
    Chen, Yudong
    Ramchandran, Kannan
    Bartlett, Peter
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
  • [7] Byzantine-robust distributed sparse learning for M-estimation
    Jiyuan Tu
    Weidong Liu
    Xiaojun Mao
    Machine Learning, 2023, 112 : 3773 - 3804
  • [8] Byzantine-robust distributed sparse learning for M-estimation
    Tu, Jiyuan
    Liu, Weidong
    Mao, Xiaojun
    MACHINE LEARNING, 2023, 112 (10) : 3773 - 3804
  • [9] Stochastic alternating direction method of multipliers for Byzantine-robust distributed learning
    Lin, Feng
    Li, Weiyu
    Ling, Qing
    SIGNAL PROCESSING, 2022, 195
  • [10] Communication-Efficient and Byzantine-Robust Distributed Learning with Error Feedback
    Ghosh A.
    Maity R.K.
    Kadhe S.
    Mazumdar A.
    Ramchandran K.
    IEEE Journal on Selected Areas in Information Theory, 2021, 2 (03): : 942 - 953