Robust and private stochastic linear bandits

被引:0
|
作者
Charisopoulos, Vasileios [1 ]
Esfandiari, Hossein [2 ]
Mirrokni, Vahab [2 ]
机构
[1] Cornell Univ, Operat Res Informat Engn, Ithaca, NY USA
[2] Google Res, Mountain View, CA 94043 USA
来源
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202 | 2023年 / 202卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we study the stochastic linear bandit problem under the additional requirements of differential privacy, robustness and batched observations. In particular, we assume an adversary randomly chooses a constant fraction of the observed rewards in each batch, replacing them with arbitrary numbers. We present differentially private and robust variants of the arm elimination algorithm using logarithmic batch queries under two privacy models and provide regret bounds in both settings. In the first model, every reward in each round is reported by a potentially different client, which reduces to standard local differential privacy (LDP). In the second model, every action is "owned" by a different client, who may aggregate the rewards over multiple queries and privatize the aggregate response instead. To the best of our knowledge, our algorithms are the first simultaneously providing differential privacy and adversarial robustness in the stochastic linear bandits problem.
引用
收藏
页数:19
相关论文
共 50 条
  • [21] Byzantine-Robust Federated Linear Bandits
    Jadbabaie, Ali
    Li, Haochuan
    Qian, Jian
    Tian, Yi
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 5206 - 5213
  • [22] Differentially Private Linear Bandits with Partial Distributed Feedback
    Li, Fengjiao
    Zhou, Xingyu
    Ji, Bo
    2022 20TH INTERNATIONAL SYMPOSIUM ON MODELING AND OPTIMIZATION IN MOBILE, AD HOC, AND WIRELESS NETWORKS (WIOPT 2022), 2022, : 41 - 48
  • [23] Stochastic Contextual Dueling Bandits under Linear Stochastic Transitivity Models
    Bengs, Viktor
    Saha, Aadirupa
    Huellermeier, Eyke
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [24] Learning in Generalized Linear Contextual Bandits with Stochastic Delays
    Zhou, Zhengyuan
    Xu, Renyuan
    Blanchet, Jose
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [25] Perturbed-History Exploration in Stochastic Linear Bandits
    Kveton, Branislav
    Szepesvari, Csaba
    Ghavamzadeh, Mohammad
    Boutilier, Craig
    35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115 : 530 - 540
  • [26] Collaborative Multi-agent Stochastic Linear Bandits
    Moradipari, Ahmadreza
    Ghavamzadeh, Mohammad
    Alizadeh, Mahnoosh
    2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 2761 - 2766
  • [27] Quantum Multi-Armed Bandits and Stochastic Linear Bandits Enjoy Logarithmic Regrets
    Wan, Zongqi
    Zhang, Zhijie
    Li, Tongyang
    Zhang, Jialin
    Sun, Xiaoming
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 10087 - 10094
  • [28] Multi-agent Heterogeneous Stochastic Linear Bandits
    Ghosh, Avishek
    Sankararaman, Abishek
    Ramchandran, Kannan
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT IV, 2023, 13716 : 300 - 316
  • [29] Leveraging Initial Hints for Free in Stochastic Linear Bandits
    Cutkosky, Ashok
    Dann, Chris
    Das, Abhimanyu
    Zhang, Qiuyi
    INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022, 167
  • [30] Robust Pure Exploration in Linear Bandits with Limited Budget
    Alieva, Ayya
    Cutkosky, Ashok
    Das, Abhimanyu
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139