Robust and private stochastic linear bandits

Cited by: 0
Authors
Charisopoulos, Vasileios [1]
Esfandiari, Hossein [2]
Mirrokni, Vahab [2]
Affiliations
[1] Cornell Univ, Operat Res Informat Engn, Ithaca, NY USA
[2] Google Res, Mountain View, CA 94043 USA
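Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract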
In this paper, we study the stochastic linear bandit problem under the additional requirements of differential privacy, robustness and batched observations. In particular, we assume an adversary randomly chooses a constant fraction of the observed rewards in each batch, replacing them with arbitrary numbers. We present differentially private and robust variants of the arm elimination algorithm using logarithmic batch queries under two privacy models and provide regret bounds in both settings. In the first model, every reward in each round is reported by a potentially different client, which reduces to standard local differential privacy (LDP). In the second model, every action is "owned" by a different client, who may aggregate the rewards over multiple queries and privatize the aggregate response instead. To the best of our knowledge, our algorithms are the first simultaneously providing differential privacy and adversarial robustness in the stochastic linear bandits problem.
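The interaction between local privacy noise and corrupted rewards described in the abstract can be illustrated with a small simulation. The sketch below is not the paper's algorithm: it assumes rewards clipped to [-1, 1], uses the standard Laplace mechanism for local differential privacy, and swaps in the sample median as a generic robust aggregator for a single arm in a single batch. All function names, the corruption value, and the parameter choices are hypothetical.

```python
import random
import statistics


def privatize(reward, epsilon, rng):
    """Clip a reward to [-1, 1] and add Laplace noise (assumed epsilon-LDP report)."""
    clipped = max(-1.0, min(1.0, reward))
    scale = 2.0 / epsilon  # a value in [-1, 1] has sensitivity 2
    # The difference of two i.i.d. exponentials with mean `scale` is Laplace(0, scale).
    noise = rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)
    return clipped + noise


def corrupt(reports, fraction, value, rng):
    """Replace a randomly chosen fraction of the reports with an arbitrary value."""
    reports = list(reports)
    k = int(fraction * len(reports))
    for i in rng.sample(range(len(reports)), k):
        reports[i] = value
    return reports


rng = random.Random(0)
true_mean = 0.3  # hypothetical mean reward of one arm

# One batch: each client reports a privatized reward for the same arm.
batch = [
    privatize(true_mean + rng.gauss(0.0, 0.1), epsilon=1.0, rng=rng)
    for _ in range(5000)
]

# Adversary overwrites 5% of the batch with an arbitrary number.
batch = corrupt(batch, fraction=0.05, value=1e6, rng=rng)

naive = statistics.fmean(batch)    # dominated by the corrupted reports
robust = statistics.median(batch)  # stays near the true mean, with some bias
```

In this toy setting the plain average is pulled far from the true mean by the corrupted reports, while the median stays close to it, at the cost of a small bias that grows with the corruption fraction and the noise scale.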
Pages: 19
Related papers
50 in total
  • [1] Stochastic Linear Bandits Robust to Adversarial Attacks
    Bogunovic, Ilija
    Losalka, Arpan
    Krause, Andreas
    Scarlett, Jonathan
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [2] Differentially Private Stochastic Linear Bandits: (Almost) for Free
    Hanna O.
    Girgis A.M.
    Fragouli C.
    Diggavi S.
    IEEE Journal on Selected Areas in Information Theory, 2024, 5: 135-147
  • [3] On Private and Robust Bandits
    Wu, Yulian
    Zhou, Xingyu
    Tao, Youming
    Wang, Di
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
  • [4] Robust Stochastic Linear Contextual Bandits Under Adversarial Attacks
    Ding, Qin
    Hsieh, Cho-Jui
    Sharpnack, James
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
  • [5] Safe Linear Stochastic Bandits
    Khezeli, Kia
    Bitar, Eilyan
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34: 10202-10209
  • [6] Stochastic Bandits with Linear Constraints
    Pacchiano, Aldo
    Ghavamzadeh, Mohammad
    Bartlett, Peter
    Jiang, Heinrich
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [7] Stochastic Bandits Robust to Adversarial Corruptions
    Lykouris, Thodoris
    Mirrokni, Vahab
    Leme, Renato Paes
    STOC'18: PROCEEDINGS OF THE 50TH ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING, 2018: 114-122
  • [8] Shuffle Private Linear Contextual Bandits
    Chowdhury, Sayak Ray
    Zhou, Xingyu
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022
  • [9] Differentially Private Contextual Linear Bandits
    Shariff, Roshan
    Sheffet, Or
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [10] Stochastic Conservative Contextual Linear Bandits
    Lin, Jiabin
    Lee, Xian Yeow
    Jubery, Talukder
    Moothedath, Shana
    Sarkar, Soumik
    Ganapathysubramanian, Baskar
    2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022: 7321-7326