Robust and private stochastic linear bandits

被引:0
|
作者
Charisopoulos, Vasileios [1 ]
Esfandiari, Hossein [2 ]
Mirrokni, Vahab [2 ]
机构
[1] Cornell Univ, Operat Res Informat Engn, Ithaca, NY USA
[2] Google Res, Mountain View, CA 94043 USA
来源
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202 | 2023年 / 202卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we study the stochastic linear bandit problem under the additional requirements of differential privacy, robustness and batched observations. In particular, we assume an adversary randomly chooses a constant fraction of the observed rewards in each batch, replacing them with arbitrary numbers. We present differentially private and robust variants of the arm elimination algorithm using logarithmic batch queries under two privacy models and provide regret bounds in both settings. In the first model, every reward in each round is reported by a potentially different client, which reduces to standard local differential privacy (LDP). In the second model, every action is "owned" by a different client, who may aggregate the rewards over multiple queries and privatize the aggregate response instead. To the best of our knowledge, our algorithms are the first simultaneously providing differential privacy and adversarial robustness in the stochastic linear bandits problem.
引用
收藏
页数:19
相关论文
共 50 条
  • [41] Problem-Complexity Adaptive Model Selection for Stochastic Linear Bandits
    Ghosh, Avishek
    Sankararaman, Abishek
    Ramchandran, Kannan
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [42] Double Doubly Robust Thompson Sampling for Generalized Linear Contextual Bandits
    Kim, Wonyoung
    Lee, Kyungbok
    Paik, Myunghee Cho
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 7, 2023, : 8300 - 8307
  • [43] Stochastic Rising Bandits
    Metelli, Alberto Maria
    Trovo, Francesco
    Pirola, Matteo
    Restelli, Marcello
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [44] Stochastic Top-K Subset Bandits with Linear Space and Non-Linear Feedback
    Agarwal, Mridul
    Aggarwal, Vaneet
    Quinn, Christopher J.
    Umrawal, Abhishek K.
    ALGORITHMIC LEARNING THEORY, VOL 132, 2021, 132
  • [45] Stochastic Rank-1 Bandits Stochastic Rank-1 Bandits
    Katariya, Sumeet
    Kveton, Branislav
    Szepesvari, Csaba
    Vernade, Claire
    Wen, Zheng
    ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 54, 2017, 54 : 392 - 401
  • [46] Almost Optimal Algorithms for Linear Stochastic Bandits with Heavy-Tailed Payoffs
    Shao, Han
    Yu, Xiaotian
    King, Irwin
    Lyu, Michael R.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
  • [47] Improved Algorithms for Stochastic Linear Bandits Using Tail Bounds for Martingale Mixtures
    Flynn, Hamish
    Reeb, David
    Kandemir, Melih
    Peters, Jan
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [48] Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear Bandit Algorithms
    Hanna, Osama A.
    Yang, Lin F.
    Fragouli, Christina
    THIRTY SIXTH ANNUAL CONFERENCE ON LEARNING THEORY, VOL 195, 2023, 195
  • [49] An Efficient Pessimistic-Optimistic Algorithm for Stochastic Linear Bandits with General Constraints
    Liu, Xin
    Li, Bin
    Shi, Pengyi
    Ying, Lei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [50] Nearly Optimal Regret for Stochastic Linear Bandits with Heavy-Tailed Payoffs
    Xue, Bo
    Wang, Guanghui
    Wang, Yimu
    Zhang, Lijun
    PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2936 - 2942