Robust and private stochastic linear bandits

被引：0

作者：

Charisopoulos, Vasileios ^{[1
]}

Esfandiari, Hossein ^{[2
]}

Mirrokni, Vahab ^{[2
]}

机构：

[1] Cornell Univ, Operat Res Informat Engn, Ithaca, NY USA

[2] Google Res, Mountain View, CA 94043 USA

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202 | 2023年 / 202卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we study the stochastic linear bandit problem under the additional requirements of differential privacy, robustness and batched observations. In particular, we assume an adversary randomly chooses a constant fraction of the observed rewards in each batch, replacing them with arbitrary numbers. We present differentially private and robust variants of the arm elimination algorithm using logarithmic batch queries under two privacy models and provide regret bounds in both settings. In the first model, every reward in each round is reported by a potentially different client, which reduces to standard local differential privacy (LDP). In the second model, every action is "owned" by a different client, who may aggregate the rewards over multiple queries and privatize the aggregate response instead. To the best of our knowledge, our algorithms are the first simultaneously providing differential privacy and adversarial robustness in the stochastic linear bandits problem.

引用

页数：19

共 50 条

[41] Problem-Complexity Adaptive Model Selection for Stochastic Linear Bandits
Ghosh, Avishek
Sankararaman, Abishek
Ramchandran, Kannan
24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
[42] Double Doubly Robust Thompson Sampling for Generalized Linear Contextual Bandits
Kim, Wonyoung
Lee, Kyungbok
Paik, Myunghee Cho
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 7, 2023, : 8300 - 8307
[43] Stochastic Rising Bandits
Metelli, Alberto Maria
Trovo, Francesco
Pirola, Matteo
Restelli, Marcello
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[44] Stochastic Top-K Subset Bandits with Linear Space and Non-Linear Feedback
Agarwal, Mridul
Aggarwal, Vaneet
Quinn, Christopher J.
Umrawal, Abhishek K.
ALGORITHMIC LEARNING THEORY, VOL 132, 2021, 132
[45] Stochastic Rank-1 Bandits Stochastic Rank-1 Bandits
Katariya, Sumeet
Kveton, Branislav
Szepesvari, Csaba
Vernade, Claire
Wen, Zheng
ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 54, 2017, 54 : 392 - 401
[46] Almost Optimal Algorithms for Linear Stochastic Bandits with Heavy-Tailed Payoffs
Shao, Han
Yu, Xiaotian
King, Irwin
Lyu, Michael R.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[47] Improved Algorithms for Stochastic Linear Bandits Using Tail Bounds for Martingale Mixtures
Flynn, Hamish
Reeb, David
Kandemir, Melih
Peters, Jan
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[48] Contexts can be Cheap: Solving Stochastic Contextual Bandits with Linear Bandit Algorithms
Hanna, Osama A.
Yang, Lin F.
Fragouli, Christina
THIRTY SIXTH ANNUAL CONFERENCE ON LEARNING THEORY, VOL 195, 2023, 195
[49] An Efficient Pessimistic-Optimistic Algorithm for Stochastic Linear Bandits with General Constraints
Liu, Xin
Li, Bin
Shi, Pengyi
Ying, Lei
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[50] Nearly Optimal Regret for Stochastic Linear Bandits with Heavy-Tailed Payoffs
Xue, Bo
Wang, Guanghui
Wang, Yimu
Zhang, Lijun
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 2936 - 2942

← 1 2 3 4 5 →