Robust and private stochastic linear bandits

被引：0

作者：

Charisopoulos, Vasileios ^{[1
]}

Esfandiari, Hossein ^{[2
]}

Mirrokni, Vahab ^{[2
]}

机构：

[1] Cornell Univ, Operat Res Informat Engn, Ithaca, NY USA

[2] Google Res, Mountain View, CA 94043 USA

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202 | 2023年 / 202卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we study the stochastic linear bandit problem under the additional requirements of differential privacy, robustness and batched observations. In particular, we assume an adversary randomly chooses a constant fraction of the observed rewards in each batch, replacing them with arbitrary numbers. We present differentially private and robust variants of the arm elimination algorithm using logarithmic batch queries under two privacy models and provide regret bounds in both settings. In the first model, every reward in each round is reported by a potentially different client, which reduces to standard local differential privacy (LDP). In the second model, every action is "owned" by a different client, who may aggregate the rewards over multiple queries and privatize the aggregate response instead. To the best of our knowledge, our algorithms are the first simultaneously providing differential privacy and adversarial robustness in the stochastic linear bandits problem.

引用

页数：19

共 50 条

[1] Stochastic Linear Bandits Robust to Adversarial Attacks
Bogunovic, Ilija
Losalka, Arpan
Krause, Andreas
Scarlett, Jonathan
24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
[2] Differentially Private Stochastic Linear Bandits: (Almost) for Free
Hanna O.
Girgis A.M.
Fragouli C.
Diggavi S.
IEEE Journal on Selected Areas in Information Theory, 2024, 5 : 135 - 147
[3] On Private and Robust Bandits
Wu, Yulian
Zhou, Xingyu
Tao, Youming
Wang, Di
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[4] Robust Stochastic Linear Contextual Bandits Under Adversarial Attacks
Ding, Qin
Hsieh, Cho-Jui
Sharpnack, James
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
[5] Safe Linear Stochastic Bandits
Khezeli, Kia
Bitar, Eilyan
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 10202 - 10209
[6] Stochastic Bandits with Linear Constraints
Pacchiano, Aldo
Ghavamzadeh, Mohammad
Bartlett, Peter
Jiang, Heinrich
24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
[7] Stochastic Bandits Robust to Adversarial Corruptions
Lykouris, Thodoris
Mirrokni, Vahab
Leme, Renato Paes
STOC'18: PROCEEDINGS OF THE 50TH ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING, 2018, : 114 - 122
[8] Shuffle Private Linear Contextual Bandits
Chowdhury, Sayak Ray
Zhou, Xingyu
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[9] Differentially Private Contextual Linear Bandits
Shariff, Roshan
Sheffet, Or
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
[10] Stochastic Conservative Contextual Linear Bandits
Lin, Jiabin
Lee, Xian Yeow
Jubery, Talukder
Moothedath, Shana
Sarkar, Soumik
Ganapathysubramanian, Baskar
2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 7321 - 7326

← 1 2 3 4 5 →