Robust and private stochastic linear bandits

被引：0

作者：

Charisopoulos, Vasileios ^{[1
]}

Esfandiari, Hossein ^{[2
]}

Mirrokni, Vahab ^{[2
]}

机构：

[1] Cornell Univ, Operat Res Informat Engn, Ithaca, NY USA

[2] Google Res, Mountain View, CA 94043 USA

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202 | 2023年 / 202卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we study the stochastic linear bandit problem under the additional requirements of differential privacy, robustness and batched observations. In particular, we assume an adversary randomly chooses a constant fraction of the observed rewards in each batch, replacing them with arbitrary numbers. We present differentially private and robust variants of the arm elimination algorithm using logarithmic batch queries under two privacy models and provide regret bounds in both settings. In the first model, every reward in each round is reported by a potentially different client, which reduces to standard local differential privacy (LDP). In the second model, every action is "owned" by a different client, who may aggregate the rewards over multiple queries and privatize the aggregate response instead. To the best of our knowledge, our algorithms are the first simultaneously providing differential privacy and adversarial robustness in the stochastic linear bandits problem.

引用

页数：19

共 50 条

[21] Byzantine-Robust Federated Linear Bandits
Jadbabaie, Ali
Li, Haochuan
Qian, Jian
Tian, Yi
2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 5206 - 5213
[22] Differentially Private Linear Bandits with Partial Distributed Feedback
Li, Fengjiao
Zhou, Xingyu
Ji, Bo
2022 20TH INTERNATIONAL SYMPOSIUM ON MODELING AND OPTIMIZATION IN MOBILE, AD HOC, AND WIRELESS NETWORKS (WIOPT 2022), 2022, : 41 - 48
[23] Stochastic Contextual Dueling Bandits under Linear Stochastic Transitivity Models
Bengs, Viktor
Saha, Aadirupa
Huellermeier, Eyke
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[24] Learning in Generalized Linear Contextual Bandits with Stochastic Delays
Zhou, Zhengyuan
Xu, Renyuan
Blanchet, Jose
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[25] Perturbed-History Exploration in Stochastic Linear Bandits
Kveton, Branislav
Szepesvari, Csaba
Ghavamzadeh, Mohammad
Boutilier, Craig
35TH UNCERTAINTY IN ARTIFICIAL INTELLIGENCE CONFERENCE (UAI 2019), 2020, 115 : 530 - 540
[26] Collaborative Multi-agent Stochastic Linear Bandits
Moradipari, Ahmadreza
Ghavamzadeh, Mohammad
Alizadeh, Mahnoosh
2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 2761 - 2766
[27] Quantum Multi-Armed Bandits and Stochastic Linear Bandits Enjoy Logarithmic Regrets
Wan, Zongqi
Zhang, Zhijie
Li, Tongyang
Zhang, Jialin
Sun, Xiaoming
THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 8, 2023, : 10087 - 10094
[28] Multi-agent Heterogeneous Stochastic Linear Bandits
Ghosh, Avishek
Sankararaman, Abishek
Ramchandran, Kannan
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, ECML PKDD 2022, PT IV, 2023, 13716 : 300 - 316
[29] Leveraging Initial Hints for Free in Stochastic Linear Bandits
Cutkosky, Ashok
Dann, Chris
Das, Abhimanyu
Zhang, Qiuyi
INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 167, 2022, 167
[30] Robust Pure Exploration in Linear Bandits with Limited Budget
Alieva, Ayya
Cutkosky, Ashok
Das, Abhimanyu
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139

← 1 2 3 4 5 →