Robust and private stochastic linear bandits

被引：0

作者：

Charisopoulos, Vasileios ^{[1
]}

Esfandiari, Hossein ^{[2
]}

Mirrokni, Vahab ^{[2
]}

机构：

[1] Cornell Univ, Operat Res Informat Engn, Ithaca, NY USA

[2] Google Res, Mountain View, CA 94043 USA

来源：

INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202 | 2023年 / 202卷

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

In this paper, we study the stochastic linear bandit problem under the additional requirements of differential privacy, robustness and batched observations. In particular, we assume an adversary randomly chooses a constant fraction of the observed rewards in each batch, replacing them with arbitrary numbers. We present differentially private and robust variants of the arm elimination algorithm using logarithmic batch queries under two privacy models and provide regret bounds in both settings. In the first model, every reward in each round is reported by a potentially different client, which reduces to standard local differential privacy (LDP). In the second model, every action is "owned" by a different client, who may aggregate the rewards over multiple queries and privatize the aggregate response instead. To the best of our knowledge, our algorithms are the first simultaneously providing differential privacy and adversarial robustness in the stochastic linear bandits problem.

引用

页数：19

共 50 条

[31] Robust Heavy-Tailed Linear Bandits Algorithm
Ma L.
Zhao P.
Zhou Z.
Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2023, 60 (06): : 1385 - 1395
[32] Robust Stochastic Multi-Armed Bandits with Historical Data
Yacobi, Sarah Boufelja
Bounefouf, Djallel
COMPANION OF THE WORLD WIDE WEB CONFERENCE, WWW 2023, 2023, : 959 - 965
[33] (Nearly) Optimal Differentially Private Stochastic Multi-Arm Bandits
Mishra, Nikita
Thakurta, Abhradeep
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2015, : 592 - 601
[34] Randomized Exploration for Non-Stationary Stochastic Linear Bandits
Kim, Baekjin
Tewari, Ambuj
CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124 : 71 - 80
[35] Linear Stochastic Bandits over a Bit-Constrained Channel
Mitra, Aritra
Hassani, Hamed
Pappas, George J.
LEARNING FOR DYNAMICS AND CONTROL CONFERENCE, VOL 211, 2023, 211
[36] Multi-task Representation Learning with Stochastic Linear Bandits
Cella, Leonardo
Lounici, Karim
Pacreau, Gregoire
Pontil, Massimiliano
INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
[37] Efficient and Robust High-Dimensional Linear Contextual Bandits
Chen, Cheng
Luo, Luo
Zhang, Weinan
Yu, Yong
Lian, Yijiang
PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, : 4259 - 4265
[38] Robust Risk-Averse Stochastic Multi-armed Bandits
Maillard, Odalric-Ambrym
ALGORITHMIC LEARNING THEORY (ALT 2013), 2013, 8139 : 218 - 233
[39] Reward-Biased Maximum Likelihood Estimation for Linear Stochastic Bandits
Hung, Yu-Heng
Hsieh, Ping-Chun
Liu, Xi
Kumar, P. R.
THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 7874 - 7882
[40] Hierarchize Pareto Dominance in Multi-Objective Stochastic Linear Bandits
Cheng, Ji
Xue, Bo
Yi, Jiaxiang
Zhang, Qingfu
THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 10, 2024, : 11489 - 11497

← 1 2 3 4 5 →