Robust and private stochastic linear bandits

Authors
Charisopoulos, Vasileios [1]
Esfandiari, Hossein [2]
Mirrokni, Vahab [2]
Affiliations
[1] Cornell Univ, Operat Res Informat Engn, Ithaca, NY USA
[2] Google Res, Mountain View, CA 94043 USA
Source
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202 | 2023 / Vol. 202
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Theory of artificial intelligence];
Subject classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper, we study the stochastic linear bandit problem under the additional requirements of differential privacy, robustness and batched observations. In particular, we assume an adversary randomly chooses a constant fraction of the observed rewards in each batch, replacing them with arbitrary numbers. We present differentially private and robust variants of the arm elimination algorithm using logarithmic batch queries under two privacy models and provide regret bounds in both settings. In the first model, every reward in each round is reported by a potentially different client, which reduces to standard local differential privacy (LDP). In the second model, every action is "owned" by a different client, who may aggregate the rewards over multiple queries and privatize the aggregate response instead. To the best of our knowledge, our algorithms are the first simultaneously providing differential privacy and adversarial robustness in the stochastic linear bandits problem.
Pages: 19