Quantum Kernelized Bandits

Cited by: 0
Authors
Hikima, Yasunari [1]
Murao, Kazunori [1]
Takemori, Sho [1]
Umeda, Yuhei [1]
Affiliations
[1] Fujitsu Ltd, AI Lab, Kawasaki, Kanagawa, Japan
DOI
Not available
Abstract
We consider the quantum kernelized bandit problem, in which the player observes information about rewards through quantum circuits, termed the quantum reward oracle, and the mean reward function belongs to a reproducing kernel Hilbert space (RKHS). We propose a UCB-type algorithm that uses the quantum Monte Carlo (QMC) method and provide regret bounds in terms of the decay rate of the eigenvalues of the Mercer operator of the kernel. Our algorithm achieves $\tilde{O}\left(T^{3/(1+\beta_p)} \log(1/\delta)\right)$ and $\tilde{O}\left(\log^{3(1+\beta_e^{-1})/2}(T) \log(1/\delta)\right)$ cumulative regret bounds with probability at least $1-\delta$ if the kernel has a $\beta_p$-polynomial eigendecay and a $\beta_e$-exponential eigendecay, respectively. In particular, in the case of exponential eigendecay, our regret bound improves exponentially on those of classical algorithms. Moreover, our results indicate that our regret bound is better than the lower bound for the classical kernelized bandit problem if the rate of decay is sufficiently fast.
Pages: 1640-1657
Page count: 18