Quantum Kernelized Bandits

被引：0

作者：

Hikima, Yasunari ^{[1
]}

Murao, Kazunori ^{[1
]}

Takemori, Sho ^{[1
]}

Umeda, Yuhei ^{[1
]}

机构：

[1] Fujitsu Ltd, AI Lab, Kawasaki, Kanagawa, Japan

来源：

UNCERTAINTY IN ARTIFICIAL INTELLIGENCE | 2024年 / 244卷

关键词：

D O I：

暂无

中图分类号：

学科分类号：

摘要：

We consider the quantum kernelized bandit problem, where the player observes information of rewards through quantum circuits termed the quantum reward oracle, and the mean reward function belongs to a reproducing kernel Hilbert space (RKHS). We propose a UCB-type algorithm that utilizes the quantum Monte Carlo (QMC) method and provide regret bounds in terms of the decay rate of eigenvalues of the Mercer operator of the kernel. Our algorithm achieves (O) over tilde (T3/1+beta p) log(1/delta) and (O)over tilde>(log(3(1+beta e-1)/2) (T) log(1/delta) cumulative regret bounds with probability at least 1 - delta if the kernel has a beta(p)-polynomial eigendecay and beta(e)-exponential eigendecay, respectively. In particular, in the case of the exponential eigendecay, our regret bounds exponentially improve that of classical algorithms. Moreover, our results indicate that our regret bound is better than the lower bound in the classical kernelized bandit problem if the rate of decay is sufficiently fast.

引用

页码：1640 / 1657

页数：18

共 50 条

[1] Adversarial Contextual Bandits Go Kernelized
Neu, Gergely
Olkhovskaya, Julia
Vakili, Sattar
INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 237, 2024, 237
[2] On Kernelized Multi-armed Bandits
Chowdhury, Sayak Ray
Gopalan, Aditya
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
[3] (Private) Kernelized Bandits with Distributed Biased Feedback
Li, Fengjiao
Zhou, Xingyu
Ji, Bo
PROCEEDINGS OF THE ACM ON MEASUREMENT AND ANALYSIS OF COMPUTING SYSTEMS, 2023, 7 (01)
[4] On Kernelized Multi-Armed Bandits with Constraints
Zhou, Xingyu
Ji, Bo
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
[5] (Private) Kernelized Bandits with Distributed Biased Feedback
Li F.
Zhou X.
Ji B.
Performance Evaluation Review, 2023, 51 (01): : 61 - 62
[6] Communication Efficient Distributed Learning for Kernelized Contextual Bandits
Li, Chuanhao
Wang, Huazheng
Wang, Mengdi
Wang, Hongning
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
[7] Instance-dependent Regret Analysis of Kernelized Bandits
Shekhar, Shubhanshu
Javidi, Tara
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022, : 19747 - 19772
[8] Quantum bandits
Balthazar Casalé
Giuseppe Di Molfetta
Hachem Kadri
Liva Ralaivola
Quantum Machine Intelligence, 2020, 2
[9] Quantum bandits
Casale, Balthazar
Di Molfetta, Giuseppe
Kadri, Hachem
Ralaivola, Liva
QUANTUM MACHINE INTELLIGENCE, 2020, 2 (01)
[10] Quantum algorithm for kernelized correlation filter
Shang Gao
Shijie Pan
Yuguang Yang
Science China Information Sciences, 2023, 66

← 1 2 3 4 5 →