共 50 条
- [2] Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [3] Improved Regret Bounds for Thompson Sampling in Linear Quadratic Control Problems INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 80, 2018, 80
- [5] Improved Bayesian Regret Bounds for Thompson Sampling in Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [7] Optimal Regret Analysis of Thompson Sampling in Stochastic Multi-armed Bandit Problem with Multiple Plays INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37 : 1152 - 1161
- [8] Regret lower bound and optimal algorithm for high-dimensional contextual linear bandit ELECTRONIC JOURNAL OF STATISTICS, 2021, 15 (02): : 5652 - 5695
- [10] Regret Bounds for Safe Gaussian Process Bandit Optimization 2021 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2021, : 527 - 532