共 15 条
- [1] Best-of-three-worlds Analysis for Linear Bandits with Follow-the-regularized-leader Algorithm THIRTY SIXTH ANNUAL CONFERENCE ON LEARNING THEORY, VOL 195, 2023, 195 : 657 - 673
- [2] Improved Regret Analysis for Variance-Adaptive Linear Bandits and Horizon-Free Linear Mixture MDPs ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [3] Adversarially Robust Multi-Armed Bandit Algorithm with Variance-Dependent Regret Bounds CONFERENCE ON LEARNING THEORY, VOL 178, 2022, 178
- [4] Regret Bounds for Robust Adaptive Control of the Linear Quadratic Regulator ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
- [5] Regret Lower Bounds for Unbiased Adaptive Control of Linear Quadratic Regulators IEEE CONTROL SYSTEMS LETTERS, 2020, 4 (03): : 785 - 790
- [6] Regret lower bound and optimal algorithm for high-dimensional contextual linear bandit ELECTRONIC JOURNAL OF STATISTICS, 2021, 15 (02): : 5652 - 5695
- [7] SAdaBoundNc: an adaptive subgradient online learning algorithm with logarithmic regret bounds Neural Computing and Applications, 2023, 35 : 8051 - 8063
- [8] SAdaBoundNc: an adaptive subgradient online learning algorithm with logarithmic regret bounds NEURAL COMPUTING & APPLICATIONS, 2023, 35 (11): : 8051 - 8063
- [10] Variance-Dependent Regret Bounds for Linear Bandits and Reinforcement Learning: Adaptivity and Computational Efficiency THIRTY SIXTH ANNUAL CONFERENCE ON LEARNING THEORY, VOL 195, 2023, 195