50 items in total
- [31] First-Order Bayesian Regret Analysis of Thompson Sampling. Algorithmic Learning Theory, Vol 117, 2020, 117: 196-233
- [33] Globally Informative Thompson Sampling for Structured Bandit Problems with Application to CrowdTranscoding. 3rd International Conference on Artificial Intelligence in Information and Communication (IEEE ICAIIC 2021), 2021: 210-215
- [34] Online (Multinomial) Logistic Bandit: Improved Regret and Constant Computation Cost. Advances in Neural Information Processing Systems 36 (NeurIPS 2023), 2023
- [35] Improved Regret Bounds for Online Kernel Selection Under Bandit Feedback. Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2022, Pt IV, 2023, 13716: 333-348
- [36] Improved Regret Bounds for Projection-free Bandit Convex Optimization. International Conference on Artificial Intelligence and Statistics, Vol 108, 2020, 108: 2196-2205
- [37] Near-Optimal Regret in Linear MDPs with Aggregate Bandit Feedback. International Conference on Machine Learning, 2024, 235
- [38] Multi-Agent Thompson Sampling for Bandit Applications with Sparse Neighbourhood Structures. Scientific Reports, 10
- [39] Linear Thompson Sampling Under Unknown Linear Constraints. 2020 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2020: 3392-3396