共 50 条
- [31] DOUBLE-LINEAR THOMPSON SAMPLING FOR CONTEXT-ATTENTIVE BANDITS 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021, : 3450 - 3454
- [32] Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2022, 4 (02): : 834 - 857
- [33] The Hardness Analysis of Thompson Sampling for Combinatorial Semi-bandits with Greedy Oracle ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
- [34] Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 2150 - 2155
- [36] eLifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [38] Near-Optimal Thompson Sampling-based Algorithms for Differentially Private Stochastic Bandits UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 844 - +
- [39] Finite-Time Regret of Thompson Sampling Algorithms for Exponential Family Multi-Armed Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,