共 50 条
- [31] Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2022, 4 (02): : 834 - 857
- [32] Stacked Thompson Bandits 2017 IEEE/ACM 3RD INTERNATIONAL WORKSHOP ON SOFTWARE ENGINEERING FOR SMART CYBER-PHYSICAL SYSTEMS (SESCPS 2017), 2017, : 18 - 21
- [33] The Hardness Analysis of Thompson Sampling for Combinatorial Semi-bandits with Greedy Oracle ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021,
- [34] Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 2150 - 2155
- [36] eLifting the Information Ratio: An Information-Theoretic Analysis of Thompson Sampling for Contextual Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [38] Near-Optimal Thompson Sampling-based Algorithms for Differentially Private Stochastic Bandits UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, VOL 180, 2022, 180 : 844 - +
- [39] Finite-Time Regret of Thompson Sampling Algorithms for Exponential Family Multi-Armed Bandits ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [40] Kolmogorov-Smirnov Test-Based Actively-Adaptive Thompson Sampling for Non-Stationary Bandits IEEE Transactions on Artificial Intelligence, 2022, 3 (01): : 11 - 19