共 50 条
- [21] Society of Agents: Regret Bounds of Concurrent Thompson Sampling ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [22] Feedback graph regret bounds for Thompson Sampling and UCB ALGORITHMIC LEARNING THEORY, VOL 117, 2020, 117 : 592 - 614
- [23] Regret Bounds for Expected Improvement Algorithms in Gaussian Process Bandit Optimization INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
- [24] Bandit Algorithms Based on Thompson Sampling for Bounded Reward Distributions ALGORITHMIC LEARNING THEORY, VOL 117, 2020, 117 : 777 - 826
- [25] The Choice of Noninformative Priors for Thompson Sampling in Multiparameter Bandit Models THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 12, 2024, : 13383 - 13390
- [26] A Thompson Sampling Approach to Unifying Causal Inference and Bandit Learning ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT II, 2023, 13936 : 255 - 266
- [28] Linear Thompson Sampling Revisited ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 54, 2017, 54 : 176 - 184
- [30] Linear Thompson sampling revisited ELECTRONIC JOURNAL OF STATISTICS, 2017, 11 (02): : 5165 - 5197