50 records in total
- [2] Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [4] Thompson Sampling Based Mechanisms for Stochastic Multi-Armed Bandit Problems. AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017: 87-95
- [5] Contextual Bandit for Active Learning: Active Thompson Sampling. NEURAL INFORMATION PROCESSING (ICONIP 2014), PT I, 2014, 8834: 405-412
- [6] Bandit Algorithms Based on Thompson Sampling for Bounded Reward Distributions. ALGORITHMIC LEARNING THEORY, VOL 117, 2020, 117: 777-826
- [7] The Choice of Noninformative Priors for Thompson Sampling in Multiparameter Bandit Models. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 12, 2024: 13383-13390
- [8] A Thompson Sampling Approach to Unifying Causal Inference and Bandit Learning. ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT II, 2023, 13936: 255-266
- [10] An Improved Regret Bound for Thompson Sampling in the Gaussian Linear Bandit Setting. 2020 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2020: 2783-2788