50 records in total
- [33] Asymptotic Performance of Thompson Sampling in the Batched Multi-Armed Bandits. 2021 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2021: 539 - 544
- [34] Thompson Sampling for High-Dimensional Sparse Linear Contextual Bandits. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 202, 2023, 202
- [35] Solving Bernoulli Rank-One Bandits with Unimodal Thompson Sampling. ALGORITHMIC LEARNING THEORY, VOL 117, 2020, 117: 862 - 889
- [36] A Unifying Theory of Thompson Sampling for Continuous Risk-Averse Bandits. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022: 6159 - 6166
- [37] DOUBLE-LINEAR THOMPSON SAMPLING FOR CONTEXT-ATTENTIVE BANDITS. 2021 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP 2021), 2021: 3450 - 3454
- [38] Double Doubly Robust Thompson Sampling for Generalized Linear Contextual Bandits. THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 7, 2023: 8300 - 8307
- [39] PG-TS: Improved Thompson Sampling for Logistic Contextual Bandits. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
- [40] Feel-Good Thompson Sampling for Contextual Bandits and Reinforcement Learning. SIAM JOURNAL ON MATHEMATICS OF DATA SCIENCE, 2022, 4 (02): 834 - 857