共 50 条
- [41] Optimality of Thompson Sampling for Gaussian Bandits Depends on Priors ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 33, 2014, 33 : 375 - 383
- [42] Thompson Sampling Based Mechanisms for Stochastic Multi-Armed Bandit Problems AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 87 - 95
- [43] Bandit Theory and Thompson Sampling-Guided Directed Evolution for Sequence Optimization ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [44] Prior-free and prior-dependent regret bounds for Thompson Sampling 2014 48TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2014,
- [46] Doubly Robust Thompson Sampling with Linear Payoffs ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [47] Control of Unknown Linear Systems with Thompson Sampling 2017 55TH ANNUAL ALLERTON CONFERENCE ON COMMUNICATION, CONTROL, AND COMPUTING (ALLERTON), 2017, : 1198 - 1205
- [48] Thompson Sampling Based Multi-Armed-Bandit Mechanism Using Neural Networks AAMAS '19: PROCEEDINGS OF THE 18TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2019, : 2111 - 2113
- [49] Analysis of Thompson Sampling for Combinatorial Multi-armed Bandit with Probabilistically Triggered Arms 22ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 89, 2019, 89
- [50] Copeland Dueling Bandit Problem: Regret Lower Bound, Optimal Algorithm, and Computationally Efficient Algorithm INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48