共 50 条
- [41] EGFA-NAS: a neural architecture search method based on explosion gravitation field algorithm Complex & Intelligent Systems, 2024, 10 : 1667 - 1687
- [44] Regret Bounds for Thompson Sampling in Episodic Restless Bandit Problems ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [45] Scalable Discrete Sampling as a Multi-Armed Bandit Problem INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48
- [46] A Thompson Sampling Approach to Unifying Causal Inference and Bandit Learning ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2023, PT II, 2023, 13936 : 255 - 266
- [47] Bandit Algorithms Based on Thompson Sampling for Bounded Reward Distributions ALGORITHMIC LEARNING THEORY, VOL 117, 2020, 117 : 777 - 826
- [48] The Choice of Noninformative Priors for Thompson Sampling in Multiparameter Bandit Models THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 12, 2024, : 13383 - 13390
- [50] A PAC algorithm in relative precision for bandit problem with costly sampling Mathematical Methods of Operations Research, 2022, 96 : 161 - 185