50 items in total
- [41] Near-Optimal Regret for Adversarial MDP with Delayed Bandit Feedback ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022
- [42] A Bayesian reinforcement learning approach in Markov games for computing near-optimal policies Annals of Mathematics and Artificial Intelligence, 2023, 91 : 675 - 690
- [45] Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
- [46] Near-Optimal Vehicular Crowdsensing Task Allocation Empowered by Deep Reinforcement Learning Jisuanji Xuebao/Chinese Journal of Computers, 2022, 45 (05) : 918 - 934
- [47] Near-Optimal Sample Complexity Bounds for Constrained MDPs ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022
- [48] Near-optimal lower bounds on the multi-party communication complexity of set disjointness 18TH IEEE ANNUAL CONFERENCE ON COMPUTATIONAL COMPLEXITY, PROCEEDINGS, 2003 : 107 - 117
- [49] Near-Optimal Complexity Bounds for Fragments of the Skolem Problem 37TH INTERNATIONAL SYMPOSIUM ON THEORETICAL ASPECTS OF COMPUTER SCIENCE (STACS 2020), 2020, 154
- [50] Safe Learning for Near-Optimal Scheduling QUANTITATIVE EVALUATION OF SYSTEMS (QEST 2021), 2021, 12846 : 235 - 254