50 entries in total
- [21] Polynomial-time reinforcement learning of near-optimal policies EIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS, 2002: 205 - 210
- [22] Selecting Near-Optimal Approximate State Representations in Reinforcement Learning Algorithmic Learning Theory (ALT 2014), 2014, 8776 : 140 - 154
- [24] Near-Optimal No-Regret Learning for Correlated Equilibria in Multi-player General-Sum Games PROCEEDINGS OF THE 54TH ANNUAL ACM SIGACT SYMPOSIUM ON THEORY OF COMPUTING (STOC '22), 2022: 736 - 749
- [25] Regret Bounds for Learning State Representations in Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [26] Near-Optimal Design of Experiments via Regret Minimization INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
- [27] Variational Bayesian Reinforcement Learning with Regret Bounds ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021
- [28] Near-Optimal Bounds for Learning Gaussian Halfspaces with Random Classification Noise ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023
- [29] Non-stationary Risk-Sensitive Reinforcement Learning: Near-Optimal Dynamic Regret, Adaptive Detection, and Separation Design THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 6, 2023: 7405 - 7413
- [30] Near-Optimal Offline Reinforcement Learning via Double Variance Reduction ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34