共 50 条
- [41] Distributionally Robust Model-based Reinforcement Learning with Large State Spaces INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
- [42] A Lower Bound for the Sample Complexity of Inverse Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [43] SETTLING THE SAMPLE COMPLEXITY OF MODEL-BASED OFFLINE REINFORCEMENT LEARNING ANNALS OF STATISTICS, 2024, 52 (01): : 233 - 260
- [44] Sample strategy based on TD-error for offline reinforcement learning Gongcheng Kexue Xuebao/Chinese Journal of Engineering, 2023, 45 (12): : 2118 - 2128
- [46] Corruption-Robust Offline Reinforcement Learning with General Function Approximation ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [47] Finite-sample Guarantees for Nash Q-learning with Linear Function Approximation UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2023, 216 : 424 - 432
- [48] Pessimistic Q-Learning for Offline Reinforcement Learning: Towards Optimal Sample Complexity INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,