共 50 条
- [42] Preference-based reinforcement learning: evolutionary direct policy search using a preference-based racing algorithm Machine Learning, 2014, 97 : 327 - 351
- [44] Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [46] Sample-Efficient Reinforcement Learning via Conservative Model-Based Actor-Critic THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 8612 - 8620
- [48] Sample-Efficient Reinforcement Learning for Linearly-Parameterized MDPs with a Generative Model ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [49] On Sample-Efficient Offline Reinforcement Learning: Data Diversity, Posterior Sampling, and Beyond ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [50] Sample-Efficient Reinforcement Learning Is Feasible for Linearly Realizable MDPs with Limited Revisiting ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34