共 50 条
- [3] Learning state importance for preference-based reinforcement learning Machine Learning, 2024, 113 : 1885 - 1901
- [4] Preference-Based Policy Iteration: Leveraging Preference Learning for Reinforcement Learning MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT I, 2011, 6911 : 312 - 327
- [5] Model-Free Preference-Based Reinforcement Learning THIRTIETH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, : 2222 - 2228
- [6] Dueling Posterior Sampling for Preference-Based Reinforcement Learning CONFERENCE ON UNCERTAINTY IN ARTIFICIAL INTELLIGENCE (UAI 2020), 2020, 124 : 1029 - 1038
- [7] Preference-based reinforcement learning: evolutionary direct policy search using a preference-based racing algorithm Machine Learning, 2014, 97 : 327 - 351
- [10] Efficient Meta Reinforcement Learning for Preference-based Fast Adaptation ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,