共 50 条
- [21] Reinforcement Learning with Unbiased Policy Evaluation and Linear Function Approximation 2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 801 - 806
- [22] Autonomous helicopter control using reinforcement learning policy search methods 2001 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS I-IV, PROCEEDINGS, 2001, : 1615 - 1620
- [23] Comparison of Different Domain Randomization Methods for Policy Transfer in Reinforcement Learning 2023 IEEE 12TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE, DDCLS, 2023, : 1818 - 1823
- [25] Robust On-Policy Sampling for Data-Efficient Policy Evaluation in Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [27] Off-policy evaluation for tabular reinforcement learning with synthetic trajectories Statistics and Computing, 2024, 34
- [28] Policy Learning with Human Reinforcement International Journal of Fuzzy Systems, 2016, 18 : 618 - 629
- [30] Doubly Robust Off-policy Value Evaluation for Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 48, 2016, 48