50 entries in total
- [32] Asymptotically Efficient Off-Policy Evaluation for Tabular Reinforcement Learning. INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 108, 2020
- [33] Double Reinforcement Learning for Efficient and Robust Off-Policy Evaluation. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020
- [38] Deep Reinforcement Learning for Vision-Based Robotic Grasping: A Simulated Comparative Evaluation of Off-Policy Methods. 2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2018: 6284-6291
- [39] Exploratory Policy Generation Methods in On-line Deep Reinforcement Learning: A Survey. Jiqiren/Robot, 2024, 46(06): 753-768
- [40] Two novel on-policy reinforcement learning algorithms based on TD(λ)-methods. 2007 IEEE INTERNATIONAL SYMPOSIUM ON APPROXIMATE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2007: 280-+