共 50 条
- [32] Off-Policy Reinforcement Learning with Delayed Rewards INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [33] Representations for Stable Off-Policy Reinforcement Learning INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
- [35] On the Reuse Bias in Off-Policy Reinforcement Learning PROCEEDINGS OF THE THIRTY-SECOND INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2023, 2023, : 4513 - 4521
- [36] A perspective on off-policy evaluation in reinforcement learning Frontiers of Computer Science, 2019, 13 : 911 - 912
- [39] Sequential Search with Off-Policy Reinforcement Learning PROCEEDINGS OF THE 30TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, CIKM 2021, 2021, : 4006 - 4015
- [40] Representations for Stable Off-Policy Reinforcement Learning 25TH AMERICAS CONFERENCE ON INFORMATION SYSTEMS (AMCIS 2019), 2019,