共 50 条
- [21] Off-Policy Evaluation for Human Feedback ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [22] Off-policy evaluation for slate recommendation ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
- [23] High Confidence Off-Policy Evaluation PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 3000 - 3006
- [24] State Relevance for Off-Policy Evaluation INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139
- [25] Evaluating the Robustness of Off-Policy Evaluation 15TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS 2021), 2021, : 114 - 123
- [26] VALUE-AWARE IMPORTANCE WEIGHTING FOR OFF-POLICY REINFORCEMENT LEARNING CONFERENCE ON LIFELONG LEARNING AGENTS, VOL 232, 2023, 232 : 745 - 763
- [27] Representation Balancing MDPs for Off-Policy Policy Evaluation ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
- [28] Consistent On-Line Off-Policy Evaluation INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
- [30] Offline RL Without Off-Policy Evaluation ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34