共 50 条
- [21] Evaluating the Robustness of Off-Policy Evaluation 15TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS 2021), 2021, : 114 - 123
- [22] Towards Optimal Off-Policy Evaluation for Reinforcement Learning with Marginalized Importance Sampling ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [24] Representation Balancing MDPs for Off-Policy Policy Evaluation ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), 2018, 31
- [25] Consistent On-Line Off-Policy Evaluation INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 70, 2017, 70
- [27] Off-Policy Evaluation via the Regularized Lagrangian ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [28] Safe Optimal Design with Applications in Off-Policy Learning INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 151, 2022, 151
- [29] Offline RL Without Off-Policy Evaluation ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
- [30] Learning Action Embeddings for Off-Policy Evaluation ADVANCES IN INFORMATION RETRIEVAL, ECIR 2024, PT I, 2024, 14608 : 108 - 122