共 50 条
- [1] Off-policy Learning over Heterogeneous Information for Recommendation PROCEEDINGS OF THE ACM WEB CONFERENCE 2022 (WWW'22), 2022, : 2348 - 2359
- [2] Pessimistic Off-Policy Multi-Objective Optimization INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
- [3] Off-policy evaluation for slate recommendation ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
- [4] Doubly Pessimistic Algorithms for Strictly Safe Off-Policy Optimization 2022 56TH ANNUAL CONFERENCE ON INFORMATION SCIENCES AND SYSTEMS (CISS), 2022, : 113 - 118
- [5] Boosted Off-Policy Learning INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206
- [6] Debiased Off-Policy Evaluation for Recommendation Systems 15TH ACM CONFERENCE ON RECOMMENDER SYSTEMS (RECSYS 2021), 2021, : 372 - 379
- [8] Learning with Options that Terminate Off-Policy THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 3173 - 3182
- [9] Online Learning with Off-Policy Feedback INTERNATIONAL CONFERENCE ON ALGORITHMIC LEARNING THEORY, VOL 201, 2023, 201 : 620 - 641
- [10] Average-Reward Off-Policy Policy Evaluation with Function Approximation INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 139, 2021, 139