50 entries in total
- [1] Counterfactual Reward Modification for Streaming Recommendation with Delayed Feedback. SIGIR '21 - PROCEEDINGS OF THE 44TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2021: 41-50
- [2] Task Replication for Vehicular Cloud: Contextual Combinatorial Bandit with Delayed Feedback. IEEE CONFERENCE ON COMPUTER COMMUNICATIONS (IEEE INFOCOM 2019), 2019: 748-756
- [3] Efficient Counterfactual Learning from Bandit Feedback. THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019: 4634-4641
- [4] Contextual Dependent Click Bandit Algorithm for Web Recommendation. COMPUTING AND COMBINATORICS (COCOON 2018), 2018, 10976: 39-50
- [5] Transferable Contextual Bandit for Cross-Domain Recommendation. THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018: 3619-3626
- [6] Counterfactual Risk Minimization: Learning from Logged Bandit Feedback. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 37, 2015, 37: 814-823
- [7] Contextual Multi-Armed Bandit for Email Layout Recommendation. PROCEEDINGS OF THE 17TH ACM CONFERENCE ON RECOMMENDER SYSTEMS, RECSYS 2023, 2023: 400-402
- [8] Expert Features for a Student Support Recommendation Contextual Bandit Algorithm. FOURTEENTH INTERNATIONAL CONFERENCE ON LEARNING ANALYTICS & KNOWLEDGE, LAK 2024, 2024: 864-870
- [9] Budgeted Recommendation with Delayed Feedback. GOOD PRACTICES AND NEW PERSPECTIVES IN INFORMATION SYSTEMS AND TECHNOLOGIES, VOL 3, WORLDCIST 2024, 2024, 987: 202-213