共 50 条
- [1] Reinforcement online learning to rank with unbiased reward shaping INFORMATION RETRIEVAL JOURNAL, 2022, 25 (04): : 386 - 413
- [2] Differentiable Unbiased Online Learning to Rank CIKM'18: PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2018, : 1293 - 1302
- [4] Belief Reward Shaping in Reinforcement Learning THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTIETH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / EIGHTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2018, : 3762 - 3769
- [5] Reward Shaping in Episodic Reinforcement Learning AAMAS'17: PROCEEDINGS OF THE 16TH INTERNATIONAL CONFERENCE ON AUTONOMOUS AGENTS AND MULTIAGENT SYSTEMS, 2017, : 565 - 573
- [6] Multigrid Reinforcement Learning with Reward Shaping ARTIFICIAL NEURAL NETWORKS - ICANN 2008, PT I, 2008, 5163 : 357 - 366
- [7] Unbiased Learning to Rank: Counterfactual and Online Approaches WWW'20: COMPANION PROCEEDINGS OF THE WEB CONFERENCE 2020, 2020, : 299 - 300
- [8] Reward Shaping for Reinforcement Learning by Emotion Expressions 2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 1288 - 1293
- [9] Hindsight Reward Shaping in Deep Reinforcement Learning 2020 INTERNATIONAL SAUPEC/ROBMECH/PRASA CONFERENCE, 2020, : 653 - 659