50 items in total
- [41] Balancing multiple sources of reward in reinforcement learning. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 13, 2001, 13: 1082-1088
- [43] Evolved Intrinsic Reward Functions for Reinforcement Learning. PROCEEDINGS OF THE TWENTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-10), 2010: 1955-1956
- [46] Hindsight Reward Shaping in Deep Reinforcement Learning. 2020 INTERNATIONAL SAUPEC/ROBMECH/PRASA CONFERENCE, 2020: 653-659
- [47] Robust Average-Reward Reinforcement Learning. Journal of Artificial Intelligence Research, 2024, 80: 719-803
- [48] Reward-Free Exploration for Reinforcement Learning. INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 119, 2020, 119
- [49] AntNet with Reward-Penalty Reinforcement Learning. 2010 SECOND INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE, COMMUNICATION SYSTEMS AND NETWORKS (CICSYN), 2010: 17-21
- [50] Schedules of Reinforcement, Learning, and Frequency Reward Programs. ADVANCES IN CONSUMER RESEARCH, VOL XXXVI, 2009, 36: 555