共 50 条
- [31] Compatible Reward Inverse Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 30 (NIPS 2017), 2017, 30
- [33] Distributional Reward Decomposition for Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
- [34] Reward learning: Reinforcement, incentives, and expectations PSYCHOLOGY OF LEARNING AND MOTIVATION: ADVANCES IN RESEARCH AND THEORY, VOL 40, 2001, 40 : 223 - 278
- [36] Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient Manipulation THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024, : 21099 - 21106
- [38] Reward tampering problems and solutions in reinforcement learning: a causal influence diagram perspective Synthese, 2021, 198 : 6435 - 6467
- [40] Reward Learning for Efficient Reinforcement Learning in Extractive Document Summarisation PROCEEDINGS OF THE TWENTY-EIGHTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2019, : 2350 - 2356