50 entries in total
- [1] Admissible Policy Teaching through Reward Design. THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELFTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022: 6037-6045
- [4] PEDAGOGIC WORKSHOP IN THE FUNCTION OF ACTIVE TEACHING AND LEARNING THROUGH SUCCESS. METODICKI OGLEDI-METHODICAL REVIEW, 2006, 13 (01): 123-136
- [5] Reward Machines: Exploiting Reward Function Structure in Reinforcement Learning. Journal of Artificial Intelligence Research, 2022, 73: 173-208
- [7] Reward Certification for Policy Smoothed Reinforcement Learning. THIRTY-EIGHTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 38 NO 19, 2024: 21429-21437
- [8] Reward Function Learning for Dialogue Management. PROCEEDINGS OF THE SIXTH STARTING AI RESEARCHERS' SYMPOSIUM (STAIRS 2012), 2012, 241: 95-+
- [9] Pitfalls of Learning a Reward Function Online. PROCEEDINGS OF THE TWENTY-NINTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020: 1592-1600