共 50 条
- [31] Reinforcement temporal difference learning scheme for dynamic energy management in embedded systems 19TH INTERNATIONAL CONFERENCE ON VLSI DESIGN, PROCEEDINGS, 2005, : 645 - 650
- [32] Critical factors in the empirical performance of temporal difference and evolutionary methods for reinforcement learning Autonomous Agents and Multi-Agent Systems, 2010, 21 : 1 - 35
- [33] The Nature of Temporal Difference Errors in Multi-step Distributional Reinforcement Learning ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [39] A Unified Approach for Multi-step Temporal-Difference Learning with Eligibility Traces in Reinforcement Learning PROCEEDINGS OF THE TWENTY-SEVENTH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2018, : 2984 - 2990