共 50 条
- [32] Sample Complexity Bounds for Two Timescale Value-based Reinforcement Learning Algorithms 24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
- [37] Mental representations distinguish value-based decisions from perceptual decisions Psychonomic Bulletin & Review, 2021, 28 : 1413 - 1422
- [39] Adaptability Analysis of Value-based and Policy-based Deep Reinforcement Learning in Nuclear Field Yuanzineng Kexue Jishu/Atomic Energy Science and Technology, 2024, 58 : 382 - 392