共 50 条
- [21] A value-based deep reinforcement learning model with human expertise in optimal treatment of sepsis npj Digital Medicine, 6
- [22] Rethinking Exploration and Experience Exploitation in Value-Based Multi-Agent Reinforcement Learning IEEE ACCESS, 2025, 13 : 13770 - 13781
- [23] Variable Sampling Period Adaptive Control Based on Reinforcement Learning CONTROLO 2022, 2022, 930 : 577 - 586
- [27] Inverse Optimal Control with Discount Factor for Continuous and Discrete-Time Control-Affine Systems and Reinforcement Learning 2022 IEEE 61ST CONFERENCE ON DECISION AND CONTROL (CDC), 2022, : 5783 - 5788
- [29] Convex Programs and Lyapunov Functions for Reinforcement Learning: A Unified Perspective on the Analysis of Value-Based Methods 2022 AMERICAN CONTROL CONFERENCE, ACC, 2022, : 3317 - 3322