共 45 条
- [22] Finite-Sample Analysis of Off-Policy Natural Actor-Critic With Linear Function Approximation IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 2611 - 2616
- [24] Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,
- [29] Fast and stable learning of quasi-passive dynamic walking by an unstable biped robot based on off-policy natural actor-critic 2006 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-12, 2006, : 5226 - +