50 entries in total
- [22] Finite-Sample Analysis of Off-Policy Natural Actor-Critic With Linear Function Approximation. IEEE CONTROL SYSTEMS LETTERS, 2022, 6: 2611-2616
- [26] Episode-Experience Replay Based Tree-Backup Method for Off-Policy Actor-Critic Algorithm. PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT I, 2018, 11256: 562-573
- [27] Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification. ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022