共 50 条
- [21] Finite-Sample Analysis of Off-Policy Natural Actor-Critic With Linear Function Approximation IEEE CONTROL SYSTEMS LETTERS, 2022, 6 : 2611 - 2616
- [23] Optimal Actor-Critic Policy With Optimized Training Datasets IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2022, 6 (06): : 1324 - 1334
- [26] Episode-Experience Replay Based Tree-Backup Method for Off-Policy Actor-Critic Algorithm PATTERN RECOGNITION AND COMPUTER VISION (PRCV 2018), PT I, 2018, 11256 : 562 - 573
- [29] Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35, NEURIPS 2022, 2022,