50 entries in total
- [34] Variance-constrained actor-critic algorithms for discounted and average reward MDPs. Machine Learning, 2016, 105: 367-417
- [35] Natural Gradient Actor-Critic Algorithms using Random Rectangular Coarse Coding. 2008 Proceedings of SICE Annual Conference, Vols 1-7, 2008: 1945-1952
- [36] Beyond the Policy Gradient Theorem for Efficient Policy Updates in Actor-Critic Algorithms. International Conference on Artificial Intelligence and Statistics, Vol 151, 2022, 151: 5658-5688
- [38] A Novel Heterogeneous Actor-critic Algorithm with Recent Emphasizing Replay Memory. International Journal of Automation and Computing, 2021, 18: 619-631
- [39] Model Learning Actor-Critic Algorithms: Performance Evaluation in a Motion Control Task. 2012 IEEE 51st Annual Conference on Decision and Control (CDC), 2012: 5272-5277
- [40] Algorithms for Variance Reduction in a Policy-Gradient Based Actor-Critic Framework. ADPRL: 2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, 2009: 130-136