共 50 条
- [21] Generalizing Soft Actor-Critic Algorithms to Discrete Action Spaces PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2024, PT 1, 2025, 15031 : 34 - 49
- [22] Parametrized actor-critic algorithms for finite-horizon MDPs 2007 AMERICAN CONTROL CONFERENCE, VOLS 1-13, 2007, : 2701 - 2706
- [23] A Critical Point Analysis of Actor-Critic Algorithms with Neural Networks IFAC PAPERSONLINE, 2022, 55 (15): : 27 - 32
- [24] Improving Sample Complexity Bounds for (Natural) Actor-Critic Algorithms ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
- [26] Sample and Communication-Efficient Decentralized Actor-Critic Algorithms with Finite-Time Analysis INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [28] A Hessian Actor-Critic Algorithm 2014 IEEE 53RD ANNUAL CONFERENCE ON DECISION AND CONTROL (CDC), 2014, : 1131 - 1136
- [29] Sample-efficient Actor-Critic Reinforcement Learning with Supervised Data for Dialogue Management 18TH ANNUAL MEETING OF THE SPECIAL INTEREST GROUP ON DISCOURSE AND DIALOGUE (SIGDIAL 2017), 2017, : 147 - 157