共 38 条
- [21] Linear Convergence of Independent Natural Policy Gradient in Games With Entropy Regularization IEEE CONTROL SYSTEMS LETTERS, 2024, 8 : 1217 - 1222
- [22] Trajectory-wise Control Variates for Variance Reduction in Policy Gradient Methods CONFERENCE ON ROBOT LEARNING, VOL 100, 2019, 100
- [23] Improved Convergence Rate of Stochastic Gradient Langevin Dynamics with Variance Reduction and its Application to Optimization ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 35 (NEURIPS 2022), 2022,
- [25] Provably Fast Convergence of Independent Natural Policy Gradient for Markov Potential Games ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
- [27] Algorithms for Variance Reduction in a Policy-Gradient Based Actor-Critic Framework ADPRL: 2009 IEEE SYMPOSIUM ON ADAPTIVE DYNAMIC PROGRAMMING AND REINFORCEMENT LEARNING, 2009, : 130 - 136
- [28] Global Convergence of Policy Gradient Algorithms for Indefinite Least Squares Stationary Optimal Control IEEE CONTROL SYSTEMS LETTERS, 2020, 4 (03): : 638 - 643
- [30] Symmetric (Optimistic) Natural Policy Gradient for Multi-agent Learning with Parameter Convergence INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 206, 2023, 206