共 50 条
- [32] Reconsidering Stochastic Policy Gradient Methods for Traffic Signal Control ADVANCES AND TRENDS IN ARTIFICIAL INTELLIGENCE: THEORY AND APPLICATIONS, IEA-AIE 2024, 2024, 14748 : 442 - 453
- [33] A Stochastic Policy Gradient Based Adaptive Control for Biped Walking 2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 3224 - 3229
- [35] A Temporal-Difference Approach to Policy Gradient Estimation INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
- [37] Infinite-horizon policy-gradient estimation JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 2001, 15 : 319 - 350
- [39] Robust Gradient Estimation Algorithm for a Stochastic System with Colored Noise International Journal of Control, Automation and Systems, 2023, 21 : 553 - 562
- [40] A Baseline for Any Order Gradient Estimation in Stochastic Computation Graphs INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 97, 2019, 97