50 items in total
- [11] Non-delusional Q-learning and Value Iteration. Advances in Neural Information Processing Systems 31 (NIPS 2018), 2018, 31
- [13] Policy iteration based Q-learning for linear nonzero-sum quadratic differential games. Science China Information Sciences, 2019, 62
- [16] Multiresolution State-Space Discretization Method for Q-Learning with Function Approximation and Policy Iteration. 2009 IEEE International Conference on Systems, Man and Cybernetics (SMC 2009), Vols 1-9, 2009: 2677-2682
- [18] Enhanced Q-Learning Algorithm for Dynamic Power Management with Performance Constraint. 2010 Design, Automation & Test in Europe (DATE 2010), 2010: 602-605
- [19] Stochastic Primal-Dual Q-Learning Algorithm For Discounted MDPs. 2019 American Control Conference (ACC), 2019: 4897-4902
- [20] Empirical Policy Iteration for Approximate Dynamic Programming. 2014 IEEE 53rd Annual Conference on Decision and Control (CDC), 2014: 6573-6578