50 entries in total
- [41] Efficient implementation of dynamic fuzzy Q-learning. ICICS-PCM 2003, VOLS 1-3, PROCEEDINGS, 2003: 1854-1858
- [42] Q-learning with Experience Replay in a Dynamic Environment. PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016
- [44] A New Discrete-Time Iterative Adaptive Dynamic Programming Algorithm Based on Q-Learning. ADVANCES IN NEURAL NETWORKS - ISNN 2015, 2015, 9377: 43-52
- [45] Dynamic programming with ARMA, Markov, and NARMA models vs. Q-learning - case study. IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL III, 2000: 265-270
- [48] Error Bound Analysis of Q-Function for Discounted Optimal Control Problems With Policy Iteration. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2017, 47(07): 1207-1216
- [49] On-policy Q-learning for Adaptive Optimal Control. 2014 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL), 2014: 301-306
- [50] Fundamental Q-learning Algorithm in Finding Optimal Policy. 2017 INTERNATIONAL CONFERENCE ON SMART GRID AND ELECTRICAL AUTOMATION (ICSGEA), 2017: 243-246