共 50 条
- [31] AN EFFICIENT POLICY ITERATION ALGORITHM FOR DYNAMIC PROGRAMMING EQUATIONS SIAM JOURNAL ON SCIENTIFIC COMPUTING, 2015, 37 (01): : A181 - A200
- [36] Reinforcement Q-learning based on Multirate Generalized Policy Iteration and Its Application to a 2-DOF Helicopter International Journal of Control, Automation and Systems, 2018, 16 : 377 - 386
- [38] Multi-Agent Reward-Iteration Fuzzy Q-Learning International Journal of Fuzzy Systems, 2021, 23 : 1669 - 1679
- [40] Dynamic Choice of State Abstraction in Q-Learning ECAI 2016: 22ND EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2016, 285 : 46 - 54