共 21 条
- [1] Sutton R.S., Barto A.G., Reinforcement Learning: An Introduction, (1998)
- [2] Busoniu L., Babuska R., Schutter B.D., Et al., Reinforcement Learning and Dynamic Programming Using Function Approximators, (2010)
- [3] Lee D., Seo H., Jung M.W., Neural basis of reinforcement learning and decision making, Annual Review of Neuroscience, 35, 5, pp. 287-308, (2012)
- [4] Wiering M., Van O.M., Reinforcement learning: STATE-OF-THE-Art, (2014)
- [5] Sutton R.S., McAllester D.A., Singh S.P., Et al., Policy gradient methods for reinforcement learning with function approximation, NIPS, 99, pp. 1057-1063, (1999)
- [6] Peters J., Schaal S., Natural A.C., Neurocomputing, 71, 7-9, pp. 1180-1190, (2008)
- [7] Peters J., Vijayakumar S., Schaal S., Reinforcement learning for humanoid robotics, Autonomous Robot, 12, 1, pp. 1-20, (2003)
- [8] Van H.H., Reinforcement learning in continuous state and action spaces, Reinforcement Learning, pp. 207-251, (2012)
- [9] Wierstra D., Schaul T., Peters J., Et al., Natural evolution strategies, 2008 IEEE Congress on Evolutionary Computation (IEEE World Congress on Computational Intelligence), pp. 3381-3387, (2008)
- [10] Sun Y., Wierstra D., Schaul T., Et al., Efficient natural evolution strategies, The 11th Annual Conference on Genetic and Evolutionary Computation, pp. 539-546, (2009)