共 17 条
[1]
Barles G.(1991)Convergence of approximation schemes for fully nonlinear second order equations Asymptotic Analysis 4 271-283
[2]
Souganidis P.(1995)Learning to act using real-time dynamic programming Artificial Intelligence 72 81-138
[3]
Barto A. G.(1983)Neuronlike adaptive elements that that can learn difficult control problems IEEE Trans. in Systems Man and Cybernetics 13 835-846
[4]
Bradtke S. J.(1977)An algorithm for finding best matches in logarithmic expected time ACM Transactions on Mathematical Software 3 209-226
[5]
Singh S. P.(2000)A study of reinforcement learning in the continuous case by the means of viscosity solutions Machine Learning 40 265-299
[6]
Barto A. G.(1982)A self-learning automaton with variable resolution for high precision assembly by industrial robots IEEE Trans. on Automatic Control 27 1109-1113
[7]
Sutton R. S.(1995)Temporal difference learning and td-gammon Communication of the ACM 38 58-68
[8]
Anderson C. W.(undefined)undefined undefined undefined undefined-undefined
[9]
Friedman J. H.(undefined)undefined undefined undefined undefined-undefined
[10]
Bentley J. L.(undefined)undefined undefined undefined undefined-undefined

