151 entries in total
- [1] Zhang H.G., Zhang X., Luo Y.H., Yang J., An overview of research on adaptive dynamic programming, Acta Automatica Sinica, 39, 4, pp. 303-311, (2013)
- [2] Liu D.R., Li H.L., Wang D., Data-based self-learning optimal control: research progress and prospects, Acta Automatica Sinica, 39, 11, pp. 1858-1870, (2013)
- [3] Werbos P., Beyond Regression: New Tools for Prediction and Analysis in the Behavioral Sciences, (1974)
- [4] Prokhorov D.V., Wunsch D.C., Adaptive critic designs, IEEE Transactions on Neural Networks, 8, 5, pp. 997-1007, (1997)
- [5] Padhi R., Unnikrishnan N., Wang X.H., Balakrishnan S.N., A single network adaptive critic (SNAC) architecture for optimal control synthesis for a class of nonlinear systems, Neural Networks, 19, 10, pp. 1648-1660, (2006)
- [6] Wang Y., O'Donoghue B., Boyd S., Approximate dynamic programming via iterated Bellman inequalities, International Journal of Robust and Nonlinear Control, 25, 10, pp. 1472-1496, (2015)
- [7] Bertsekas D.P., Tsitsiklis J.N., Neuro-dynamic programming: an overview, Proceedings of the 34th IEEE Conference on Decision and Control, pp. 560-564, (1995)
- [8] Zhu L.M., Modares H., Peen G.O., Lewis F.L., Yue B.Z., Adaptive suboptimal output-feedback control for linear systems using integral reinforcement learning, IEEE Transactions on Control Systems Technology, 23, 1, pp. 264-273, (2015)
- [9] Bhasin S., Reinforcement Learning and Optimal Control Methods for Uncertain Nonlinear Systems, (2011)
- [10] Vrabie D., Vamvoudakis K.G., Lewis F.L., Optimal Adaptive Control and Differential Games by Reinforcement Learning Principles, (2012)