Research Progress on Learning-based Robust Adaptive Critic Control

Authors
Wang D. [1, 2]
Affiliations
[1] Faculty of Information Technology, Beijing University of Technology, Beijing
[2] Beijing Key Laboratory of Computational Intelligence and Intelligent System, Beijing
Funding
National Natural Science Foundation of China
Keywords
Adaptive critic control; Intelligent learning; Neural networks; Robust control; Uncertain systems;
DOI
10.16383/j.aas.c170701
Abstract
In the field of machine learning, the core technique of artificial intelligence, reinforcement learning is a class of strategies that focus on learning through the interaction between a machine and its environment. As an important branch of reinforcement learning, the adaptive critic technique is closely related to dynamic programming and optimization design. To effectively solve optimal control problems of complex dynamical systems, the adaptive dynamic programming approach was proposed by combining adaptive critics and dynamic programming with artificial neural networks, and it has attracted extensive attention. In particular, great progress has been made on robust adaptive critic control design for systems with uncertainties and disturbances. It is now regarded as a necessary route to constructing intelligent learning systems and achieving true brain-like intelligence. This paper presents a comprehensive survey of learning-based robust adaptive critic control theory and methods, including self-learning robust stabilization, adaptive trajectory tracking, event-driven robust control, and adaptive H∞ control design. It also covers a general analysis of adaptive critic systems in terms of stability, convergence, optimality, and robustness. In addition, considering novel techniques such as artificial intelligence, big data, deep learning, and knowledge automation, it discusses future prospects of robust adaptive critic control. Copyright © 2019 Acta Automatica Sinica. All rights reserved.
Pages: 1031-1043
Page count: 12
Related papers
110 in total
  • [51] Adhyaru D.M., Kar I.N., Gopal M., Fixed final time optimal control approach for bounded robust controller design using Hamilton-Jacobi-Bellman solution, IET Control Theory & Applications, 3, 9, pp. 1183-1195, (2009)
  • [52] Adhyaru D.M., Kar I.N., Gopal M., Bounded robust control of nonlinear systems using neural network-based HJB solution, Neural Computing & Applications, 20, 1, pp. 91-103, (2011)
  • [53] Wang D., Liu D.R., Li H.L., Policy iteration algorithm for online design of robust control for a class of continuous-time nonlinear systems, IEEE Transactions on Automation Science and Engineering, 11, 2, pp. 627-632, (2014)
  • [54] Wang D., Liu D.R., Li H.L., Ma H.W., Neural-network-based robust optimal control design for a class of uncertain nonlinear systems via adaptive dynamic programming, Information Sciences, 282, pp. 167-179, (2014)
  • [55] Wang D., Liu D.R., Zhang Q.C., Zhao D.B., Data-based adaptive critic designs for nonlinear robust optimal control with uncertain dynamics, IEEE Transactions on Systems, Man, and Cybernetics: Systems, 46, 11, pp. 1544-1555, (2016)
  • [56] Liu D.R., Yang X., Wang D., Wei Q.L., Reinforcement-learning-based robust controller design for continuous-time uncertain nonlinear systems subject to input constraints, IEEE Transactions on Cybernetics, 45, 7, pp. 1372-1385, (2015)
  • [57] Wang D., Liu D.R., Li H.L., Luo B., Ma H.W., An approximate optimal control approach for robust stabilization of a class of discrete-time nonlinear systems with uncertainties, IEEE Transactions on Systems, Man, and Cybernetics: Systems, 46, 5, pp. 713-717, (2016)
  • [58] Wang D., Adaptation-oriented near-optimal control and robust synthesis of an overhead crane system, Proceedings of the 2017 International Conference on Neural Information Processing, pp. 42-50, (2017)
  • [59] Zhong X.N., He H.B., Prokhorov D.V., Robust controller design of continuous-time nonlinear system using neural network, Proceedings of the 2013 International Joint Conference on Neural Networks, pp. 1-8, (2013)
  • [60] Sun J.L., Liu C.S., Ye Q., Robust differential game guidance laws design for uncertain interceptor-target engagement via adaptive dynamic programming, International Journal of Control, 90, 5, pp. 990-1004, (2017)