Adaptive dynamic programming-based optimal control of unknown nonaffine nonlinear discrete-time systems with proof of convergence

被引:53
|
作者
Zhang, Xin [1 ]
Zhang, Huaguang [1 ]
Sun, Qiuye [1 ]
Luo, Yanhong [1 ]
机构
[1] Northeastern Univ, Sch Informat Sci & Engn, Shenyang 110819, Liaoning, Peoples R China
基金
中国国家自然科学基金; 国家高技术研究发展计划(863计划);
关键词
Optimal control; Adaptive dynamic programming; Recurrent neural network; System identification; NEURAL-NETWORKS; OUTPUT DATA; IDENTIFICATION;
D O I
10.1016/j.neucom.2012.01.025
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, a novel neuro-optimal control scheme is proposed for unknown nonaffine nonlinear discrete-time systems by using adaptive dynamic programming (ADP) method. A neuro identifier is established by employing recurrent neural networks (RNNs) model to reconstruct the unknown system dynamics. The convergence of the identification error is proved by using the Lyapunov theory. Then based on the established RNN model, the ADP method is utilized to design the approximate optimal controller. Two neural networks (NNs) are used to implement the iterative algorithm. The convergence of the action NN error and weight estimation errors is demonstrated while considering the NN approximation errors. Finally, two numerical examples are used to demonstrate the effectiveness of the proposed control scheme. (C) 2012 Published by Elsevier B.V.
引用
收藏
页码:48 / 55
页数:8
相关论文
共 50 条
  • [31] Generalized Adaptive Dynamic Programming Algorithm for Discrete-Time Nonlinear Systems: Convergence and Stability Analysis
    Liu, Derong
    Wei, Qinglai
    2013 INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST), 2013, : 134 - 141
  • [32] MRAC for unknown discrete-time nonlinear systems based on supervised neural dynamic programming
    Fu, Hao
    Chen, Xin
    Wang, Wei
    Wu, Min
    NEUROCOMPUTING, 2020, 384 : 130 - 141
  • [33] A discrete-time multivariable neuro-adaptive control for nonlinear unknown dynamic systems
    Hwang, CL
    Lin, CH
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2000, 30 (06): : 865 - 877
  • [34] Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence
    Dierks, Travis
    Thumati, Balaje T.
    Jagannathan, S.
    NEURAL NETWORKS, 2009, 22 (5-6) : 851 - 860
  • [35] Adaptive dynamic programming-based stabilization of nonlinear systems with unknown actuator saturation
    Zhao, Bo
    Jia, Lihao
    Xia, Hongbing
    Li, Yuanchun
    NONLINEAR DYNAMICS, 2018, 93 (04) : 2089 - 2103
  • [36] Adaptive dynamic programming-based stabilization of nonlinear systems with unknown actuator saturation
    Bo Zhao
    Lihao Jia
    Hongbing Xia
    Yuanchun Li
    Nonlinear Dynamics, 2018, 93 : 2089 - 2103
  • [37] Twin Deterministic Policy Gradient Adaptive Dynamic Programming for Optimal Control of Affine Nonlinear Discrete-time Systems
    Xu, Jiahui
    Wang, Jingcheng
    Rao, Jun
    Zhong, Yanjiu
    Zhao, Shangwei
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2022, 20 (09) : 3098 - 3109
  • [38] Adaptive Dynamic Programming for Finite-Horizon Optimal Control of Discrete-Time Nonlinear Systems with ε-Error Bound
    Wang, Fei-Yue
    Jin, Ning
    Liu, Derong
    Wei, Qinglai
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2011, 22 (01): : 24 - 36
  • [39] Policy Optimization Adaptive Dynamic Programming for Optimal Control of Input-Affine Discrete-Time Nonlinear Systems
    Lin, Mingduo
    Zhao, Bo
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 53 (07): : 4339 - 4350
  • [40] Twin Deterministic Policy Gradient Adaptive Dynamic Programming for Optimal Control of Affine Nonlinear Discrete-time Systems
    Jiahui Xu
    Jingcheng Wang
    Jun Rao
    Yanjiu Zhong
    Shangwei Zhao
    International Journal of Control, Automation and Systems, 2022, 20 : 3098 - 3109