Adaptive Optimal Control of Unknown Constrained-Input Systems Using Policy Iteration and Neural Networks

Cited by: 361
Authors
Modares, Hamidreza [1]
Lewis, Frank L. [2]
Naghibi-Sistani, Mohammad-Bagher [1]
Affiliations
[1] Ferdowsi Univ Mashhad, Dept Elect Engn, Mashhad, Iran
[2] Univ Texas Arlington, Res Inst, Ft Worth, TX 76118 USA
Funding
National Science Foundation (USA);
Keywords
Input constraints; neural networks; optimal control; reinforcement learning; unknown dynamics; CONTINUOUS-TIME;
DOI
10.1109/TNNLS.2013.2276571
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper presents an online policy iteration (PI) algorithm to learn the continuous-time optimal control solution for unknown constrained-input systems. The proposed PI algorithm is implemented on an actor-critic structure where two neural networks (NNs) are tuned online and simultaneously to generate the optimal bounded control policy. The requirement of complete knowledge of the system dynamics is obviated by employing a novel NN identifier in conjunction with the actor and critic NNs. It is shown how the identifier weights estimation error affects the convergence of the critic NN. A novel learning rule is developed to guarantee that the identifier weights converge to small neighborhoods of their ideal values exponentially fast. To provide an easy-to-check persistence of excitation condition, the experience replay technique is used. That is, recorded past experiences are used simultaneously with current data for the adaptation of the identifier weights. Stability of the whole system consisting of the actor, critic, system state, and system identifier is guaranteed while all three networks undergo adaptation. Convergence to a near-optimal control law is also shown. The effectiveness of the proposed method is illustrated with a simulation example.
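The experience replay idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the paper's identifier or learning rule: it assumes a simple linear-in-parameters model `y = W @ phi(x)` updated in discrete steps, and the names `features`, `replay_update`, and `recorded_data_is_rich` are hypothetical. It shows only the general principle that a rank test on recorded regressors can stand in for persistence of excitation, and that reusing recorded samples lets the weights converge even when the current data is not exciting.

```python
import numpy as np

def features(x):
    # Hypothetical regressor vector (an assumption, not from the paper).
    return np.array([1.0, x, x * x])

def replay_update(W, phi_now, y_now, memory, lr=0.01):
    """One gradient step using the current sample plus all recorded samples."""
    grad = (W @ phi_now - y_now) * phi_now
    for phi_k, y_k in memory:
        grad = grad + (W @ phi_k - y_k) * phi_k
    return W - lr * grad

def recorded_data_is_rich(memory, n_params):
    """Easy-to-check richness test: stored regressors must span the parameter space."""
    if not memory:
        return False
    Phi = np.stack([phi for phi, _ in memory])
    return np.linalg.matrix_rank(Phi) == n_params

W_true = np.array([0.5, -1.0, 2.0])  # "ideal" weights, assumed for the demo

# Recorded past experiences collected at varied operating points.
memory = [(features(x), W_true @ features(x)) for x in np.linspace(-2.0, 2.0, 9)]

# Current data comes from a single constant state, so it is not exciting.
phi_now = features(0.3)
y_now = W_true @ phi_now

W_replay = np.zeros(3)
W_no_replay = np.zeros(3)
for _ in range(2000):
    W_replay = replay_update(W_replay, phi_now, y_now, memory)
    W_no_replay = replay_update(W_no_replay, phi_now, y_now, [])

print("rich recorded data:", recorded_data_is_rich(memory, 3))
print("with replay:   ", W_replay)
print("without replay:", W_no_replay)
```

In this toy setting the update without recorded data fits the single current sample but does not recover the assumed ideal weights, while the update with recorded data does.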
Pages: 1513 - 1525
Number of pages: 13
Related papers
50 items in total
  • [41] Recurrent neural networks control of dynamic systems with unknown input hysteresis
    Wang, XS
    Li, L
    Su, CY
    Hong, H
    PROCEEDINGS OF 2003 INTERNATIONAL CONFERENCE ON NEURAL NETWORKS & SIGNAL PROCESSING, PROCEEDINGS, VOLS 1 AND 2, 2003, : 297 - 300
  • [42] Adaptive Fuzzy Control for Nonlinear State Constrained Systems With Input Delay and Unknown Control Coefficients
    You, Fuqiang
    Chen, Nan
    Zhu, Zhu
    Cheng, Shiya
    Yang, Hongliang
    Jia, Mingxing
    IEEE ACCESS, 2019, 7 : 53718 - 53730
  • [43] Adaptive Optimal Control for a Class of Nonlinear Systems: The Online Policy Iteration Approach
    He, Shuping
    Fang, Haiyang
    Zhang, Maoguang
    Liu, Fei
    Ding, Zhengtao
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (02) : 549 - 558
  • [44] Online approximate solution of HJI equation for unknown constrained-input nonlinear continuous-time systems
    Yang, Xiong
    Liu, Derong
    Ma, Hongwen
    Xu, Yancai
    INFORMATION SCIENCES, 2016, 328 : 435 - 454
  • [45] Fuzzy neural adaptive tracking control of unknown chaotic systems with input saturation
    Lin, Da
    Wang, Xingyuan
    Yao, Yi
    NONLINEAR DYNAMICS, 2012, 67 (04) : 2889 - 2897
  • [46] Fuzzy neural adaptive tracking control of unknown chaotic systems with input saturation
    Da Lin
    Xingyuan Wang
    Yi Yao
    Nonlinear Dynamics, 2012, 67 : 2889 - 2897
  • [47] Adaptive Control of Uncertain Nonaffine Nonlinear Systems With Input Saturation Using Neural Networks
    Esfandiari, Kasra
    Abdollahi, Farzaneh
    Talebi, Heidar Ali
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 26 (10) : 2311 - 2322
  • [48] Reinforcement learning and neural networks for multi-agent nonzero-sum games of nonlinear constrained-input systems
    Sholeh Yasini
    Mohammad Bagher Naghibi Sitani
    Ali Kirampor
    International Journal of Machine Learning and Cybernetics, 2016, 7 : 967 - 980
  • [49] Reinforcement learning and neural networks for multi-agent nonzero-sum games of nonlinear constrained-input systems
    Yasini, Sholeh
    Sitani, Mohammad Bagher Naghibi
    Kirampor, Ali
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2016, 7 (06) : 967 - 980
  • [50] Robust adaptive tracking control for input and state constrained uncertain nonlinear systems with unknown control directions
    Wang, Chunxiao
    Qi, Lu
    Zhao, Yan
    Yu, Jiali
    INTERNATIONAL JOURNAL OF CONTROL, 2023, 96 (07) : 1681 - 1694