Adaptive Optimal Control of Unknown Constrained-Input Systems Using Policy Iteration and Neural Networks

被引:361
|
作者
Modares, Hamidreza [1 ]
Lewis, Frank L. [2 ]
Naghibi-Sistani, Mohammad-Bagher [1 ]
机构
[1] Ferdowsi Univ Mashhad, Dept Elect Engn, Mashhad, Iran
[2] Univ Texas Arlington, Res Inst, Ft Worth, TX 76118 USA
基金
美国国家科学基金会;
关键词
Input constraints; neural networks; optimal control; reinforcement learning; unknown dynamics; CONTINUOUS-TIME;
D O I
10.1109/TNNLS.2013.2276571
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents an online policy iteration (PI) algorithm to learn the continuous-time optimal control solution for unknown constrained-input systems. The proposed PI algorithm is implemented on an actor-critic structure where two neural networks (NNs) are tuned online and simultaneously to generate the optimal bounded control policy. The requirement of complete knowledge of the system dynamics is obviated by employing a novel NN identifier in conjunction with the actor and critic NNs. It is shown how the identifier weights estimation error affects the convergence of the critic NN. A novel learning rule is developed to guarantee that the identifier weights converge to small neighborhoods of their ideal values exponentially fast. To provide an easy-to-check persistence of excitation condition, the experience replay technique is used. That is, recorded past experiences are used simultaneously with current data for the adaptation of the identifier weights. Stability of the whole system consisting of the actor, critic, system state, and system identifier is guaranteed while all three networks undergo adaptation. Convergence to a near-optimal control law is also shown. The effectiveness of the proposed method is illustrated with a simulation example.
引用
收藏
页码:1513 / 1525
页数:13
相关论文
共 50 条
  • [21] Data-based robust adaptive control for a class of unknown nonlinear constrained-input systems via integral reinforcement learning
    Yang, Xiong
    Liu, Derong
    Luo, Biao
    Li, Chao
    INFORMATION SCIENCES, 2016, 369 : 731 - 747
  • [22] Value iteration with deep neural networks for optimal control of input-affine nonlinear systems
    Beppu H.
    Maruta I.
    Fujimoto K.
    SICE Journal of Control, Measurement, and System Integration, 2021, 14 (01) : 140 - 149
  • [23] Integral reinforcement learning based decentralized optimal tracking control of unknown nonlinear large-scale interconnected systems with constrained-input
    Liu, Chong
    Zhang, Huaguang
    Xiao, Geyang
    Sun, Shaoxin
    NEUROCOMPUTING, 2019, 323 : 1 - 11
  • [24] Optimal iterative control of unknown nonlinear systems using neural networks
    Wang, FL
    Li, MZ
    PROCEEDINGS OF THE 36TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-5, 1997, : 2201 - 2206
  • [25] Event-triggered-based integral reinforcement learning output feedback optimal control for partially unknown constrained-input nonlinear systems
    Zou, Haoming
    Zhang, Guoshan
    ASIAN JOURNAL OF CONTROL, 2023, 25 (05) : 3843 - 3858
  • [26] Optimal Control Laws for Nonlinear Oscillator Systems with Saturating Actuators using Neural Networks Based on Policy Iteration
    Xing, Shi
    Song, Ruizhuo
    PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2016, : 2294 - 2297
  • [27] Adaptive Critics for Decentralized Stabilization of Constrained-Input Nonlinear Interconnected Systems
    Yang, Xiong
    Zhou, Yingjiang
    Dong, Na
    Wei, Qinglai
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (07): : 4187 - 4199
  • [28] Adaptive neural network control for nonlinear state constrained systems with unknown dead-zones input
    Zhao, Wei
    Liu, Lei
    Liu, Yan-Jun
    AIMS MATHEMATICS, 2020, 5 (05): : 4065 - 4084
  • [29] Neural networks-based adaptive control of uncertain nonlinear systems with unknown input constraints
    Guo, Jian-lan
    Chen, Yu-qiang
    Lai, Guan-yu
    Liu, Hong-ling
    Tian, Yuan
    Al-Nabhan, Najla
    Wang, Jingjing
    Wang, Zhenhai
    JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2021, 15 (Suppl 1) : 167 - 167
  • [30] Optimal Guaranteed Cost Sliding Mode Control for Constrained-Input Nonlinear Systems With Matched and Unmatched Disturbances
    Zhang, Huaguang
    Qu, Qiuxia
    Xiao, Geyang
    Cui, Yang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2112 - 2126