Neural-network-observer-based optimal control for unknown nonlinear systems using adaptive dynamic programming

被引:118
|
作者
Liu, Derong [1 ]
Huang, Yuzhu [1 ]
Wang, Ding [1 ]
Wei, Qinglai [1 ]
机构
[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
基金
中国国家自然科学基金;
关键词
nonlinear observer; adaptive dynamic programming; neural network; uniformly ultimately bounded; nonlinear system; OPTIMAL TRACKING CONTROL; DISCRETE-TIME-SYSTEMS; CONTROL SCHEME; DESIGN;
D O I
10.1080/00207179.2013.790562
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, an observer-based optimal control scheme is developed for unknown nonlinear systems using adaptive dynamic programming (ADP) algorithm. First, a neural-network (NN) observer is designed to estimate system states. Then, based on the observed states, a neuro-controller is constructed via ADP method to obtain the optimal control. In this design, two NN structures are used: a three-layer NN is used to construct the observer which can be applied to systems with higher degrees of nonlinearity and without a priori knowledge of system dynamics, and a critic NN is employed to approximate the value function. The optimal control law is computed using the critic NN and the observer NN. Uniform ultimate boundedness of the closed-loop system is guaranteed. The actor, critic, and observer structures are all implemented in real-time, continuously and simultaneously. Finally, simulation results are presented to demonstrate the effectiveness of the proposed control scheme.
引用
收藏
页码:1554 / 1566
页数:13
相关论文
共 50 条
  • [1] Adaptive Neural Network Observer-Based Fault Tolerant Control for Partially Unknown Nonlinear Systems via Adaptive Dynamic Programming
    Zeng, Chujian
    Zhao, Bo
    Liu, Derong
    Proceedings of 13th IEEE International Conference on CYBER Technology in Automation, Control, and Intelligent Systems, CYBER 2023, 2023, : 212 - 216
  • [2] Neural network observer-based optimal control for unknown nonlinear systems with control constraints
    Huang, Yuzhu
    Jiang, Hongde
    2015 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2015,
  • [3] Diagonal recurrent neural network observer-based adaptive control for unknown nonlinear systems
    Elkenawy, Ahmed
    El-Nagar, Ahmad M.
    El-Bardini, Mohammad
    El-Rabaie, Nabila M.
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2020, 42 (15) : 2833 - 2856
  • [4] A novel optimal control design for unknown nonlinear systems based on adaptive dynamic programming and nonlinear model predictive control
    Hu, Wei
    Zhang, Guoshan
    Zheng, Yuqing
    ASIAN JOURNAL OF CONTROL, 2022, 24 (04) : 1638 - 1649
  • [5] Neural network-based direct adaptive robust control of unknown MIMO nonlinear systems using state observer
    Cheng, Cheng
    Liu, Songyong
    Wu, Hongzhuang
    Zhang, Ying
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2020, 34 (01) : 1 - 14
  • [6] Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming
    Wang, Ding
    Liu, Derong
    Wei, Qinglai
    Zhao, Dongbin
    Jin, Ning
    AUTOMATICA, 2012, 48 (08) : 1825 - 1832
  • [7] Robust Optimal Control for a Class of Nonlinear Dynamic Systems Using Single Network Adaptive Dynamic Programming
    Wang Ding
    Liu Derong
    Li Hongliang
    Yang Xiong
    2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 8711 - 8716
  • [8] Multivariable adaptive control of nonlinear unknown dynamic systems using recurrent neural-network
    Hwang, Chih-Lyang
    2006 IEEE INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORK PROCEEDINGS, VOLS 1-10, 2006, : 5006 - 5011
  • [9] Disturbance observer-based control with adaptive neural network for unknown nonlinear system
    Hu, Jing
    Zheng, Meirong
    MEASUREMENT & CONTROL, 2023, 56 (1-2): : 287 - 294
  • [10] Online optimal control of unknown discrete-time nonlinear systems by using time-based adaptive dynamic programming
    Xiao, Geyang
    Zhang, Huaguang
    Luo, Yanhong
    NEUROCOMPUTING, 2015, 165 : 163 - 170