Adaptive actor-critic neural optimal control for constrained nonstrict feedback nonlinear systems via command filter

被引:4
|
作者
Hua, Yu [1 ,2 ]
Zhang, Tianping [1 ,2 ]
机构
[1] Yangzhou Univ, Coll Informat Engn, Dept Automat, Yangzhou, Peoples R China
[2] Yangzhou Univ, Coll Math Sci, Yangzhou, Peoples R China
基金
中国国家自然科学基金;
关键词
adaptive dynamic programming; command filter; dynamic surface control; mapping rule; optimal control; state-constrained; unmodeled dynamics; DYNAMIC SURFACE CONTROL; CONTINUOUS-TIME SYSTEMS; BACKSTEPPING CONTROL; STATE CONSTRAINTS; OUTPUT-FEEDBACK; CONTROL DESIGN; INPUT;
D O I
10.1002/rnc.6840
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The actor-critic neural optimal control is investigated for the state-constrained nonlinear systems in the nonstrict feedback form with unmodeled dynamics in this paper. The filtering errors in the traditional dynamic surface control (DSC) are countervailed by the introduced compensation signals. Two design phases together determine the input: the feedforward input design and the near optimal input design. In the feedforward input design, a mapping rule is established to keep all the states in the finite range, and a first-order adjunctive signal is designed to treat the unmodeled dynamics. In the near optimal input design, the cost function relying on the reconstructed error system is minimized by the near optimal input via adaptive dynamic programming (ADP). In the whole design processing, the unknown nonlinear uncertain parts are fitted by the radial basis function neural networks (RBFNNs). The stability analysis illustrates all the signals are bounded in the controlled system. Two simulation examples are employed to verify the theoretical findings.
引用
收藏
页码:8588 / 8614
页数:27
相关论文
共 50 条
  • [21] Optimal Elevator Group Control via Deep Asynchronous Actor-Critic Learning
    Wei, Qinglai
    Wang, Lingxiao
    Liu, Yu
    Polycarpou, Marios M.
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 31 (12) : 5245 - 5256
  • [22] Adaptive neural network asymptotic tracking control for nonstrict feedback stochastic nonlinear systems
    Liu, Yongchao
    Zhu, Qidan
    NEURAL NETWORKS, 2021, 143 : 283 - 290
  • [23] Adaptive Neural Control of MIMO Nonstrict-Feedback Nonlinear Systems With Time Delay
    Zhao, Xudong
    Yang, Haijiao
    Karimi, Hamid Reza
    Zhu, Yanzheng
    IEEE TRANSACTIONS ON CYBERNETICS, 2016, 46 (06) : 1337 - 1349
  • [24] Adaptive output-feedback fault-tolerant control for space manipulator via actor-critic learning
    Yin, Yuwan
    Ning, Xin
    Xia, Dongdong
    ADVANCES IN SPACE RESEARCH, 2025, 75 (04) : 3914 - 3932
  • [25] Adaptive Neural Tracking Control of Nonlinear Nonstrict-Feedback Systems With Unmodeled Dynamics
    Zhao, Yuzhuo
    Niu, Ben
    Wang, Huanqing
    Yang, Dong
    IEEE ACCESS, 2019, 7 : 90206 - 90214
  • [26] Hierarchical Sliding-Mode Surface-Based Adaptive Actor-Critic Optimal Control for Switched Nonlinear Systems With Unknown Perturbation
    Zhang, Haoyan
    Zhao, Xudong
    Wang, Huanqing
    Zong, Guangdeng
    Xu, Ning
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 35 (02) : 1559 - 1571
  • [27] Adaptive Traffic Signal Control Based on Graph Neural Networks and Dynamic Entropy-Constrained Soft Actor-Critic
    Jia, Xianguang
    Guo, Mengyi
    Lyu, Yingying
    Qu, Jie
    Li, Dong
    Guo, Fengxiang
    ELECTRONICS, 2024, 13 (23):
  • [28] Disturbance observer based actor-critic learning control for uncertain nonlinear systems
    Liang, Xianglong
    Yao, Zhikai
    Ge, Yaowen
    Yao, Jianyong
    CHINESE JOURNAL OF AERONAUTICS, 2023, 36 (11) : 271 - 280
  • [29] Sliding-mode surface-based adaptive actor-critic optimal control for switched nonlinear systems with average dwell time
    Zhang, Haoyan
    Wang, Huanqing
    Niu, Ben
    Zhang, Liang
    Ahmad, Adil M.
    INFORMATION SCIENCES, 2021, 580 : 756 - 774
  • [30] Adaptive output-feedback neural tracking control for a class of nonstrict-feedback nonlinear systems
    Yang, Haijiao
    Shi, Peng
    Zhao, Xudong
    Shi, Yan
    INFORMATION SCIENCES, 2016, 334 : 205 - 218