Reinforcement Learning-Based Constrained Optimal Control of Strict-feedback Nonlinear Systems: Application to Autonomous Underwater Vehicles

被引:0
|
作者
Farzanegan, Behzad [1 ]
Jagannathan, S. [1 ]
机构
[1] Missouri Univ Sci & Technol, Dept Elec & Comp Engn, Rolla, MO 65409 USA
关键词
Autonomous vehicles; Lifelong learning; Optimal control; Control barrier function; Reinforcement learning;
D O I
10.1109/CCTA60707.2024.10666630
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper addresses a constrained neural network (NN)-based optimal tracking scheme for a class of uncertain nonlinear discrete-time systems in strict-feedback form by using a control barrier function (CBF). First, a modified barriertype cost function is introduced for each subsystem, guiding the actual system trajectory toward the safe set or desired trajectory while avoiding unwanted sets. To address the tracking problem, an augmented system is employed to convert the time-varying optimal tracking to a time-invariant optimal regulation. Then, an actor-critic framework is employed with the backstepping technique to obtain both virtual and actual optimal control policies for each subsystem to avoid the noncausality problem. Additionally, a novel online regularizer method is introduced to reduce catastrophic forgetting in multitasking scenarios by maintaining the significance of weight connections in the critic NN without directly computing the Fisher information matrix (FIM). Further, to guarantee safety during online learning, the actor update law incorporates the safety condition through the utilization of the CBF. Simulation results using underwater vehicles are carried out to verify the effectiveness of the proposed approach.
引用
收藏
页码:651 / 656
页数:6
相关论文
共 50 条
  • [1] Learning-Based Adaptive Optimal Tracking Control of Strict-Feedback Nonlinear Systems
    Gao, Weinan
    Jiang, Zhong-Ping
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 29 (06) : 2614 - 2624
  • [2] Reinforcement learning-based adaptive predefined-time optimal tracking control for strict-feedback nonlinear systems
    Chen, Yilin
    Pan, Yingnan
    Lu, Qing
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2024, 38 (02) : 492 - 512
  • [3] Deterministic learning-based neural control for output-constrained strict-feedback nonlinear systems
    Yang, Qinchen
    Zhang, Fukai
    Wang, Cong
    ISA TRANSACTIONS, 2023, 138 : 384 - 396
  • [4] Reinforcement learning-based optimized output feedback control of nonlinear strict-feedback systems with event sampled states
    Xin, Chun
    Li, Yuan-Xin
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2023, 37 (01) : 38 - 58
  • [5] Reinforcement learning-based optimized backstepping control for strict-feedback nonlinear systems subject to external disturbances
    Qin, Yan
    Cao, Liang
    Lu, Qing
    Pan, Yingnan
    OPTIMAL CONTROL APPLICATIONS & METHODS, 2023, 44 (05): : 2724 - 2743
  • [6] Learning-based Fuzzy Control for Strict-feedback Nonlinear Systems with Unknown Uncertainties
    Ma, Min
    Liu, Xiaokun
    Huang, He
    Wang, Tong
    Qiu, Jianbin
    39TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION, YAC 2024, 2024, : 2089 - 2093
  • [7] Adaptive reinforcement learning optimal tracking control for strict-feedback nonlinear systems with prescribed performance
    Huang, Zongsheng
    Bai, Weiwei
    Li, Tieshan
    Long, Yue
    Chen, C. L. Philip
    Liang, Hongjing
    Yang, Hanqing
    INFORMATION SCIENCES, 2023, 621 : 407 - 423
  • [8] Constrained iterative learning control of a class of strict-feedback systems
    Chen J.-Y.
    Sun M.-X.
    Kongzhi Lilun Yu Yingyong/Control Theory and Applications, 2021, 38 (05): : 561 - 570
  • [9] Finite-time adaptive optimal control of uncertain strict-feedback nonlinear systems based on fuzzy observer and reinforcement learning
    Sun, Yue
    Chen, Ming
    Peng, Kaixiang
    Wu, Libing
    Liu, Cungen
    INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 2024, 55 (08) : 1553 - 1570
  • [10] Distributed Fuzzy Optimal Consensus Control of State-Constrained Nonlinear Strict-Feedback Systems
    Wang, Wei
    Li, Yongming
    IEEE TRANSACTIONS ON CYBERNETICS, 2023, 53 (05) : 2914 - 2929