Reinforcement Learning-Based Constrained Optimal Control of Strict-feedback Nonlinear Systems: Application to Autonomous Underwater Vehicles

Cited by: 0

Authors:

Farzanegan, Behzad [1]

Jagannathan, S. [1]

Affiliation:

[1] Missouri Univ Sci & Technol, Dept Elec & Comp Engn, Rolla, MO 65409 USA

Source:

2024 IEEE CONFERENCE ON CONTROL TECHNOLOGY AND APPLICATIONS, CCTA 2024 | 2024

Keywords:

Autonomous vehicles; Lifelong learning; Optimal control; Control barrier function; Reinforcement learning;

DOI:

10.1109/CCTA60707.2024.10666630

Chinese Library Classification (CLC) number:

TP [Automation technology, computer technology];

Discipline classification code:

0812;

Abstract:

This paper presents a constrained neural network (NN)-based optimal tracking scheme for a class of uncertain nonlinear discrete-time systems in strict-feedback form, using a control barrier function (CBF). First, a modified barrier-type cost function is introduced for each subsystem, guiding the actual system trajectory toward the safe set or desired trajectory while avoiding unwanted sets. To address the tracking problem, an augmented system is employed to convert the time-varying optimal tracking problem into a time-invariant optimal regulation problem. Then, an actor-critic framework is combined with the backstepping technique to obtain both virtual and actual optimal control policies for each subsystem and to avoid the noncausality problem. Additionally, a novel online regularizer method is introduced to reduce catastrophic forgetting in multitasking scenarios by maintaining the significance of weight connections in the critic NN without directly computing the Fisher information matrix (FIM). Further, to guarantee safety during online learning, the actor update law incorporates the safety condition through the CBF. Simulations of underwater vehicles are carried out to verify the effectiveness of the proposed approach.
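As a rough illustration of the kind of safety condition a CBF imposes in discrete time, the sketch below checks the commonly used condition h(x_{k+1}) - h(x_k) >= -gamma * h(x_k), with 0 < gamma <= 1, and filters a nominal control accordingly. It is not the paper's actor update law: the scalar dynamics, the barrier function `h`, the value of `gamma`, and the grid search over candidate controls are all assumptions chosen only to keep the example small.

```python
import numpy as np

def h(x, x_max=1.0):
    """Assumed barrier function: the safe set is {x : h(x) >= 0}, i.e. |x| <= x_max."""
    return x_max**2 - x**2

def step(x, u, dt=0.1):
    """Hypothetical scalar discrete-time dynamics x_{k+1} = x_k + dt * u_k."""
    return x + dt * u

def satisfies_dcbf(x, u, gamma=0.5):
    """Discrete-time CBF condition: h(x_{k+1}) - h(x_k) >= -gamma * h(x_k)."""
    return h(step(x, u)) - h(x) >= -gamma * h(x)

def safe_control(x, u_nominal, gamma=0.5, candidates=None):
    """Return the candidate control closest to u_nominal that meets the condition.

    A grid search stands in for the optimization a real controller would use.
    """
    if candidates is None:
        candidates = np.linspace(-5.0, 5.0, 201)
    feasible = [u for u in candidates if satisfies_dcbf(x, u, gamma)]
    if not feasible:
        raise ValueError("no candidate control satisfies the CBF condition")
    return min(feasible, key=lambda u: abs(u - u_nominal))

if __name__ == "__main__":
    x = 0.0
    for k in range(20):
        u = safe_control(x, u_nominal=5.0)  # nominal input pushes toward the boundary
        x = step(x, u)
        print(k, round(float(u), 3), round(float(x), 4), round(float(h(x)), 4))
```

Because the condition forces h(x_{k+1}) >= (1 - gamma) * h(x_k), a state that starts inside the safe set stays inside it as long as a feasible control exists; in this toy setup the filter leaves the nominal input untouched far from the boundary and reduces it as the state approaches |x| = 1.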

Pages: 651-656

Page count: 6
