Adaptive Fault-Tolerant Tracking Control for Affine Nonlinear Systems With Unknown Dynamics via Reinforcement Learning

被引：18

作者：

Roshanravan, Sajad ^{[1
]}

Shamaghdari, Saeed ^{[1
]}

机构：

[1] Iran Univ Sci & Technol IUST, Elect Engn Dept, Tehran 1311416846, Iran

来源：

IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING | 2024年 / 21卷 / 01期

关键词：

Fault detection; fault-tolerant tracking control; reinforcement learning; affine nonlinear systems; process and actuator faults;

D O I：

10.1109/TASE.2022.3223702

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper investigates the optimal fault-tolerant tracking control (FTTC) problem for unknown affine nonlinear continuous-time systems with process and actuator faults in the framework of reinforcement learning (RL). The proposed novel active FTTC scheme is based on adaptive optimal control theory. In this way, the FTTC problem is formulated as an optimal regulation problem for the augmented system, which consists of the controlled system and the reference trajectory. To solve the Hamilton-Jacobi-Bellman (HJB) equation of the augmented system, an identifier-critic-based online RL strategy is employed with a dual neural network (NN) approximation structure. Initially, in order to remove the requirement of prior knowledge of the system dynamics, an adaptive NN identifier is designed. The forgetting factor in the proposed identifier update law is variable and a function of the filtered state estimation error and filtered state error. Choosing this variable forgetting factor increases the convergence speed and decreases the estimation error of identifier NN weights compared to the constant one while maintaining its robustness. When a fault occurs, the system continues to operate under the former FTTC until the fault is detected. Meanwhile, the optimal FTTC design in the RL framework requires the initial admissible control condition. In order to make it possible to initiate the FTTC learning process from the former FTTC, we employed a stabilizing term in the critical update rule. The Uniformly Ultimately Boundedness (UUB) of identifier and critic NN weight errors and, as a result, the convergence of the control input to the neighborhood of the optimal solution are all proved by Lyapunov theory. In the proposed method, changes in the values of faults are detected by comparing the HJB error to a predefined threshold. Finally, the simulation results are given to validate the effectiveness of the developed method. Note to Practitioners-long-time operations and the influence of external perturbations often make the faults inevitable for many practical engineering systems which can lead to unpredictable behaviors and catastrophic impacts. In general, the faults are naturally uncertain in time, value, and pattern, that is, it is unknown when, how much, and which system components fail. Therefore, the control system must be able to tolerate an extensive set of component faults. The design of optimal model-free FTTC strategies in an adaptive manner is challenging in nonlinear systems. The proposed method is suitable for a large class of nonlinear systems with input-affine form, and guarantees the system stability in the presence of process and actuator faults.

引用

页码：569 / 580

页数：12

共 50 条

[31] Tracking Differentiator-Based Adaptive Fault-Tolerant Control for Stochastic Nonlinear Systems
Liu, Yanli
Ma, Hongjun
IEEE ACCESS, 2020, 8 : 72112 - 72120
[32] Adaptive Fuzzy Fault-Tolerant Control for Nonlinear Multi-Agent Systems with Unknown Control Direction
Wang, Dongyang
Li, Yongming
Tong, Shaocheng
PROCEEDINGS 2018 33RD YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2018, : 706 - 711
[33] Fuzzy Adaptive Fault-Tolerant Control of Unknown Nonlinear Systems With Time-Varying Structure
Zhang, Jin-Xi
Yang, Guang-Hong
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2019, 27 (10) : 1904 - 1916
[34] Adaptive fault-tolerant attitude tracking control for spacecraft formation with unknown inertia
Zhu, Zhihao
Guo, Yu
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2018, 32 (01) : 13 - 26
[35] Prescribed performance adaptive fault-tolerant tracking control for nonlinear time-delay systems with input quantization and unknown control directions
Wang, Cai-Cheng
Yang, Guang-Hong
NEUROCOMPUTING, 2018, 311 : 333 - 343
[36] Adaptive Fault-Tolerant Control of a Class of Nonlinear MIMO Systems
Zhang, Xiaodong
Polycarpou, Marios M.
Parisini, Thomas
47TH IEEE CONFERENCE ON DECISION AND CONTROL, 2008 (CDC 2008), 2008, : 398 - 403
[37] Fuzzy fault-tolerant containment control for multi-agent systems with unknown nonlinear dynamics
Sader, Malika
Wang, Fuyong
Liu, Zhongxin
Chen, Zengqiang
TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2021, 43 (16) : 3663 - 3671
[38] Neuroadaptive Fault-tolerant PI Control of Nonlinear Systems with Unknown Control Direction
Zhang, Yanan
Lai, Junfeng
Zhang, Zhirong
Tan, Shilei
2019 3RD INTERNATIONAL SYMPOSIUM ON AUTONOMOUS SYSTEMS (ISAS 2019), 2019, : 102 - 107
[39] Adaptive Reinforcement Learning for Fault-Tolerant Optimal Consensus Control of Nonlinear Canonical Multiagent Systems With Actuator Loss of Effectiveness
Zhu, Boyan
Zhang, Liang
Niu, Ben
Zhao, Ning
IEEE SYSTEMS JOURNAL, 2024, 18 (03): : 1681 - 1692
[40] Neuro-Adaptive Fault-Tolerant Tracking Control of Lagrange Systems Pursuing Targets With Unknown Trajectory
Song, Yongduan
Guo, Junxia
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2017, 64 (05) : 3913 - 3920

← 1 2 3 4 5 →