Adaptive Fault-Tolerant Tracking Control for Affine Nonlinear Systems With Unknown Dynamics via Reinforcement Learning

Cited by: 18
Authors
Roshanravan, Sajad [1]
Shamaghdari, Saeed [1]
Affiliations
[1] Iran Univ Sci & Technol IUST, Elect Engn Dept, Tehran 1311416846, Iran
Keywords
Fault detection; fault-tolerant tracking control; reinforcement learning; affine nonlinear systems; process and actuator faults
DOI
10.1109/TASE.2022.3223702
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
This paper investigates the optimal fault-tolerant tracking control (FTTC) problem for unknown affine nonlinear continuous-time systems with process and actuator faults in the framework of reinforcement learning (RL). The proposed active FTTC scheme is based on adaptive optimal control theory. The FTTC problem is formulated as an optimal regulation problem for the augmented system, which consists of the controlled system and the reference trajectory. To solve the Hamilton-Jacobi-Bellman (HJB) equation of the augmented system, an identifier-critic-based online RL strategy is employed with a dual neural network (NN) approximation structure. First, to remove the requirement for prior knowledge of the system dynamics, an adaptive NN identifier is designed. The forgetting factor in the proposed identifier update law is variable and is a function of the filtered state estimation error and the filtered state error. Compared with a constant forgetting factor, this variable forgetting factor increases the convergence speed and decreases the estimation error of the identifier NN weights while maintaining robustness. When a fault occurs, the system continues to operate under the former FTTC until the fault is detected. Meanwhile, optimal FTTC design in the RL framework requires an initial admissible control. To make it possible to initiate the FTTC learning process from the former FTTC, a stabilizing term is employed in the critic update rule. The uniform ultimate boundedness (UUB) of the identifier and critic NN weight errors and, as a result, the convergence of the control input to a neighborhood of the optimal solution are proved by Lyapunov theory. In the proposed method, changes in the values of faults are detected by comparing the HJB error to a predefined threshold. Finally, simulation results are given to validate the effectiveness of the developed method.
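For orientation, the augmented-system formulation mentioned above is commonly written as follows in the RL tracking-control literature. This is a sketch under assumptions, not the paper's own equations: the reference generator h, the weights Q and R, and the discount factor γ are illustrative symbols that do not appear in the abstract, and the fault terms are omitted.

```latex
% Assumed plant and reference generator (illustrative symbols, not from the abstract)
\dot{x} = f(x) + g(x)\,u, \qquad \dot{r} = h(r), \qquad e = x - r

% Augmented state and dynamics
X = \begin{bmatrix} e \\ r \end{bmatrix}, \qquad
\dot{X} = F(X) + G(X)\,u, \qquad
F(X) = \begin{bmatrix} f(e + r) - h(r) \\ h(r) \end{bmatrix}, \qquad
G(X) = \begin{bmatrix} g(e + r) \\ 0 \end{bmatrix}

% Discounted cost on the augmented state, with Q_T = diag(Q, 0)
V(X(t)) = \int_{t}^{\infty} e^{-\gamma(\tau - t)}
          \left( X^{\top} Q_T X + u^{\top} R\, u \right) d\tau

% Resulting HJB equation and optimal control
0 = X^{\top} Q_T X + u^{*\top} R\, u^{*} - \gamma V^{*}
    + \nabla V^{*\top} \left( F(X) + G(X)\, u^{*} \right), \qquad
u^{*} = -\tfrac{1}{2} R^{-1} G(X)^{\top} \nabla V^{*}
```

With the tracking error and the reference stacked into one state, driving the cost to its minimum regulates e toward zero, which is why the tracking problem can be treated as a regulation problem.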
Note to Practitioners: Long-time operation and the influence of external perturbations often make faults inevitable in many practical engineering systems, which can lead to unpredictable behavior and catastrophic consequences. In general, faults are uncertain in time, value, and pattern; that is, it is unknown when, by how much, and which system components fail. Therefore, the control system must be able to tolerate an extensive set of component faults. Designing optimal model-free FTTC strategies in an adaptive manner is challenging for nonlinear systems. The proposed method is suitable for a large class of nonlinear systems in input-affine form and guarantees system stability in the presence of process and actuator faults.
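The detection rule stated in the abstract (comparing the HJB error to a predefined threshold) can be illustrated with a minimal sketch. The function name `detect_fault`, the sampled-error input, and the `hold` debounce parameter are hypothetical choices made here for illustration; the abstract does not specify how the comparison is implemented.

```python
from typing import Optional, Sequence


def detect_fault(hjb_errors: Sequence[float],
                 threshold: float,
                 hold: int = 1) -> Optional[int]:
    """Return the first sample index at which |HJB error| has exceeded
    `threshold` for `hold` consecutive samples, or None if it never does.

    `hold` is an assumed debounce parameter (not from the source) that
    suppresses isolated spikes; hold=1 is a plain threshold test.
    """
    if hold < 1:
        raise ValueError("hold must be at least 1")
    run = 0  # number of consecutive samples above the threshold
    for k, err in enumerate(hjb_errors):
        run = run + 1 if abs(err) > threshold else 0
        if run >= hold:
            return k
    return None


# Illustrative data only: the residual jumps at sample 2.
errors = [0.01, 0.02, 0.5, 0.6, 0.7, 0.1]
print(detect_fault(errors, threshold=0.2))          # plain threshold test
print(detect_fault(errors, threshold=0.2, hold=3))  # require 3 samples in a row
```

A larger `hold` trades detection delay for fewer false alarms, which is one reason a practical detector might not react to a single sample.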
Pages: 569 - 580
Page count: 12
Related papers
50 in total
  • [31] Tracking Differentiator-Based Adaptive Fault-Tolerant Control for Stochastic Nonlinear Systems
    Liu, Yanli
    Ma, Hongjun
    IEEE ACCESS, 2020, 8 : 72112 - 72120
  • [32] Adaptive Fuzzy Fault-Tolerant Control for Nonlinear Multi-Agent Systems with Unknown Control Direction
    Wang, Dongyang
    Li, Yongming
    Tong, Shaocheng
    PROCEEDINGS 2018 33RD YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2018, : 706 - 711
  • [33] Fuzzy Adaptive Fault-Tolerant Control of Unknown Nonlinear Systems With Time-Varying Structure
    Zhang, Jin-Xi
    Yang, Guang-Hong
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2019, 27 (10) : 1904 - 1916
  • [34] Adaptive fault-tolerant attitude tracking control for spacecraft formation with unknown inertia
    Zhu, Zhihao
    Guo, Yu
    INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2018, 32 (01) : 13 - 26
  • [35] Prescribed performance adaptive fault-tolerant tracking control for nonlinear time-delay systems with input quantization and unknown control directions
    Wang, Cai-Cheng
    Yang, Guang-Hong
    NEUROCOMPUTING, 2018, 311 : 333 - 343
  • [36] Adaptive Fault-Tolerant Control of a Class of Nonlinear MIMO Systems
    Zhang, Xiaodong
    Polycarpou, Marios M.
    Parisini, Thomas
    47TH IEEE CONFERENCE ON DECISION AND CONTROL, 2008 (CDC 2008), 2008, : 398 - 403
  • [37] Fuzzy fault-tolerant containment control for multi-agent systems with unknown nonlinear dynamics
    Sader, Malika
    Wang, Fuyong
    Liu, Zhongxin
    Chen, Zengqiang
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2021, 43 (16) : 3663 - 3671
  • [38] Neuroadaptive Fault-tolerant PI Control of Nonlinear Systems with Unknown Control Direction
    Zhang, Yanan
    Lai, Junfeng
    Zhang, Zhirong
    Tan, Shilei
    2019 3RD INTERNATIONAL SYMPOSIUM ON AUTONOMOUS SYSTEMS (ISAS 2019), 2019, : 102 - 107
  • [39] Adaptive Reinforcement Learning for Fault-Tolerant Optimal Consensus Control of Nonlinear Canonical Multiagent Systems With Actuator Loss of Effectiveness
    Zhu, Boyan
    Zhang, Liang
    Niu, Ben
    Zhao, Ning
    IEEE SYSTEMS JOURNAL, 2024, 18 (03) : 1681 - 1692
  • [40] Neuro-Adaptive Fault-Tolerant Tracking Control of Lagrange Systems Pursuing Targets With Unknown Trajectory
    Song, Yongduan
    Guo, Junxia
    IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2017, 64 (05) : 3913 - 3920