Composite Observer-Based Optimal Attitude-Tracking Control With Reinforcement Learning for Hypersonic Vehicles

被引:30
|
作者
Zhao, Shangwei [1 ,2 ]
Wang, Jingcheng [1 ,2 ]
Xu, Haotian [3 ]
Wang, Bohui [4 ,5 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Automat, Minist Educ China, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Key Lab Syst Control & Informat Proc, Minist Educ China, Shanghai 200240, Peoples R China
[3] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China
[4] Xi An Jiao Tong Univ, Sch Cyber Sci & Engn, Xian 710049, Peoples R China
[5] Xidian Univ, Sch Aerosp Sci & Technol, Xian 710071, Peoples R China
基金
中国国家自然科学基金;
关键词
Hypersonic vehicles; Nonlinear dynamical systems; Optimal control; Observers; Attitude control; Aerodynamics; Vehicle dynamics; Attitude-tracking control; near-optimal control; observer design; reinforcement learning (RL); ROBUST OPTIMAL-CONTROL; NONLINEAR-SYSTEMS; EXPERIENCE REPLAY; NEURAL-NETWORK; DESIGN;
D O I
10.1109/TCYB.2022.3192871
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article proposes an observer-based reinforcement learning (RL) control approach to address the optimal attitude-tracking problem and application for hypersonic vehicles in the reentry phase. Due to the unknown uncertainty and nonlinearity caused by parameter perturbation and external disturbance, accurate model information of hypersonic vehicles in the reentry phase is generally unavailable. For this reason, a novel synchronous estimation is proposed to construct a composite observer for hypersonic vehicles, which consists of a neural-network (NN)-based Luenberger-type observer and a synchronous disturbance observer. This solves the identification problem of nonlinear dynamics in the reference control and realizes the estimation of the system state when unknown nonlinear dynamics and unknown disturbance exist at the same time. By synthesizing the information from the composite observer, an RL tracking controller is developed to solve the optimal attitude-tracking control problem. To improve the convergence performance of critic network weights, concurrent learning is employed to replace the traditional persistent excitation condition with a historical experience replay manner. In addition, this article proves that the weight estimation error is bounded when the learning rate satisfies the given sufficient condition. Finally, the numerical simulation demonstrates the effectiveness and superiority of the proposed approaches to attitude-tracking control systems for hypersonic vehicles.
引用
收藏
页码:913 / 926
页数:14
相关论文
共 50 条
  • [21] Finite-Time Extended State Observer-Based Attitude Control for Hypersonic Vehicles with Angle-of-Attack Constraint
    Lu, Qingli
    Sun, Ruisheng
    Lu, Yu
    Liu, Xuanting
    MATHEMATICS, 2024, 12 (07)
  • [22] Predictive Sliding Mode Control for Attitude Tracking of Hypersonic Vehicles Using Fuzzy Disturbance Observer
    Cheng, Xianlei
    Tang, Guojian
    Wang, Peng
    Liu, Luhua
    MATHEMATICAL PROBLEMS IN ENGINEERING, 2015, 2015
  • [23] Disturbance observer-based quadrotor attitude tracking control for aggressive maneuvers
    Castillo, Alberto
    Sanz, Ricardo
    Garcia, Pedro
    Qiu, Wei
    Wang, Hongda
    Xu, Chao
    CONTROL ENGINEERING PRACTICE, 2019, 82 : 14 - 23
  • [24] Observer-based optimal tracking control for linear systems with control delay
    College of Information Science and Engineering, Ocean University of China, Qingdao 266071, China
    Dianji yu Kongzhi Xuebao, 2007, 3 (271-274+281):
  • [25] Observer-based Optimal Control for Dual Motor Tracking and Synchronization
    Ding, Jiacheng
    Wang, Shubo
    Chu, Kongqing
    2024 14TH ASIAN CONTROL CONFERENCE, ASCC 2024, 2024, : 1796 - 1801
  • [26] Reentry attitude tracking control for hypersonic vehicles with RCS
    Cheng, Xianlei
    Wang, Peng
    Tang, Guojian
    PROCEEDINGS OF THE 36TH CHINESE CONTROL CONFERENCE (CCC 2017), 2017, : 11447 - 11452
  • [27] Safe Reinforcement Learning-Based Robust Approximate Optimal Control for Hypersonic Flight Vehicles
    Shi, Lei
    Wang, Xuesong
    Cheng, Yuhu
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (09) : 11401 - 11414
  • [28] Observer-Based Optimal Backstepping Security Control for Nonlinear Systems Using Reinforcement Learning Strategy
    Wei, Qinglai
    Chen, Wendi
    Tan, Xiangmin
    Xiao, Jun
    Dong, Qi
    IEEE TRANSACTIONS ON CYBERNETICS, 2024, 54 (11) : 7011 - 7023
  • [29] Disturbance observer-based fixed-time control for hypersonic morphing vehicles with uncertainties
    Zhang, H.
    Wang, P.
    Tang, G.
    Bao, W.
    AERONAUTICAL JOURNAL, 2024, 128 (1326): : 1844 - 1874
  • [30] Disturbance observer-based fixed-time control for hypersonic morphing vehicles with uncertainties
    Zhang, H.
    Wang, P.
    Tang, G.
    Bao, W.
    Aeronautical Journal, 2024,