Composite Observer-Based Optimal Attitude-Tracking Control With Reinforcement Learning for Hypersonic Vehicles

被引:30
|
作者
Zhao, Shangwei [1 ,2 ]
Wang, Jingcheng [1 ,2 ]
Xu, Haotian [3 ]
Wang, Bohui [4 ,5 ]
机构
[1] Shanghai Jiao Tong Univ, Dept Automat, Minist Educ China, Shanghai 200240, Peoples R China
[2] Shanghai Jiao Tong Univ, Key Lab Syst Control & Informat Proc, Minist Educ China, Shanghai 200240, Peoples R China
[3] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Peoples R China
[4] Xi An Jiao Tong Univ, Sch Cyber Sci & Engn, Xian 710049, Peoples R China
[5] Xidian Univ, Sch Aerosp Sci & Technol, Xian 710071, Peoples R China
基金
中国国家自然科学基金;
关键词
Hypersonic vehicles; Nonlinear dynamical systems; Optimal control; Observers; Attitude control; Aerodynamics; Vehicle dynamics; Attitude-tracking control; near-optimal control; observer design; reinforcement learning (RL); ROBUST OPTIMAL-CONTROL; NONLINEAR-SYSTEMS; EXPERIENCE REPLAY; NEURAL-NETWORK; DESIGN;
D O I
10.1109/TCYB.2022.3192871
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This article proposes an observer-based reinforcement learning (RL) control approach to address the optimal attitude-tracking problem and application for hypersonic vehicles in the reentry phase. Due to the unknown uncertainty and nonlinearity caused by parameter perturbation and external disturbance, accurate model information of hypersonic vehicles in the reentry phase is generally unavailable. For this reason, a novel synchronous estimation is proposed to construct a composite observer for hypersonic vehicles, which consists of a neural-network (NN)-based Luenberger-type observer and a synchronous disturbance observer. This solves the identification problem of nonlinear dynamics in the reference control and realizes the estimation of the system state when unknown nonlinear dynamics and unknown disturbance exist at the same time. By synthesizing the information from the composite observer, an RL tracking controller is developed to solve the optimal attitude-tracking control problem. To improve the convergence performance of critic network weights, concurrent learning is employed to replace the traditional persistent excitation condition with a historical experience replay manner. In addition, this article proves that the weight estimation error is bounded when the learning rate satisfies the given sufficient condition. Finally, the numerical simulation demonstrates the effectiveness and superiority of the proposed approaches to attitude-tracking control systems for hypersonic vehicles.
引用
收藏
页码:913 / 926
页数:14
相关论文
共 50 条
  • [41] High-Order Disturbance Observer-Based Attitude Control with Prescribed Performance for Hypersonic Vehicle
    Yang, GuangHui
    Wang, Xinming
    2021 PROCEEDINGS OF THE 40TH CHINESE CONTROL CONFERENCE (CCC), 2021, : 7766 - 7771
  • [42] Composite Adaptive Attitude-Tracking Control With Parameter Convergence Under Finite Excitation
    Dong, Hongyang
    Hu, Qinglei
    Akella, Maruthi R.
    Yang, Haoyang
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2020, 28 (06) : 2657 - 2664
  • [43] Observer-based region tracking control for underwater vehicles without velocity measurement
    Liu, Xing
    Zhang, Mingjun
    Yao, Feng
    Chu, Zhenzhong
    NONLINEAR DYNAMICS, 2022, 108 (04) : 3543 - 3560
  • [44] Fuzzy Observer-Based Transitional Path-Tracking Control for Autonomous Vehicles
    Hu, Chuan
    Chen, Yimin
    Wang, Junmin
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 22 (05) : 3078 - 3088
  • [45] Observer-based region tracking control for underwater vehicles without velocity measurement
    Xing Liu
    Mingjun Zhang
    Feng Yao
    Zhenzhong Chu
    Nonlinear Dynamics, 2022, 108 : 3543 - 3560
  • [46] Observer-Based Reinforcement Learning Control for Electric Servo Mechanisms With Disturbance
    Harbin Institute Of Technology, Control And Simulation Center, Harbin, China
    不详
    Proc. Chin. Control Decis. Conf., CCDC, (3607-3612):
  • [47] Observer-Based Deep Reinforcement Learning for Robust Missile Guidance and Control
    Wang, Wenwen
    Chen, Zhihua
    IEEE ACCESS, 2025, 13 : 32769 - 32780
  • [48] OBSERVER-BASED ATTITUDE CONTROL WITH MEASUREMENT UNCERTAINTIES
    Gui, Haichao
    Dang, Qingqing
    ASTRODYNAMICS 2018, PTS I-IV, 2019, 167 : 2715 - 2734
  • [49] Disturbance Observer-based Attitude Control for a Quadrotor
    Zhao, Yongsheng
    Cao, Yabo
    Fan, Yunsheng
    2017 4TH INTERNATIONAL CONFERENCE ON INFORMATION, CYBERNETICS AND COMPUTATIONAL SOCIAL SYSTEMS (ICCSS), 2017, : 355 - 360
  • [50] Online Observer-Based Inverse Reinforcement Learning
    Self, Ryan
    Coleman, Kevin
    Bai, He
    Kamalapurkar, Rushikesh
    2021 AMERICAN CONTROL CONFERENCE (ACC), 2021, : 1959 - 1964