Event-triggered reinforcement learning control for the quadrotor UAV with actuator saturation

被引:42
|
作者
Lin, Xiaobo [1 ]
Liu, Jian [2 ]
Yu, Yao [1 ]
Sun, Changyin [2 ]
机构
[1] Univ Sci & Technol Beijing, Sch Automat & Elect Engn, Key Lab Knowledge Automat Ind Proc, Minist Educ, Beijing 100083, Peoples R China
[2] Southeast Univ, Sch Automat, Nanjing 210096, Peoples R China
基金
中国国家自然科学基金;
关键词
Quadrotor; UAV; Reinforcement learning; Flight control; Event-triggered control; Actuator saturation; TRACKING CONTROL;
D O I
10.1016/j.neucom.2020.07.042
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes an event-triggered reinforcement learning (RL) control strategy to stabilize the quadrotor unmanned aerial vehicle (UAV) with actuator saturation. As the quadrotor UAV equips with a complex dynamic is difficult to be model accurately, a model free reinforcement learning scheme is designed. Due to the practical limitation of actuators, the end of controller is constrained with a bounded function. In order to reduce the calculation consumption for the onboard computer, an event-triggered mechanism is developed, which only update the controller when the triggered condition is satisfied. The proposed controller is implemented with two neural networks which are called critic and actor. Some advanced RL technologies are utilized for speeding up the train process, e.g. off-policy training, experience replay, etc. The stability of closed-loop system is proved by the Lyapunov analysis. The simulation results including a stability task and a tracking task verify the theoretical analysis, in which we find the updating frequency of controller is decreased greatly. (C) 2020 Elsevier B.V. All rights reserved.
引用
收藏
页码:135 / 145
页数:11
相关论文
共 50 条
  • [21] Dynamic event-triggered control for linear systems subject to asymmetric actuator saturation
    Li, Hongchao
    Zhang, Nan
    Deng, Huimin
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2022, 44 (07) : 1440 - 1448
  • [22] Online Optimal Event-triggered Tracking Control with Actuator Saturation via ADP
    Liu, Lu
    Song, Ruizhuo
    Ren, Yi
    Zhao, Yong
    Song, Biao
    PROCEEDINGS OF 2020 IEEE 9TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS'20), 2020, : 123 - 127
  • [23] Event-triggered control for semi-global stabilisation of systems with actuator saturation
    Zhang, Liangyin
    Chen, Michael Z. Q.
    INTERNATIONAL JOURNAL OF CONTROL, 2016, 89 (05) : 1047 - 1064
  • [24] Event-triggered model predictive control of positive systems with random actuator saturation
    Junfeng Zhang
    Suhuan Zhang
    Peng Lin
    Nonlinear Dynamics, 2021, 105 : 417 - 437
  • [25] Compound learning based event-triggered adaptive attitude control for underwater gliders with actuator saturation and faults
    Gao, Jian
    Min, Boxu
    Chen, Yimin
    Jing, Anyan
    Wang, Jiarun
    Pan, Guang
    OCEAN ENGINEERING, 2023, 280
  • [26] Event-triggered model predictive control of positive systems with random actuator saturation
    Zhang, Junfeng
    Zhang, Suhuan
    Lin, Peng
    NONLINEAR DYNAMICS, 2021, 105 (01) : 417 - 437
  • [27] Output-Based Event-Triggered Control for Linear Systems with Actuator Saturation
    Quan, Zhenzhen
    Ma, Dan
    PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC), 2016, : 1500 - 1505
  • [28] Event-triggered finite-time adaptive neural network control for quadrotor UAV with input saturation and tracking error constraints
    Wang, Changhui
    Li, Wencheng
    Liang, Mei
    AEROSPACE SCIENCE AND TECHNOLOGY, 2024, 155
  • [29] Event-triggered Suboptimal Control Based Adaptive Reinforcement Learning
    Ma, Tengcong
    Song, Ruizhuo
    2021 7TH INTERNATIONAL CONFERENCE ON ROBOTICS AND ARTIFICIAL INTELLIGENCE, ICRAI 2021, 2021, : 64 - 69
  • [30] Event-triggered nonlinear control for attitude stabilization of a quadrotor
    Guerrero-Castellanos, J. F.
    Tellez-Guzman, J. J.
    Durand, S.
    Marchand, N.
    Alvarez-Munoz, J. U.
    2013 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS), 2013, : 584 - 591