Automatic Landing Control for Fixed-Wing UAV in Longitudinal Channel Based on Deep Reinforcement Learning

被引:2
|
作者
Li, Jinghang [1 ]
Xu, Shuting [1 ]
Wu, Yu [2 ,3 ]
Zhang, Zhe [4 ]
机构
[1] Beijing Forestry Univ, Coll Engn, Beijing 100083, Peoples R China
[2] Chongqing Univ, Coll Aerosp Engn, Chongqing 400044, Peoples R China
[3] Nanyang Technol Univ, Sch Mech & Aerosp Engn, Singapore 639798, Singapore
[4] Beijing Inst Technol, Sch Aerosp Engn, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
fixed-wing UAV; automatic landing control; parameter tuning; deep reinforcement learning; Deep Q-learning Network (DQN); FUTURE;
D O I
10.3390/drones8100568
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
The objective is to address the control problem associated with the landing process of unmanned aerial vehicles (UAVs), with a particular focus on fixed-wing UAVs. The Proportional-Integral-Derivative (PID) controller is a widely used control method, which requires the tuning of its parameters to account for the specific characteristics of the landing environment and the potential for external disturbances. In contrast, neural networks can be modeled to operate under given inputs, allowing for a more precise control strategy. In light of these considerations, a control system based on reinforcement learning is put forth, which is integrated with the conventional PID guidance law to facilitate the autonomous landing of fixed-wing UAVs and the automated tuning of PID parameters through the use of a Deep Q-learning Network (DQN). A traditional PID control system is constructed based on a fixed-wing UAV dynamics model, with the flight state being discretized. The landing problem is transformed into a Markov Decision Process (MDP), and the reward function is designed in accordance with the landing conditions and the UAV's attitude, respectively. The state vectors are fed into the neural network framework, and the optimized PID parameters are output by the reinforcement learning algorithm. The optimal policy is obtained through the training of the network, which enables the automatic adjustment of parameters and the optimization of the traditional PID control system. Furthermore, the efficacy of the control algorithms in actual scenarios is validated through the simulation of UAV state vector perturbations and ideal gliding curves. The results demonstrate that the controller modified by the DQN network exhibits a markedly superior convergence effect and maneuverability compared to the unmodified traditional controller.
引用
收藏
页数:24
相关论文
共 50 条
  • [1] Deep Reinforcement Learning Automatic Landing Control of Fixed-Wing Aircraft Using Deep Deterministic Policy Gradient
    Tang, Chi
    Lai, Ying-Chih
    2020 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS'20), 2020, : 1 - 9
  • [2] Landing Control of Fixed-wing UAV Based on ADRC
    Zhu, Guojun
    Qi, Juntong
    Wu, Chong
    PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8020 - 8025
  • [3] Coordination control method for fixed-wing UAV formation through deep reinforcement learning
    Xiang X.
    Yan C.
    Wang C.
    Yin D.
    Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2021, 42 (04):
  • [4] Automatic Landing Control Based on GPS for Fixed-Wing Aircraft
    Jantawong, Jirayuth
    Deelertpaiboon, Chirdpong
    2018 15TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY (ECTI-CON), 2018, : 317 - 320
  • [5] Control and motion planning of fixed-wing UAV through reinforcement learning
    Giral, Francisco
    Gomez, Ignacio
    Le Clainche, Soledad
    RESULTS IN ENGINEERING, 2024, 23
  • [6] Deep Reinforcement Learning Attitude Control of Fixed-Wing UAVs
    Zhen, Yan
    Hao, Mingrui
    Sun, Wendi
    PROCEEDINGS OF 2020 3RD INTERNATIONAL CONFERENCE ON UNMANNED SYSTEMS (ICUS), 2020, : 239 - 244
  • [7] Design of longitudinal control law for small fixed-wing UAV during auto landing
    Gao J.-Z.
    Jia H.-G.
    Gao, Jiu-Zhou (gaojiuzhou@126.com), 1799, Chinese Academy of Sciences (24): : 1799 - 1806
  • [8] Fixed-Wing Stalled Maneuver Control Technology Based on Deep Reinforcement Learning
    Hu, Weijun
    Gao, Zhiqiang
    Quan, Jiale
    Ma, Xianlong
    Xiong, Jingyi
    Zhang, Weijie
    2022 IEEE THE 5TH INTERNATIONAL CONFERENCE ON BIG DATA AND ARTIFICIAL INTELLIGENCE (BDAI 2022), 2022, : 19 - 25
  • [9] Cooperative formation control of fixed-wing UAVs based on deep reinforcement learning
    Yue, Keyuan
    Yuan, Jianquan
    Hao, Mingrui
    SEVENTH ASIA PACIFIC CONFERENCE ON OPTICS MANUFACTURE (APCOM 2021), 2022, 12166
  • [10] Vision-assisted deep stall landing for a fixed-wing UAV
    Kim, Doyoung
    Park, Sanghyuk
    JOURNAL OF FIELD ROBOTICS, 2022, 39 (07) : 1138 - 1152