Automatic Landing Control for Fixed-Wing UAV in Longitudinal Channel Based on Deep Reinforcement Learning

被引:2
|
作者
Li, Jinghang [1 ]
Xu, Shuting [1 ]
Wu, Yu [2 ,3 ]
Zhang, Zhe [4 ]
机构
[1] Beijing Forestry Univ, Coll Engn, Beijing 100083, Peoples R China
[2] Chongqing Univ, Coll Aerosp Engn, Chongqing 400044, Peoples R China
[3] Nanyang Technol Univ, Sch Mech & Aerosp Engn, Singapore 639798, Singapore
[4] Beijing Inst Technol, Sch Aerosp Engn, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
fixed-wing UAV; automatic landing control; parameter tuning; deep reinforcement learning; Deep Q-learning Network (DQN); FUTURE;
D O I
10.3390/drones8100568
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
The objective is to address the control problem associated with the landing process of unmanned aerial vehicles (UAVs), with a particular focus on fixed-wing UAVs. The Proportional-Integral-Derivative (PID) controller is a widely used control method, which requires the tuning of its parameters to account for the specific characteristics of the landing environment and the potential for external disturbances. In contrast, neural networks can be modeled to operate under given inputs, allowing for a more precise control strategy. In light of these considerations, a control system based on reinforcement learning is put forth, which is integrated with the conventional PID guidance law to facilitate the autonomous landing of fixed-wing UAVs and the automated tuning of PID parameters through the use of a Deep Q-learning Network (DQN). A traditional PID control system is constructed based on a fixed-wing UAV dynamics model, with the flight state being discretized. The landing problem is transformed into a Markov Decision Process (MDP), and the reward function is designed in accordance with the landing conditions and the UAV's attitude, respectively. The state vectors are fed into the neural network framework, and the optimized PID parameters are output by the reinforcement learning algorithm. The optimal policy is obtained through the training of the network, which enables the automatic adjustment of parameters and the optimization of the traditional PID control system. Furthermore, the efficacy of the control algorithms in actual scenarios is validated through the simulation of UAV state vector perturbations and ideal gliding curves. The results demonstrate that the controller modified by the DQN network exhibits a markedly superior convergence effect and maneuverability compared to the unmodified traditional controller.
引用
收藏
页数:24
相关论文
共 50 条
  • [41] Longitudinal Control Law Design for Fixed-wing UAV Based on Multi-model Technique
    Kong, Desheng
    Geng, Qingbo
    Hu, Qiong
    Shao, Jianbo
    FIFTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2014, : 48 - 52
  • [42] Deep Reinforcement Learning based running-track path design for fixed-wing UAV assisted mobile relaying network
    Wang, Tao
    Ji, Xiaodong
    Zhu, Xuan
    He, Cheng
    Gu, Jian-Feng
    VEHICULAR COMMUNICATIONS, 2024, 50
  • [43] Automatic Tuning and Turbulence Mitigation for Fixed-Wing UAV with Segmented Control Surfaces
    Sattar, Abdul
    Wang, Liuping
    Hoshu, Ayaz Ahmed
    Ansari, Shahzeb
    Karar, Haider-E
    Mohamed, Abdulghani
    DRONES, 2022, 6 (10)
  • [44] Gain Scheduled Attitude Control of Fixed-Wing UAV With Automatic Controller Tuning
    Poksawat, Pakorn
    Wang, Liuping
    Mohamed, Abdulghani
    IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2018, 26 (04) : 1192 - 1203
  • [45] Deep Reinforcement Learning Attitude Control of Fixed-Wing UAVs Using Proximal Policy Optimization
    Bohn, Eivind
    Coates, Erlend M.
    Moe, Signe
    Johansen, Tor Arne
    2019 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS' 19), 2019, : 523 - 533
  • [46] Leader-Follower Formation Control for Fixed-Wing UAVs using Deep Reinforcement Learning
    Shi, Yu
    Song, Jianshuang
    Hua, Yongzhao
    Yu, Jianglong
    Dong, Xiwang
    Ren, Zhang
    2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 3456 - 3461
  • [47] Performance of Variable Pitch Propeller for Longitudinal Control in an Agile Fixed-Wing UAV
    Kumar, Sajith K. K.
    Arya, Hemendra
    Joshi, Ashok
    2019 IEEE AEROSPACE CONFERENCE, 2019,
  • [48] Multi-model Control Design for Longitudinal Dynamics of Fixed-wing UAV
    Shao, Jianbo
    Fei, Qing
    Hu, Qiong
    Geng, Qingbo
    2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 8826 - 8830
  • [49] Low level control architecture for automatic takeoff and landing of fixed wing UAV
    Manjarrez, Hector
    Davila, Jorge
    Lozano, Rogelio
    2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 6737 - 6742
  • [50] Automatic Landing for Fixed-wing UAV Using Stereo Vision with A Single Camera and An Orientation Sensor: A Concept
    Sereewattana, Montika
    Ruchanurucks, Miti
    Rakprayoon, Panjawee
    Siddhichai, Supakorn
    Hasegawa, Shoichi
    2015 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2015, : 29 - 34