Automatic Landing Control for Fixed-Wing UAV in Longitudinal Channel Based on Deep Reinforcement Learning

被引：2

作者：

Li, Jinghang ^{[1
]}

Xu, Shuting ^{[1
]}

Wu, Yu ^{[2
,3
]}

Zhang, Zhe ^{[4
]}

机构：

[1] Beijing Forestry Univ, Coll Engn, Beijing 100083, Peoples R China

[2] Chongqing Univ, Coll Aerosp Engn, Chongqing 400044, Peoples R China

[3] Nanyang Technol Univ, Sch Mech & Aerosp Engn, Singapore 639798, Singapore

[4] Beijing Inst Technol, Sch Aerosp Engn, Beijing 100081, Peoples R China

来源：

DRONES | 2024年 / 8卷 / 10期

基金：

中国国家自然科学基金;

关键词：

fixed-wing UAV; automatic landing control; parameter tuning; deep reinforcement learning; Deep Q-learning Network (DQN); FUTURE;

D O I：

10.3390/drones8100568

中图分类号：

TP7 [遥感技术];

学科分类号：

081102 ; 0816 ; 081602 ; 083002 ; 1404 ;

摘要：

The objective is to address the control problem associated with the landing process of unmanned aerial vehicles (UAVs), with a particular focus on fixed-wing UAVs. The Proportional-Integral-Derivative (PID) controller is a widely used control method, which requires the tuning of its parameters to account for the specific characteristics of the landing environment and the potential for external disturbances. In contrast, neural networks can be modeled to operate under given inputs, allowing for a more precise control strategy. In light of these considerations, a control system based on reinforcement learning is put forth, which is integrated with the conventional PID guidance law to facilitate the autonomous landing of fixed-wing UAVs and the automated tuning of PID parameters through the use of a Deep Q-learning Network (DQN). A traditional PID control system is constructed based on a fixed-wing UAV dynamics model, with the flight state being discretized. The landing problem is transformed into a Markov Decision Process (MDP), and the reward function is designed in accordance with the landing conditions and the UAV's attitude, respectively. The state vectors are fed into the neural network framework, and the optimized PID parameters are output by the reinforcement learning algorithm. The optimal policy is obtained through the training of the network, which enables the automatic adjustment of parameters and the optimization of the traditional PID control system. Furthermore, the efficacy of the control algorithms in actual scenarios is validated through the simulation of UAV state vector perturbations and ideal gliding curves. The results demonstrate that the controller modified by the DQN network exhibits a markedly superior convergence effect and maneuverability compared to the unmodified traditional controller.

引用

页数：24

共 50 条

[41] Longitudinal Control Law Design for Fixed-wing UAV Based on Multi-model Technique
Kong, Desheng
Geng, Qingbo
Hu, Qiong
Shao, Jianbo
FIFTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP), 2014, : 48 - 52
[42] Deep Reinforcement Learning based running-track path design for fixed-wing UAV assisted mobile relaying network
Wang, Tao
Ji, Xiaodong
Zhu, Xuan
He, Cheng
Gu, Jian-Feng
VEHICULAR COMMUNICATIONS, 2024, 50
[43] Automatic Tuning and Turbulence Mitigation for Fixed-Wing UAV with Segmented Control Surfaces
Sattar, Abdul
Wang, Liuping
Hoshu, Ayaz Ahmed
Ansari, Shahzeb
Karar, Haider-E
Mohamed, Abdulghani
DRONES, 2022, 6 (10)
[44] Gain Scheduled Attitude Control of Fixed-Wing UAV With Automatic Controller Tuning
Poksawat, Pakorn
Wang, Liuping
Mohamed, Abdulghani
IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY, 2018, 26 (04) : 1192 - 1203
[45] Deep Reinforcement Learning Attitude Control of Fixed-Wing UAVs Using Proximal Policy Optimization
Bohn, Eivind
Coates, Erlend M.
Moe, Signe
Johansen, Tor Arne
2019 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS' 19), 2019, : 523 - 533
[46] Leader-Follower Formation Control for Fixed-Wing UAVs using Deep Reinforcement Learning
Shi, Yu
Song, Jianshuang
Hua, Yongzhao
Yu, Jianglong
Dong, Xiwang
Ren, Zhang
2022 41ST CHINESE CONTROL CONFERENCE (CCC), 2022, : 3456 - 3461
[47] Performance of Variable Pitch Propeller for Longitudinal Control in an Agile Fixed-Wing UAV
Kumar, Sajith K. K.
Arya, Hemendra
Joshi, Ashok
2019 IEEE AEROSPACE CONFERENCE, 2019,
[48] Multi-model Control Design for Longitudinal Dynamics of Fixed-wing UAV
Shao, Jianbo
Fei, Qing
Hu, Qiong
Geng, Qingbo
2014 33RD CHINESE CONTROL CONFERENCE (CCC), 2014, : 8826 - 8830
[49] Low level control architecture for automatic takeoff and landing of fixed wing UAV
Manjarrez, Hector
Davila, Jorge
Lozano, Rogelio
2018 ANNUAL AMERICAN CONTROL CONFERENCE (ACC), 2018, : 6737 - 6742
[50] Automatic Landing for Fixed-wing UAV Using Stereo Vision with A Single Camera and An Orientation Sensor: A Concept
Sereewattana, Montika
Ruchanurucks, Miti
Rakprayoon, Panjawee
Siddhichai, Supakorn
Hasegawa, Shoichi
2015 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS (AIM), 2015, : 29 - 34

← 1 2 3 4 5 →