Automatic Landing Control for Fixed-Wing UAV in Longitudinal Channel Based on Deep Reinforcement Learning

被引：2

作者：

Li, Jinghang ^{[1
]}

Xu, Shuting ^{[1
]}

Wu, Yu ^{[2
,3
]}

Zhang, Zhe ^{[4
]}

机构：

[1] Beijing Forestry Univ, Coll Engn, Beijing 100083, Peoples R China

[2] Chongqing Univ, Coll Aerosp Engn, Chongqing 400044, Peoples R China

[3] Nanyang Technol Univ, Sch Mech & Aerosp Engn, Singapore 639798, Singapore

[4] Beijing Inst Technol, Sch Aerosp Engn, Beijing 100081, Peoples R China

来源：

DRONES | 2024年 / 8卷 / 10期

基金：

中国国家自然科学基金;

关键词：

fixed-wing UAV; automatic landing control; parameter tuning; deep reinforcement learning; Deep Q-learning Network (DQN); FUTURE;

D O I：

10.3390/drones8100568

中图分类号：

TP7 [遥感技术];

学科分类号：

081102 ; 0816 ; 081602 ; 083002 ; 1404 ;

摘要：

The objective is to address the control problem associated with the landing process of unmanned aerial vehicles (UAVs), with a particular focus on fixed-wing UAVs. The Proportional-Integral-Derivative (PID) controller is a widely used control method, which requires the tuning of its parameters to account for the specific characteristics of the landing environment and the potential for external disturbances. In contrast, neural networks can be modeled to operate under given inputs, allowing for a more precise control strategy. In light of these considerations, a control system based on reinforcement learning is put forth, which is integrated with the conventional PID guidance law to facilitate the autonomous landing of fixed-wing UAVs and the automated tuning of PID parameters through the use of a Deep Q-learning Network (DQN). A traditional PID control system is constructed based on a fixed-wing UAV dynamics model, with the flight state being discretized. The landing problem is transformed into a Markov Decision Process (MDP), and the reward function is designed in accordance with the landing conditions and the UAV's attitude, respectively. The state vectors are fed into the neural network framework, and the optimized PID parameters are output by the reinforcement learning algorithm. The optimal policy is obtained through the training of the network, which enables the automatic adjustment of parameters and the optimization of the traditional PID control system. Furthermore, the efficacy of the control algorithms in actual scenarios is validated through the simulation of UAV state vector perturbations and ideal gliding curves. The results demonstrate that the controller modified by the DQN network exhibits a markedly superior convergence effect and maneuverability compared to the unmodified traditional controller.

引用

页数：24

共 50 条

[1] Deep Reinforcement Learning Automatic Landing Control of Fixed-Wing Aircraft Using Deep Deterministic Policy Gradient
Tang, Chi
Lai, Ying-Chih
2020 INTERNATIONAL CONFERENCE ON UNMANNED AIRCRAFT SYSTEMS (ICUAS'20), 2020, : 1 - 9
[2] Landing Control of Fixed-wing UAV Based on ADRC
Zhu, Guojun
Qi, Juntong
Wu, Chong
PROCEEDINGS OF THE 38TH CHINESE CONTROL CONFERENCE (CCC), 2019, : 8020 - 8025
[3] Coordination control method for fixed-wing UAV formation through deep reinforcement learning
Xiang X.
Yan C.
Wang C.
Yin D.
Hangkong Xuebao/Acta Aeronautica et Astronautica Sinica, 2021, 42 (04):
[4] Automatic Landing Control Based on GPS for Fixed-Wing Aircraft
Jantawong, Jirayuth
Deelertpaiboon, Chirdpong
2018 15TH INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING/ELECTRONICS, COMPUTER, TELECOMMUNICATIONS AND INFORMATION TECHNOLOGY (ECTI-CON), 2018, : 317 - 320
[5] Control and motion planning of fixed-wing UAV through reinforcement learning
Giral, Francisco
Gomez, Ignacio
Le Clainche, Soledad
RESULTS IN ENGINEERING, 2024, 23
[6] Deep Reinforcement Learning Attitude Control of Fixed-Wing UAVs
Zhen, Yan
Hao, Mingrui
Sun, Wendi
PROCEEDINGS OF 2020 3RD INTERNATIONAL CONFERENCE ON UNMANNED SYSTEMS (ICUS), 2020, : 239 - 244
[7] Design of longitudinal control law for small fixed-wing UAV during auto landing
Gao J.-Z.
Jia H.-G.
Gao, Jiu-Zhou (gaojiuzhou@126.com), 1799, Chinese Academy of Sciences (24): : 1799 - 1806
[8] Fixed-Wing Stalled Maneuver Control Technology Based on Deep Reinforcement Learning
Hu, Weijun
Gao, Zhiqiang
Quan, Jiale
Ma, Xianlong
Xiong, Jingyi
Zhang, Weijie
2022 IEEE THE 5TH INTERNATIONAL CONFERENCE ON BIG DATA AND ARTIFICIAL INTELLIGENCE (BDAI 2022), 2022, : 19 - 25
[9] Cooperative formation control of fixed-wing UAVs based on deep reinforcement learning
Yue, Keyuan
Yuan, Jianquan
Hao, Mingrui
SEVENTH ASIA PACIFIC CONFERENCE ON OPTICS MANUFACTURE (APCOM 2021), 2022, 12166
[10] Vision-assisted deep stall landing for a fixed-wing UAV
Kim, Doyoung
Park, Sanghyuk
JOURNAL OF FIELD ROBOTICS, 2022, 39 (07) : 1138 - 1152

← 1 2 3 4 5 →