Enhancing UAV Aerial Docking: A Hybrid Approach Combining Offline and Online Reinforcement Learning

被引:0
|
作者
Feng, Yuting [1 ]
Yang, Tao [1 ]
Yu, Yushu [1 ]
机构
[1] Beijing Inst Technol, Sch Mechatron Engn, Beijing 100081, Peoples R China
基金
中国国家自然科学基金;
关键词
uav aerial docking; offline reinforcement learning; online reinforcement learning; INTERNAL DYNAMICS; QUADROTORS; PLATFORM;
D O I
10.3390/drones8050168
中图分类号
TP7 [遥感技术];
学科分类号
081102 ; 0816 ; 081602 ; 083002 ; 1404 ;
摘要
In our study, we explore the task of performing docking maneuvers between two unmanned aerial vehicles (UAVs) using a combination of offline and online reinforcement learning (RL) methods. This task requires a UAV to accomplish external docking while maintaining stable flight control, representing two distinct types of objectives at the task execution level. Direct online RL training could lead to catastrophic forgetting, resulting in training failure. To overcome these challenges, we design a rule-based expert controller and accumulate an extensive dataset. Based on this, we concurrently design a series of rewards and train a guiding policy through offline RL. Then, we conduct comparative verification on different RL methods, ultimately selecting online RL to fine-tune the model trained offline. This strategy effectively combines the efficiency of offline RL with the exploratory capabilities of online RL. Our approach improves the success rate of the UAV's aerial docking task, increasing it from 40% under the expert policy to 95%.
引用
收藏
页数:18
相关论文
共 50 条
  • [31] Hybrid approach to Reinforcement Learning
    Boulebtateche, Brahim
    Fezari, Mourad
    Boughazi, Mohamed
    INTELLIGENT SYSTEMS AND AUTOMATION, 2008, 1019 : 216 - 220
  • [32] Hybrid Offline-Online UAV Trajectory Design and Subchannel Allocation in UAV Relaying OFDMA Networks
    Park, Young-Ik
    Kim, Do-Yup
    Lee, Jang-Won
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (15): : 26173 - 26190
  • [33] Energy Management of Hybrid UAV Based on Reinforcement Learning
    Shen, Huan
    Zhang, Yao
    Mao, Jianguo
    Yan, Zhiwei
    Wu, Linwei
    ELECTRONICS, 2021, 10 (16)
  • [34] Enhancing HVAC Control Efficiency: A Hybrid Approach Using Imitation and Reinforcement Learning
    Kadamala, Kevlyn
    Chambers, Des
    Barrett, Enda
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES-APPLIED DATA SCIENCE TRACK, PT IX, ECML PKDD 2024, 2024, 14949 : 256 - 270
  • [35] Weighting Online Decision Transformer with Episodic Memory for Offline-to-Online Reinforcement Learning
    Ma, Xiao
    Li, Wu-Jun
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024), 2024, : 10793 - 10799
  • [36] An offline-to-online reinforcement learning approach based on multi-action evaluation with policy extension
    Cheng, Xuebo
    Huang, Xiaohui
    Huang, Zhichao
    Jiang, Nan
    APPLIED INTELLIGENCE, 2024, 54 (23) : 12246 - 12271
  • [37] Effective Traffic Signal Control with Offline-to-Online Reinforcement Learning
    Ma, Jinming
    Wu, Feng
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2023, : 5567 - 5573
  • [38] Offline Meta-Reinforcement Learning with Online Self-Supervision
    Pong, Vitchyr H.
    Nair, Ashvin
    Smith, Laura
    Huang, Catherine
    Levine, Sergey
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [39] Online Tuning for Offline Decentralized Multi-Agent Reinforcement Learning
    Jiang, Jiechuan
    Lu, Zongqing
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 7, 2023, : 8050 - +
  • [40] A maintenance planning framework using online and offline deep reinforcement learning
    Bukhsh, Zaharah A.
    Molegraaf, Hajo
    Jansen, Nils
    NEURAL COMPUTING & APPLICATIONS, 2023,