Path-Following Control of Unmanned Underwater Vehicle Based on an Improved TD3 Deep Reinforcement Learning

被引:10
|
作者
Fan, Yexin [1 ]
Dong, Hongyang [1 ]
Zhao, Xiaowei [1 ]
Denissenko, Petr [2 ]
机构
[1] Univ Warwick, Sch Engn, Intelligent Control & Smart Energy ICSE Res Grp, Coventry CV4 7AL, England
[2] Univ Warwick, Sch Engn, Coventry CV4 7AL, England
基金
英国工程与自然科学研究理事会;
关键词
Vehicle dynamics; Attitude control; Uncertainty; Training; Oscillators; Convergence; Complexity theory; Deep reinforcement learning (DRL); intelligent control; path-following control; unmanned underwater vehicles (UUVs); ADAPTIVE-CONTROL; TRACKING CONTROL; AUV;
D O I
10.1109/TCST.2024.3377876
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This work proposes an innovative path-following control method, anchored in deep reinforcement learning (DRL), for unmanned underwater vehicles (UUVs). This approach is driven by several new designs, all of which aim to enhance learning efficiency and effectiveness and achieve high-performance UUV control. Specifically, a novel experience replay strategy is designed and integrated within the twin-delayed deep deterministic policy gradient algorithm (TD3). It distinguishes the significance of stored transitions by making a trade-off between rewards and temporal-difference (TD) errors, thus enabling the UUV agent to explore optimal control policies more efficiently. Another major challenge within this control problem arises from action oscillations associated with DRL policies. This issue leads to excessive system wear on actuators and makes real-time application difficult. To mitigate this challenge, a newly improved regularization method is proposed, which provides a moderate level of smoothness to the control policy. Furthermore, a dynamic reward function featuring adaptive constraints is designed to avoid unproductive exploration and expedite learning convergence speed further. Simulation results show that our method garners higher rewards in fewer training episodes compared with mainstream DRL-based control approaches (e.g., deep deterministic policy gradient (DDPG) and vanilla TD3) in UUV applications. Moreover, it can adapt to varying path configurations amid uncertainties and disturbances, all while ensuring high tracking accuracy. Simulation and experimental studies are conducted to verify the effectiveness.
引用
收藏
页码:1904 / 1919
页数:16
相关论文
共 50 条
  • [21] On the performance of different Deep Reinforcement Learning based controllers for the path-following of a ship
    Sivaraj, Sivaraman
    Dubey, Awanish
    Rajendran, Suresh
    OCEAN ENGINEERING, 2023, 286
  • [22] Path-following control for an unmanned aerial vehicle slung load system
    Al Lawati, Mohamed
    Lynch, Alan F.
    INTERNATIONAL JOURNAL OF CONTROL, 2024, 97 (11) : 2467 - 2481
  • [23] A DMPC-Based Approach to Circular Cooperative Path-following Control of Unmanned Underwater Vehicles
    Hu, Jiwei
    Jin, Bo
    Li, Huiping
    Yan, Weisheng
    Liu, Mingyong
    Cui, Rongxin
    2019 IEEE 28TH INTERNATIONAL SYMPOSIUM ON INDUSTRIAL ELECTRONICS (ISIE), 2019, : 1207 - 1212
  • [24] Charging Station Management Strategy for Returns Maximization via Improved TD3 Deep Reinforcement Learning
    Li, Hengjie
    Zhu, Jianghao
    Zhou, Yun
    Feng, Qi
    Feng, Donghan
    INTERNATIONAL TRANSACTIONS ON ELECTRICAL ENERGY SYSTEMS, 2022, 2022
  • [25] Path Following Backstepping Control of Underactuated Unmanned Underwater Vehicle
    Juan, Li
    Zhang, Qingyan
    Cheng, Xinghua
    Mohammed, Naeim Farouk
    2015 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION, 2015, : 2267 - 2272
  • [26] USV Formation and Path-Following Control via Deep Reinforcement Learning With Random Braking
    Zhao, Yujiao
    Ma, Yong
    Hu, Songlin
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (12) : 5468 - 5478
  • [27] Path-following Control of Fish-like Robots: A Deep Reinforcement Learning Approach
    Zhang, Tianhao
    Tian, Runyu
    Wang, Chen
    Xie, Guangming
    IFAC PAPERSONLINE, 2020, 53 (02): : 8163 - 8168
  • [28] Path-Following and Obstacle Avoidance Control of Nonholonomic Wheeled Mobile Robot Based on Deep Reinforcement Learning
    Cheng, Xiuquan
    Zhang, Shaobo
    Cheng, Sizhu
    Xia, Qinxiang
    Zhang, Junhao
    APPLIED SCIENCES-BASEL, 2022, 12 (14):
  • [29] Finite -time unknown observer based coordinated path-following control of unmanned underwater vehicles
    Liang, Xiao
    Qu, Xingru
    Hou, Yuanhang
    Li, Ye
    Zhang, Rubo
    JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2021, 358 (05): : 2703 - 2721
  • [30] Research on Control of Unmanned Surface Vehicle Based on Deep Reinforcement Learning
    Li, Baoan
    Ship Building of China, 2020, 61 : 14 - 20