Reinforcement Learning-based path tracking for underactuated UUV under intermittent communication

被引:4
|
作者
Liu Z. [1 ]
Cai W. [1 ]
Zhang M. [2 ]
机构
[1] Hangzhou Dianzi University, 2nd Street, Zhejiang, Hangzhou
[2] Zhejiang University of Water Resources and Electric Power, 2nd Street, Zhejiang, Hangzhou
基金
中国国家自然科学基金;
关键词
Intermittent communication; Path control; Self-attention mechanism; Soft Actor and Critic (SAC); Unmanned Underwater Vehicle (UUV);
D O I
10.1016/j.oceaneng.2023.116076
中图分类号
学科分类号
摘要
This paper studies the path control of a six-degree-of-freedom underactuated Unmanned Underwater Vehicle (UUV) under limited communication conditions. Considering the large number of coupling between six-degree-of-freedom underactuated UUV of unknown dynamic models, traditional model-based control methods are difficult to effectively solve the three-dimensional path control problem. A self-attention based soft actor and critic (A-SAC) algorithm is designed to learn effective control policy from random paths. The problem of limited target acquisition by UUV in the actual underwater environment is effectively overcome, which is mainly caused by the inability of UUV to consistently receive information about their expected path. A new state space is designed and a self-attention mechanism is introduced to improve the efficiency of using discontinuous path information. Furthermore, the validation experiment compares classical Reinforcement Learning methods such as DDPG, PPO, and etc. Compared to other existing methods, the proposed A-SAC algorithm can more quickly and effectively learn the path control policy for a six-degree-of-freedom UUV that operates in a complex environment. © 2023 Elsevier Ltd
引用
收藏
相关论文
共 50 条
  • [1] Deep reinforcement learning-based path planning of underactuated surface vessels
    Xu H.
    Wang N.
    Zhao H.
    Zheng Z.
    Cyber-Physical Systems, 2019, 5 (01): : 1 - 17
  • [2] Path tracking-based underactuated UUV formation coordinated control
    Bian, Xinqian
    Mou, Chunhui
    Zhang, Xun
    Yan, Zheping
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2012, 40 (04): : 37 - 42
  • [3] Curriculum-based reinforcement learning for path tracking in an underactuated nonholonomic system
    Chivkula, Prashanth
    Rodwell, Colin
    Tallapragada, Phanindra
    IFAC PAPERSONLINE, 2022, 55 (37): : 339 - 344
  • [4] Path Following of Underactuated UUV Based On Backstepping
    Yan Zheping
    Chi Dongnan
    Jia Heming
    Zhou Jiajia
    PROCEEDINGS OF THE 10TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA 2012), 2012, : 2734 - 2739
  • [5] UGV Navigation Optimization Aided by Reinforcement Learning-Based Path Tracking
    Wei, Minggao
    Wang, Song
    Zheng, Jinfan
    Chen, Dan
    IEEE ACCESS, 2018, 6 : 57814 - 57825
  • [6] A Reinforcement Learning-Based Adaptive Path Tracking Approach for Autonomous Driving
    Shan, Yunxiao
    Zheng, Boli
    Chen, Longsheng
    Chen, Long
    Chen, De
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2020, 69 (10) : 10581 - 10595
  • [7] Reinforcement Learning-Based Approach to Robot Path Tracking in Nonlinear Dynamic Environments
    Chen, Wei
    Zhou, Zebin
    INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2024, 21 (04)
  • [8] Reinforcement learning-based fuzzy controller for autonomous guided vehicle path tracking
    Kuo, Ping-Huan
    Chen, Sing-Yan
    Feng, Po-Hsun
    Chang, Chen-Wen
    Huang, Chiou-Jye
    Peng, Chao-Chung
    ADVANCED ENGINEERING INFORMATICS, 2025, 65
  • [9] Stable Learning-Based Tracking Control of Underactuated Balance Robots
    Han, Feng
    Yi, Jingang
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (02) : 1543 - 1550
  • [10] Reinforcement learning-based unknown reference tracking control of HMASs with nonidentical communication delays
    Yong XU
    Zheng-Guang WU
    Wei-Wei CHE
    Deyuan MENG
    ScienceChina(InformationSciences), 2023, 66 (07) : 46 - 57