An expert-demonstrated soft actor-critic based adaptive trajectory tracking control of Autonomous Underwater Vehicle with Long Short-Term Memory

被引:0
|
作者
Wang, Yuxuan [1 ]
Hou, Yaochun [1 ]
Lai, Zhounian [2 ]
Cao, Linlin [1 ]
Hong, Weirong [1 ]
Wu, Dazhuan [1 ]
机构
[1] Zhejiang Univ, Inst Proc Equipment, Coll Energy Engn, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, Huzhou Inst, Huzhou 313000, Peoples R China
关键词
Autonomous underwater vehicle; Trajectory tracking control; Reinforcement learning; Soft actor-critic; Long Short-Term Memory;
D O I
10.1016/j.oceaneng.2025.120405
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
In recent years, Autonomous Underwater Vehicles (AUVs) have seen remarkable technological progress, and their trajectory tracking control has emerged as a crucial research focus. To address the challenges of obtaining precise model parameters and dealing with the complex and dynamic underwater environment, data-driven approaches, such as reinforcement learning (RL), have gradually emerged. However, traditional RL methods often require large datasets and face unpredictability during the early exploration stages, making them challenging for real-world applications. To overcome these limitations, this paper proposes an expert- demonstrated soft actor-critic (ESAC) control scheme for AUV trajectory tracking. This method utilizes expert control data as demonstrations for the RL agent, accelerating the learning process and improving safety. Additionally, Long Short-Term Memory (LSTM) is employed as the policy network to effectively process the sequential state information of the AUV, enhancing control precision. Through simulations and comparisons with other typical RL-based controllers, the superiority of the proposed method is demonstrated. Finally, lake trials further validate the feasibility of the approach. The results demonstrate that the ESAC-LSTM scheme achieves faster convergence and higher control accuracy, making it well-suited for complex underwater environments.
引用
收藏
页数:11
相关论文
共 26 条
  • [1] An adaptive PID controller for path following of autonomous underwater vehicle based on Soft Actor-Critic
    Wang, Yuxuan
    Hou, Yaochun
    Lai, Zhounian
    Cao, Linlin
    Hong, Weirong
    Wu, Dazhuan
    OCEAN ENGINEERING, 2024, 307
  • [2] Autonomous Underwater Vehicle Path Planning Method of Soft Actor-Critic Based on Game Training
    Wang, Zhuo
    Lu, Hao
    Qin, Hongde
    Sui, Yancheng
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2022, 10 (12)
  • [3] Trajectory Tracking Control for Robotic Manipulator Based on Soft Actor-Critic and Generative Adversarial Imitation Learning
    Hu, Jintao
    Wang, Fujie
    Li, Xing
    Qin, Yi
    Guo, Fang
    Jiang, Ming
    BIOMIMETICS, 2024, 9 (12)
  • [4] End-to-end autonomous underwater vehicle path following control method based on improved soft actor-critic for deep space exploration
    Dong, Na
    Liu, Shoufu
    Ip, Andrew W. H.
    Yung, Kai Leung
    Gao, Zhongke
    Juan, Rongshun
    Wang, Yanhui
    JOURNAL OF INDUSTRIAL INFORMATION INTEGRATION, 2025, 45
  • [5] Adaptive Generalized Dynamic Inversion based Trajectory Tracking Control of Autonomous Underwater Vehicle
    Ansari, Uzair
    Bajodah, Abdulrahman H.
    Alam, Saqib
    2018 26TH MEDITERRANEAN CONFERENCE ON CONTROL AND AUTOMATION (MED), 2018, : 588 - 594
  • [6] Comprehensive Analysis of Adaptive Soft Actor-Critic Reinforcement Learning-Based Control Framework for Autonomous Driving in Varied Scenarios
    Liu, Hebing
    Sun, Jinhong
    Wang, Heshou
    Cheng, Ka Wai Eric
    IEEE TRANSACTIONS ON TRANSPORTATION ELECTRIFICATION, 2025, 11 (01): : 3667 - 3679
  • [7] Nonlinear trajectory-tracking control for autonomous underwater vehicle based on iterative adaptive dynamic programming
    Che, Gaofeng
    Liu, Lijun
    Yu, Zhen
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2019, 37 (03) : 4205 - 4215
  • [8] Long Short-Term Memory-Based Human-Driven Vehicle Longitudinal Trajectory Prediction in a Connected and Autonomous Vehicle Environment
    Lin, Lei
    Gong, Siyuan
    Peeta, Srinivas
    Wu, Xia
    TRANSPORTATION RESEARCH RECORD, 2021, 2675 (06) : 380 - 390
  • [9] Vehicle Trajectory Prediction Based on Mixed Teaching Force Long Short-term Memory
    Fang H.-Z.
    Liu L.
    Xiao X.-F.
    Gu Q.
    Meng Y.
    Jiaotong Yunshu Xitong Gongcheng Yu Xinxi/Journal of Transportation Systems Engineering and Information Technology, 2023, 23 (04): : 80 - 87
  • [10] Procapra Przewalskii Tracking Autonomous Unmanned Aerial Vehicle Based on Improved Long and Short-Term Memory Kalman Filters
    Luo, Wei
    Zhao, Yongxiang
    Shao, Quanqin
    Li, Xiaoliang
    Wang, Dongliang
    Zhang, Tongzuo
    Liu, Fei
    Duan, Longfang
    He, Yuejun
    Wang, Yancang
    Zhang, Guoqing
    Wang, Xinghui
    Yu, Zhongde
    SENSORS, 2023, 23 (08)