An expert-demonstrated soft actor-critic based adaptive trajectory tracking control of Autonomous Underwater Vehicle with Long Short-Term Memory

被引：0

作者：

Wang, Yuxuan ^{[1
]}

Hou, Yaochun ^{[1
]}

Lai, Zhounian ^{[2
]}

Cao, Linlin ^{[1
]}

Hong, Weirong ^{[1
]}

Wu, Dazhuan ^{[1
]}

机构：

[1] Zhejiang Univ, Inst Proc Equipment, Coll Energy Engn, Hangzhou 310027, Peoples R China

[2] Zhejiang Univ, Huzhou Inst, Huzhou 313000, Peoples R China

来源：

OCEAN ENGINEERING | 2025年 / 321卷

关键词：

Autonomous underwater vehicle; Trajectory tracking control; Reinforcement learning; Soft actor-critic; Long Short-Term Memory;

D O I：

10.1016/j.oceaneng.2025.120405

中图分类号：

U6 [水路运输]; P75 [海洋工程];

学科分类号：

0814 ; 081505 ; 0824 ; 082401 ;

摘要：

In recent years, Autonomous Underwater Vehicles (AUVs) have seen remarkable technological progress, and their trajectory tracking control has emerged as a crucial research focus. To address the challenges of obtaining precise model parameters and dealing with the complex and dynamic underwater environment, data-driven approaches, such as reinforcement learning (RL), have gradually emerged. However, traditional RL methods often require large datasets and face unpredictability during the early exploration stages, making them challenging for real-world applications. To overcome these limitations, this paper proposes an expert- demonstrated soft actor-critic (ESAC) control scheme for AUV trajectory tracking. This method utilizes expert control data as demonstrations for the RL agent, accelerating the learning process and improving safety. Additionally, Long Short-Term Memory (LSTM) is employed as the policy network to effectively process the sequential state information of the AUV, enhancing control precision. Through simulations and comparisons with other typical RL-based controllers, the superiority of the proposed method is demonstrated. Finally, lake trials further validate the feasibility of the approach. The results demonstrate that the ESAC-LSTM scheme achieves faster convergence and higher control accuracy, making it well-suited for complex underwater environments.

引用

页数：11

共 26 条

[21] An adaptive internal model control approach for unmanned surface vehicle based on bidirectional long short-term memory neural network: Implementation and field testing
Meng, Yuhang
Ye, Hui
Xiang, Zhengrong
Yang, Xiaofei
Zhang, Hao
MECHATRONICS, 2024, 99
[22] Multi unmanned vehicle cooperative encirclement control based on bidirectional long short-term memory and mixed reward functions
Gu, Jian
Wang, Yin
Su, MuQing
Kong, XiaoPing
Duan, KeXiang
Yu, Meng
Zhongguo Kexue Jishu Kexue/Scientia Sinica Technologica, 2024, 54 (09): : 1665 - 1675
[23] Lyapunov-Based Physics-Informed Long Short-Term Memory (LSTM) Neural Network-Based Adaptive Control
Hart R.G.
Griffis E.J.
Patil O.S.
Dixon W.E.
IEEE Control Systems Letters, 2024, 8 : 13 - 18
[24] Power reserve predictive control strategy for hybrid electric vehicle using recognition-based long short-term memory network
Chen, Ruihu
Yang, Chao
Han, Lijin
Wang, Weida
Ma, Yue
Xiang, Changle
JOURNAL OF POWER SOURCES, 2022, 520
[25] Optimization of node deployment in underwater internet of things using novel adaptive long short-term memory-based egret swarm optimization algorithm
Simon, Judy
Kapileswar, Nellore
Padmavathi, Baskaran
Devi, Krishnamoorthy Durga
Kumar, Polasi Phani
INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2024, 37 (17)
[26] Convolutional Long Short-Term Memory Predictor for Collaborative Remotely Operated Vehicle Trajectory Tracking in a Leader-Follower Formation Subject to Communication and Sensor Latency in the Presence of External Disturbances
Perez-Alvarado, Milton Eduardo
Gomez-Espinosa, Alfonso
Gonzalez-Garcia, Josue
Garcia-Valdovinos, Luis Govinda
Salgado-Jimenez, Tomas
MACHINES, 2024, 12 (10)

← 1 2 3 →