An expert-demonstrated soft actor-critic based adaptive trajectory tracking control of Autonomous Underwater Vehicle with Long Short-Term Memory

被引:0
|
作者
Wang, Yuxuan [1 ]
Hou, Yaochun [1 ]
Lai, Zhounian [2 ]
Cao, Linlin [1 ]
Hong, Weirong [1 ]
Wu, Dazhuan [1 ]
机构
[1] Zhejiang Univ, Inst Proc Equipment, Coll Energy Engn, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, Huzhou Inst, Huzhou 313000, Peoples R China
关键词
Autonomous underwater vehicle; Trajectory tracking control; Reinforcement learning; Soft actor-critic; Long Short-Term Memory;
D O I
10.1016/j.oceaneng.2025.120405
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
In recent years, Autonomous Underwater Vehicles (AUVs) have seen remarkable technological progress, and their trajectory tracking control has emerged as a crucial research focus. To address the challenges of obtaining precise model parameters and dealing with the complex and dynamic underwater environment, data-driven approaches, such as reinforcement learning (RL), have gradually emerged. However, traditional RL methods often require large datasets and face unpredictability during the early exploration stages, making them challenging for real-world applications. To overcome these limitations, this paper proposes an expert- demonstrated soft actor-critic (ESAC) control scheme for AUV trajectory tracking. This method utilizes expert control data as demonstrations for the RL agent, accelerating the learning process and improving safety. Additionally, Long Short-Term Memory (LSTM) is employed as the policy network to effectively process the sequential state information of the AUV, enhancing control precision. Through simulations and comparisons with other typical RL-based controllers, the superiority of the proposed method is demonstrated. Finally, lake trials further validate the feasibility of the approach. The results demonstrate that the ESAC-LSTM scheme achieves faster convergence and higher control accuracy, making it well-suited for complex underwater environments.
引用
收藏
页数:11
相关论文
共 26 条
  • [21] An adaptive internal model control approach for unmanned surface vehicle based on bidirectional long short-term memory neural network: Implementation and field testing
    Meng, Yuhang
    Ye, Hui
    Xiang, Zhengrong
    Yang, Xiaofei
    Zhang, Hao
    MECHATRONICS, 2024, 99
  • [22] Multi unmanned vehicle cooperative encirclement control based on bidirectional long short-term memory and mixed reward functions
    Gu, Jian
    Wang, Yin
    Su, MuQing
    Kong, XiaoPing
    Duan, KeXiang
    Yu, Meng
    Zhongguo Kexue Jishu Kexue/Scientia Sinica Technologica, 2024, 54 (09): : 1665 - 1675
  • [23] Lyapunov-Based Physics-Informed Long Short-Term Memory (LSTM) Neural Network-Based Adaptive Control
    Hart R.G.
    Griffis E.J.
    Patil O.S.
    Dixon W.E.
    IEEE Control Systems Letters, 2024, 8 : 13 - 18
  • [24] Power reserve predictive control strategy for hybrid electric vehicle using recognition-based long short-term memory network
    Chen, Ruihu
    Yang, Chao
    Han, Lijin
    Wang, Weida
    Ma, Yue
    Xiang, Changle
    JOURNAL OF POWER SOURCES, 2022, 520
  • [25] Optimization of node deployment in underwater internet of things using novel adaptive long short-term memory-based egret swarm optimization algorithm
    Simon, Judy
    Kapileswar, Nellore
    Padmavathi, Baskaran
    Devi, Krishnamoorthy Durga
    Kumar, Polasi Phani
    INTERNATIONAL JOURNAL OF COMMUNICATION SYSTEMS, 2024, 37 (17)
  • [26] Convolutional Long Short-Term Memory Predictor for Collaborative Remotely Operated Vehicle Trajectory Tracking in a Leader-Follower Formation Subject to Communication and Sensor Latency in the Presence of External Disturbances
    Perez-Alvarado, Milton Eduardo
    Gomez-Espinosa, Alfonso
    Gonzalez-Garcia, Josue
    Garcia-Valdovinos, Luis Govinda
    Salgado-Jimenez, Tomas
    MACHINES, 2024, 12 (10)