Autonomous underwater vehicle path planning based on actor-multi-critic reinforcement learning

被引:16
|
作者
Wang, Zhuo [1 ,2 ]
Zhang, Shiwei [1 ]
Feng, Xiaoning [3 ]
Sui, Yancheng [1 ]
机构
[1] Harbin Engn Univ, Sci & Technol Underwater Vehicle Lab, Harbin, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
[3] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Autonomous underwater vehicle; path planning; dynamic obstacle avoidance; actor-critic; neural networks; FEEDFORWARD NETWORKS; ENVIRONMENT;
D O I
10.1177/0959651820937085
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The environmental adaptability of autonomous underwater vehicles is always a problem for its path planning. Although reinforcement learning can improve the environmental adaptability, the slow convergence of reinforcement learning is caused by multi-behavior coupling, so it is difficult for autonomous underwater vehicle to avoid moving obstacles. This article proposes a multi-behavior critic reinforcement learning algorithm applied to autonomous underwater vehicle path planning to overcome problems associated with oscillating amplitudes and low learning efficiency in the early stages of training which are common in traditional actor-critic algorithms. Behavior critic reinforcement learning assesses the actions of the actor from perspectives such as energy saving and security, combining these aspects into a whole evaluation of the actor. In this article, the policy gradient method is selected as the actor part, and the value function method is selected as the critic part. The strategy gradient and the value function methods for actor and critic, respectively, are approximated by a backpropagation neural network, the parameters of which are updated using the gradient descent method. The simulation results show that the method has the ability of optimizing learning in the environment and can improve learning efficiency, which meets the needs of real time and adaptability for autonomous underwater vehicle dynamic obstacle avoidance.
引用
收藏
页码:1787 / 1796
页数:10
相关论文
共 50 条
  • [21] Asynchronous Multithreading Reinforcement-Learning-Based Path Planning and Tracking for Unmanned Underwater Vehicle
    He, Zichen
    Dong, Lu
    Sun, Changyin
    Wang, Jiawei
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 52 (05): : 2757 - 2769
  • [22] PATH PLANNING FOR AN AUTONOMOUS UNDERWATER VEHICLE IN A CLUTTERED UNDERWATER ENVIRONMENT BASED ON THE HEAT METHOD
    Sun, Kaiyue
    Liu, Xiangyang
    INTERNATIONAL JOURNAL OF APPLIED MATHEMATICS AND COMPUTER SCIENCE, 2021, 31 (02) : 289 - 301
  • [23] Three-Dimensional Path Tracking Control of Autonomous Underwater Vehicle Based on Deep Reinforcement Learning
    Sun, Yushan
    Zhang, Chenming
    Zhang, Guocheng
    Xu, Hao
    Ran, Xiangrui
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2019, 7 (12)
  • [24] Intelligent Path Planning of Underwater Robot Based on Reinforcement Learning
    Yang, Jiachen
    Ni, Jingfei
    Xi, Meng
    Wen, Jiabao
    Li, Yang
    IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, 20 (03) : 1983 - 1996
  • [25] Evolutionary Actor-Multi-Critic Model for VNF-FG Embedding
    Pham Tran Anh Quang
    Hadjadj-Aoul, Yassine
    Outtagarts, Abdelkader
    2020 IEEE 17TH ANNUAL CONSUMER COMMUNICATIONS & NETWORKING CONFERENCE (CCNC 2020), 2020,
  • [26] Path planning for an autonomous underwater vehicle in pole inspection
    Song, Yoong Siang
    Arshad, Mohd Rizal
    INDIAN JOURNAL OF GEO-MARINE SCIENCES, 2018, 47 (12) : 2477 - 2484
  • [27] A Novel Path Planning Algorithm for Autonomous Underwater Vehicle
    Yin, Bo
    Liu, Bing
    Cao, Jing
    ADVANCED RESEARCH IN MATERIAL SCIENCE AND MECHANICAL ENGINEERING, PTS 1 AND 2, 2014, 446-447 : 1271 - 1278
  • [28] Review on path planning methods for autonomous underwater vehicle
    Mohanty, Prases K.
    Chaudhary, Vishnu
    Prajapati, Rahul
    PROCEEDINGS OF THE INSTITUTION OF MECHANICAL ENGINEERS PART M-JOURNAL OF ENGINEERING FOR THE MARITIME ENVIRONMENT, 2025, 239 (01) : 3 - 37
  • [29] Path Planning Based on Deep Reinforcement Learning for Autonomous Underwater Vehicles Under Ocean Current Disturbance
    Chu, Zhenzhong
    Wang, Fulun
    Lei, Tingjun
    Luo, Chaomin
    IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 8 (01): : 108 - 120
  • [30] Actor-Critic based Improper Reinforcement Learning
    Zaki, Mohammadi
    Mohan, Avinash
    Gopalan, Aditya
    Mannor, Shie
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,