Autonomous underwater vehicle path planning based on actor-multi-critic reinforcement learning

被引:16
|
作者
Wang, Zhuo [1 ,2 ]
Zhang, Shiwei [1 ]
Feng, Xiaoning [3 ]
Sui, Yancheng [1 ]
机构
[1] Harbin Engn Univ, Sci & Technol Underwater Vehicle Lab, Harbin, Peoples R China
[2] Peng Cheng Lab, Shenzhen, Peoples R China
[3] Harbin Engn Univ, Coll Comp Sci & Technol, Harbin 150001, Peoples R China
基金
中国国家自然科学基金;
关键词
Autonomous underwater vehicle; path planning; dynamic obstacle avoidance; actor-critic; neural networks; FEEDFORWARD NETWORKS; ENVIRONMENT;
D O I
10.1177/0959651820937085
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The environmental adaptability of autonomous underwater vehicles is always a problem for its path planning. Although reinforcement learning can improve the environmental adaptability, the slow convergence of reinforcement learning is caused by multi-behavior coupling, so it is difficult for autonomous underwater vehicle to avoid moving obstacles. This article proposes a multi-behavior critic reinforcement learning algorithm applied to autonomous underwater vehicle path planning to overcome problems associated with oscillating amplitudes and low learning efficiency in the early stages of training which are common in traditional actor-critic algorithms. Behavior critic reinforcement learning assesses the actions of the actor from perspectives such as energy saving and security, combining these aspects into a whole evaluation of the actor. In this article, the policy gradient method is selected as the actor part, and the value function method is selected as the critic part. The strategy gradient and the value function methods for actor and critic, respectively, are approximated by a backpropagation neural network, the parameters of which are updated using the gradient descent method. The simulation results show that the method has the ability of optimizing learning in the environment and can improve learning efficiency, which meets the needs of real time and adaptability for autonomous underwater vehicle dynamic obstacle avoidance.
引用
收藏
页码:1787 / 1796
页数:10
相关论文
共 50 条
  • [31] An obstacle avoiding method of autonomous underwater vehicle based on the reinforcement learning
    Li, Wenbiao
    Yang, Xian
    Yan, Jing
    Luo, Xiaoyuan
    PROCEEDINGS OF THE 39TH CHINESE CONTROL CONFERENCE, 2020, : 4538 - 4543
  • [32] Path Planning for Multi-Arm Manipulators Using Deep Reinforcement Learning: Soft Actor-Critic with Hindsight Experience Replay
    Prianto, Evan
    Kim, MyeongSeop
    Park, Jae-Han
    Bae, Ji-Hun
    Kim, Jung-Su
    SENSORS, 2020, 20 (20) : 1 - 23
  • [33] Reinforcement Learning for Underwater Spatiotemporal Path Planning, with Application to an Autonomous Marine Current Turbine
    Hasankhani, Arezoo
    Tang, Yufei
    VanZwieten, James
    2023 AMERICAN CONTROL CONFERENCE, ACC, 2023, : 3715 - 3721
  • [34] A deep residual reinforcement learning algorithm based on Soft Actor-Critic for autonomous navigation
    Wen, Shuhuan
    Shu, Yili
    Rad, Ahmad
    Wen, Zeteng
    Guo, Zhengzheng
    Gong, Simeng
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 259
  • [35] Actor Critic-based Multi Objective Reinforcement Learning for Multi Access Edge Computing
    Khot, Vishal
    Vallisha, M.
    Pai, Sharan S.
    Shekar, R. K. Chandra
    Kayarvizhy, N.
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2024, 15 (02) : 382 - 389
  • [36] Autonomous Underwater Vehicle Path Planning Based on Improved Salp Swarm Algorithm
    Guo, Xuan
    Zhao, Dongming
    Fan, Tingting
    Long, Fei
    Fang, Caihua
    Long, Yang
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2024, 12 (08)
  • [37] THREE-DIMENSIONAL PATH-FOLLOWING CONTROL OF AN AUTONOMOUS UNDERWATER VEHICLE BASED ON DEEP REINFORCEMENT LEARNING
    Liang, Zhenyu
    Qu, Xingru
    Zhang, Zhao
    Chen, Cong
    POLISH MARITIME RESEARCH, 2022, 29 (04) : 36 - 44
  • [38] Recovery Path Planning Algorithm Based on Dubins Curve for Autonomous Underwater Vehicle
    Shi, Binghua
    Su, Yixin
    Wang, Chen
    Wan, Lili
    Qi, Yue
    2018 IEEE 8TH INTERNATIONAL CONFERENCE ON UNDERWATER SYSTEM TECHNOLOGY: THEORY AND APPLICATIONS (USYS), 2018,
  • [39] Complete Coverage Path Planning of Autonomous Underwater Vehicle Based on GBNN Algorithm
    Daqi Zhu
    Chen Tian
    Bing Sun
    Chaomin Luo
    Journal of Intelligent & Robotic Systems, 2019, 94 : 237 - 249
  • [40] Lexicographic Actor-Critic Deep Reinforcement Learning for Urban Autonomous Driving
    Zhang, Hengrui
    Lin, Youfang
    Han, Sheng
    Lv, Kai
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (04) : 4308 - 4319