An Efficient Multi-AUV Cooperative Navigation Method Based on Hierarchical Reinforcement Learning

被引:1
|
作者
Zhu, Zixiao [1 ,2 ]
Zhang, Lichuan [1 ,2 ]
Liu, Lu [1 ,2 ]
Wu, Dongwei [3 ]
Bai, Shuchang [1 ,2 ]
Ren, Ranzhen [1 ,2 ]
Geng, Wenlong [1 ,2 ]
机构
[1] Northwestern Polytech Univ, Res & Dev Inst, Shenzhen 518057, Peoples R China
[2] Northwestern Polytech Univ, Sch Marine Sci & Technol, Xian 710072, Peoples R China
[3] Shanghai Suixun Elect Technol Co Ltd, Shanghai 200438, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
SMDP; AUV; cooperative navigation; hierarchical reinforcement learning; abstract action; Q-learning; FILTER;
D O I
10.3390/jmse11101863
中图分类号
U6 [水路运输]; P75 [海洋工程];
学科分类号
0814 ; 081505 ; 0824 ; 082401 ;
摘要
Positioning errors introduced by low-precision navigation devices can affect the overall accuracy of a positioning system. To address this issue, this paper proposes a master-slave multi-AUV collaborative navigation method based on hierarchical reinforcement learning. First, a collaborative navigation system is modeled as a discrete semi-Markov process with defined state and action sets and reward functions. Second, trajectory planning is performed using a hierarchical reinforcement learning-based approach combined with the polar Kalman filter to reduce the positioning error of slave AUVs, realizing collaborative navigation in multi-slave AUV scenarios. The proposed collaborative navigation method is analyzed and validated by simulation experiments in terms of the relative distance between the master and slave AUVs and the positioning error of a slave AUV. The research results show that the proposed method can not only successfully reduce the observation and positioning errors of slave AUVs in the collaborative navigation process but can also effectively maintain the relative measurement distance between the master and slave AUVs within an appropriate range.
引用
收藏
页数:21
相关论文
共 50 条
  • [21] Multi-AUV Target Recognition Method Based on GAN-meta Learning
    Sun, Qiankun
    Cai, Lei
    2020 5TH INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (ICARM 2020), 2020, : 374 - 379
  • [22] An Improved DSA-Based Approach for Multi-AUV Cooperative Search
    Ni, Jianjun
    Yang, Liu
    Shi, Pengfei
    Luo, Chengming
    COMPUTATIONAL INTELLIGENCE AND NEUROSCIENCE, 2018, 2018
  • [23] Multi-AUV Cooperative Task Allocation Based on Improved Contract Network
    Li, Juan
    Zhang, Kunyu
    Xia, Guoqing
    2017 IEEE INTERNATIONAL CONFERENCE ON MECHATRONICS AND AUTOMATION (ICMA), 2017, : 608 - 613
  • [24] A multi-AUV cooperative localization method based on adaptive neuro-fuzzy inference system
    Xu B.
    Li S.
    Wang L.
    Duan T.
    Yao H.
    Zhongguo Guanxing Jishu Xuebao/Journal of Chinese Inertial Technology, 2019, 27 (04): : 440 - 447
  • [25] A dynamic velocity potential field method for multi-AUV cooperative hunting tasks
    Zhao, Zhenyi
    Zhang, Yuzhong
    Feng, Xinglong
    Jiang, Chuan
    Su, Wenbin
    Hu, Qiao
    OCEAN ENGINEERING, 2024, 295
  • [26] Potential field hierarchical reinforcement learning approach for target search by multi-AUV in 3-D underwater environments
    Cao, Xiang
    Sun, Hongbing
    Guo, Liqiang
    INTERNATIONAL JOURNAL OF CONTROL, 2020, 93 (07) : 1677 - 1683
  • [27] Communication-Constrained Multi-AUV Cooperative SLAM
    Paull, Liam
    Huang, Guoquan
    Seto, Mae
    Leonard, John J.
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2015, : 509 - 516
  • [28] Cooperative Multi-AUV Convoy Protection with Ocean Currents
    Yang Yang
    Xu Demin
    Zhang Bingyu
    2013 32ND CHINESE CONTROL CONFERENCE (CCC), 2013, : 2287 - 2292
  • [29] An Efficient Underwater Coverage Method for Multi-AUV with Sea Current Disturbances
    Jung, Yeun-Soo
    Lee, Kong-Woo
    Lee, Seong-Yong
    Choi, Myoung Hwan
    Lee, Beom-Hee
    INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2009, 7 (04) : 615 - 629
  • [30] An efficient underwater coverage method for multi-AUV with sea current disturbances
    Yeun-Soo Jung
    Kong-Woo Lee
    Seong-Yong Lee
    Myoung Hwan Choi
    Beom-Hee Lee
    International Journal of Control, Automation and Systems, 2009, 7 : 615 - 629