AUV 3D docking control using deep reinforcement learning

Cited by: 8
Authors
Zhang, Tianze [1]
Miao, Xuhong [2]
Li, Yibin [1]
Jia, Lei [3]
Wei, Zheng [2]
Gong, Qingtao [4]
Wen, Tao [5]
Affiliations
[1] Shandong Univ, Inst Marine Sci & Technol, Qingdao 266237, Shandong, Peoples R China
[2] Naval Res Acad, Beijing 100161, Peoples R China
[3] Shandong Univ, Sch Control Sci & Engn, Jinan 250061, Shandong, Peoples R China
[4] Ludong Univ, Ulsan Ship & Ocean Coll, Yantai 264025, Shandong, Peoples R China
[5] Beijing Jiaotong Univ, Sch Elect & Informat Engn, Beijing 100044, Peoples R China
Keywords
Autonomous underwater vehicle; Deep reinforcement learning; Docking control; Ocean currents; Wave disturbance; SYSTEM;
DOI
10.1016/j.oceaneng.2023.115021
Chinese Library Classification (CLC)
U6 [Waterway Transportation]; P75 [Ocean Engineering];
Discipline classification codes
0814; 081505; 0824; 082401;
Abstract
Autonomous docking can give an autonomous underwater vehicle (AUV) long endurance, so robust docking control under current and wave disturbances must be addressed. In this work, we develop a model-free docking controller based on the proximal policy optimization (PPO) algorithm to complete three-dimensional docking tasks under disturbances. To improve the performance of PPO, two mechanisms are proposed: adaptive rollback clipping and self-generated demonstration replay. A simulation environment is constructed that includes fuzzy hydrodynamic parameters and models of ocean current and wave disturbance. Simulation results demonstrate that the proposed method learns faster, is more robust, and can control the AUV to complete 3D docking tasks in complex environments with a high success rate.
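For readers unfamiliar with the baseline that the abstract builds on, the following is a minimal sketch of the standard PPO clipped surrogate objective. It is not the paper's method: the adaptive rollback clipping and self-generated demonstration replay mechanisms are not specified in this record and are not reproduced here. The function name `ppo_clip_objective`, its parameters, and the default `eps=0.2` are illustrative assumptions.

```python
def ppo_clip_objective(ratios, advantages, eps=0.2):
    """Standard PPO clipped surrogate objective (to be maximized).

    ratios     : probability ratios pi_new(a|s) / pi_old(a|s), one per sample
    advantages : advantage estimates, one per sample
    eps        : clip range (assumed default; the paper's value is not given here)

    NOTE: this is the plain PPO-Clip baseline only. The paper's adaptive
    rollback clipping is NOT implemented in this sketch.
    """
    if len(ratios) != len(advantages):
        raise ValueError("ratios and advantages must have the same length")
    if not ratios:
        raise ValueError("at least one sample is required")

    total = 0.0
    for r, a in zip(ratios, advantages):
        # Clip the ratio into [1 - eps, 1 + eps].
        r_clipped = max(1.0 - eps, min(r, 1.0 + eps))
        # Take the pessimistic (smaller) of the unclipped and clipped terms.
        total += min(r * a, r_clipped * a)
    return total / len(ratios)


if __name__ == "__main__":
    # Hypothetical sample values, chosen only to exercise both clip sides.
    print(ppo_clip_objective([1.5, 0.5, 1.0], [1.0, -1.0, 2.0]))
```

Taking the minimum of the clipped and unclipped terms removes the incentive to push the ratio far from 1, which is the behaviour that clipping variants such as the one named in the abstract aim to refine.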
Pages: 12
Related papers
50 in total
  • [1] Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer
    Beeching, Edward
    Dibangoye, Jilles
    Simonin, Olivier
    Wolf, Christian
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 158 - 165
  • [2] AUV Position Tracking Control Using End-to-End Deep Reinforcement Learning
    Carlucho, Ignacio
    De Paula, Mariano
    Wang, Sen
    Menna, Bruno V.
    Petillot, Yvan R.
    Acosta, Gerardo G.
    OCEANS 2018 MTS/IEEE CHARLESTON, 2018,
  • [3] Deep Reinforcement Learning Based Unmanned Aerial Vehicle (UAV) Control Using 3D Hand Gestures
    Khan, Fawad Salam
    Mohd, Mohd Norzali Haji
    Zulkifli, Saiful Azrin B. M.
    Abro, Ghulam E. Mustafa
    Kazi, Suhail
    Soomro, Dur Muhammad
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (03): : 5741 - 5759
  • [4] Target Search Control of AUV in Underwater Environment With Deep Reinforcement Learning
    Cao, Xiang
    Sun, Changyin
    Yan, Mingzhong
    IEEE ACCESS, 2019, 7 : 96549 - 96559
  • [5] Efficient 3D Homing Path Planning for AUV Docking
    Shi, Kai
    Wang, Xiaohui
    Wang, Yiqun
    Ma, Xiaoou
    2020 35TH YOUTH ACADEMIC ANNUAL CONFERENCE OF CHINESE ASSOCIATION OF AUTOMATION (YAC), 2020, : 353 - 358
  • [6] Autonomous 3D positional control of a magnetic microrobot using reinforcement learning
    Sarmad Ahmad Abbasi
    Awais Ahmed
    Seungmin Noh
    Nader Latifi Gharamaleki
    Seonhyoung Kim
    A. M. Masum Bulbul Chowdhury
    Jin-young Kim
    Salvador Pané
    Bradley J. Nelson
    Hongsoo Choi
    Nature Machine Intelligence, 2024, 6 : 92 - 105
  • [7] Autonomous 3D positional control of a magnetic microrobot using reinforcement learning
    Abbasi, Sarmad Ahmad
    Ahmed, Awais
    Noh, Seungmin
    Gharamaleki, Nader Latifi
    Kim, Seonhyoung
    Chowdhury, A. M. Masum Bulbul
    Kim, Jin-young
    Pane, Salvador
    Nelson, Bradley J.
    Choi, Hongsoo
    NATURE MACHINE INTELLIGENCE, 2024, 6 (01) : 92 - 105
  • [8] Automatic Drone Navigation in Realistic 3D Landscapes using Deep Reinforcement Learning
    Shin, Sang-Yun
    Kang, Yong-Won
    Kim, Yong-Guk
    2019 6TH INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT 2019), 2019, : 1072 - 1077
  • [9] Adaptive 3D UI Placement in Mixed Reality Using Deep Reinforcement Learning
    Lu, Feiyu
    Chen, Mengyu
    Hsu, Hsiang
    Deshpande, Pranav
    Wang, Cheng Yao
    MacIntyre, Blair
    EXTENDED ABSTRACTS OF THE 2024 CHI CONFERENCE ON HUMAN FACTORS IN COMPUTING SYSTEMS, CHI 2024, 2024,
  • [10] Direct Adaptive Pole-Placement Controller using Deep Reinforcement Learning: Application to AUV Control
    Chaffre, Thomas
    Le Chenadec, Gilles
    Sammut, Karl
    Chauveau, Estelle
    Clement, Benoit
    IFAC PAPERSONLINE, 2021, 54 (16): : 333 - 340