Enhancing efficiency and propulsion in bio-mimetic robotic fish through end-to-end deep reinforcement learning

被引:7
|
作者
Cui, Xinyu [1 ,2 ]
Sun, Boai [3 ]
Zhu, Yi [4 ,5 ]
Yang, Ning [1 ,2 ]
Zhang, Haifeng [1 ,2 ]
Cui, Weicheng [4 ,5 ]
Fan, Dixia [4 ,5 ]
Wang, Jun [6 ]
机构
[1] Inst Automat, Inst Automat, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[3] Zhejiang Univ, Zhejiang Univ Westlake Univ Joint Training, Hangzhou 310027, Peoples R China
[4] Westlake Univ, Sch Engn, Key Lab Coastal Environm & Resources Zhejiang Pro, Hangzhou 310030, Peoples R China
[5] Inst Adv Technol, Westlake Inst Adv Study, Hangzhou 310024, Peoples R China
[6] UCL, Dept Comp Sci, London WC1E 6BT, England
关键词
CARTESIAN-GRID SIMULATIONS; MULTIJOINT; OPTIMIZATION;
D O I
10.1063/5.0192993
中图分类号
O3 [力学];
学科分类号
08 ; 0801 ;
摘要
Aquatic organisms are known for their ability to generate efficient propulsion with low energy expenditure. While existing research has sought to leverage bio-inspired structures to reduce energy costs in underwater robotics, the crucial role of control policies in enhancing efficiency has often been overlooked. In this study, we optimize the motion of a bio-mimetic robotic fish using deep reinforcement learning (DRL) to maximize propulsion efficiency and minimize energy consumption. Our novel DRL approach incorporates extended pressure perception, a transformer model processing sequences of observations, and a policy transfer scheme. Notably, significantly improved training stability and speed within our approach allow for end-to-end training of the robotic fish. This enables agiler responses to hydrodynamic environments and possesses greater optimization potential compared to pre-defined motion pattern controls. Our experiments are conducted on a serially connected rigid robotic fish in a free stream with a Reynolds number of 6000 using computational fluid dynamics simulations. The DRL-trained policies yield impressive results, demonstrating both high efficiency and propulsion. The policies also showcase the agent's embodiment, skillfully utilizing its body structure and engaging with surrounding fluid dynamics, as revealed through flow analysis. This study provides valuable insights into the bio-mimetic underwater robots optimization through DRL training, capitalizing on their structural advantages, and ultimately contributing to more efficient underwater propulsion systems.
引用
收藏
页数:13
相关论文
共 50 条
  • [21] An End-to-End Robotic Visual Localization Algorithm Based on Deep Learning
    Chen, Niansheng
    Wang, Hongcheng
    Fan, Guangyu
    Yang, Dingyu
    Rao, Lei
    JOURNAL OF SENSORS, 2023, 2023
  • [22] End-to-End Navigation Strategy With Deep Reinforcement Learning for Mobile Robots
    Shi, Haobin
    Shi, Lin
    Xu, Meng
    Hwang, Kao-Shing
    IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2020, 16 (04) : 2393 - 2402
  • [23] End-to-End Autonomous Driving Decision Based on Deep Reinforcement Learning
    Huang Z.-Q.
    Qu Z.-W.
    Zhang J.
    Zhang Y.-X.
    Tian R.
    Tien Tzu Hsueh Pao/Acta Electronica Sinica, 2020, 48 (09): : 1711 - 1719
  • [24] Deep reinforcement learning framework for end-to-end semiconductor process control
    Hirtz T.
    Tian H.
    Shahzad S.
    Wu F.
    Yang Y.
    Ren T.-L.
    Neural Computing and Applications, 2024, 36 (20) : 12443 - 12460
  • [25] GeneWorker: An end-to-end robotic reinforcement learning approach with collaborative generator and worker networks
    Wang, Hao
    Man, Hengyu
    Cui, Wenxue
    Lu, Riyu
    Cai, Chenxin
    Fan, Xiaopeng
    NEURAL NETWORKS, 2024, 178
  • [26] Deep Reinforcement Learning Based End-to-End Multiuser Channel Prediction and Beamforming
    Chu, Man
    Liu, An
    Lau, Vincent K. N.
    Jiang, Chen
    Yang, Tingting
    IEEE TRANSACTIONS ON WIRELESS COMMUNICATIONS, 2022, 21 (12) : 10271 - 10285
  • [27] AUV Position Tracking Control Using End-to-End Deep Reinforcement Learning
    Carlucho, Ignacio
    De Paula, Mariano
    Wang, Sen
    Menna, Bruno V.
    Petillot, Yvan R.
    Acosta, Gerardo G.
    OCEANS 2018 MTS/IEEE CHARLESTON, 2018,
  • [28] Early Failure Detection of Deep End-to-End Control Policy by Reinforcement Learning
    Lee, Keuntaek
    Saigol, Kamil
    Theodorou, Evangelos A.
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 8543 - 8549
  • [29] End-to-End Multitarget Flexible Job Shop Scheduling With Deep Reinforcement Learning
    Wang, Rongkai
    Jing, Yiyang
    Gu, Chaojie
    He, Shibo
    Chen, Jiming
    IEEE INTERNET OF THINGS JOURNAL, 2025, 12 (04): : 4420 - 4434
  • [30] An End-to-End Deep Reinforcement Learning Framework for Electric Vehicle Routing Problem
    Wang, Mengqin
    Wei, Yanling
    Huang, Xueliang
    Gao, Shan
    IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (20): : 33671 - 33682