Enhancing efficiency and propulsion in bio-mimetic robotic fish through end-to-end deep reinforcement learning

被引:7
|
作者
Cui, Xinyu [1 ,2 ]
Sun, Boai [3 ]
Zhu, Yi [4 ,5 ]
Yang, Ning [1 ,2 ]
Zhang, Haifeng [1 ,2 ]
Cui, Weicheng [4 ,5 ]
Fan, Dixia [4 ,5 ]
Wang, Jun [6 ]
机构
[1] Inst Automat, Inst Automat, Beijing 100190, Peoples R China
[2] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
[3] Zhejiang Univ, Zhejiang Univ Westlake Univ Joint Training, Hangzhou 310027, Peoples R China
[4] Westlake Univ, Sch Engn, Key Lab Coastal Environm & Resources Zhejiang Pro, Hangzhou 310030, Peoples R China
[5] Inst Adv Technol, Westlake Inst Adv Study, Hangzhou 310024, Peoples R China
[6] UCL, Dept Comp Sci, London WC1E 6BT, England
关键词
CARTESIAN-GRID SIMULATIONS; MULTIJOINT; OPTIMIZATION;
D O I
10.1063/5.0192993
中图分类号
O3 [力学];
学科分类号
08 ; 0801 ;
摘要
Aquatic organisms are known for their ability to generate efficient propulsion with low energy expenditure. While existing research has sought to leverage bio-inspired structures to reduce energy costs in underwater robotics, the crucial role of control policies in enhancing efficiency has often been overlooked. In this study, we optimize the motion of a bio-mimetic robotic fish using deep reinforcement learning (DRL) to maximize propulsion efficiency and minimize energy consumption. Our novel DRL approach incorporates extended pressure perception, a transformer model processing sequences of observations, and a policy transfer scheme. Notably, significantly improved training stability and speed within our approach allow for end-to-end training of the robotic fish. This enables agiler responses to hydrodynamic environments and possesses greater optimization potential compared to pre-defined motion pattern controls. Our experiments are conducted on a serially connected rigid robotic fish in a free stream with a Reynolds number of 6000 using computational fluid dynamics simulations. The DRL-trained policies yield impressive results, demonstrating both high efficiency and propulsion. The policies also showcase the agent's embodiment, skillfully utilizing its body structure and engaging with surrounding fluid dynamics, as revealed through flow analysis. This study provides valuable insights into the bio-mimetic underwater robots optimization through DRL training, capitalizing on their structural advantages, and ultimately contributing to more efficient underwater propulsion systems.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] End-to-end UAV obstacle avoidance decision based on deep reinforcement learning
    Zhang, Yunyan
    Wei, Yao
    Liu, Hao
    Yang, Yao
    Xibei Gongye Daxue Xuebao/Journal of Northwestern Polytechnical University, 2022, 40 (05): : 1055 - 1064
  • [32] An End-to-End Deep Reinforcement Learning Method for UAV Autonomous Motion Planning
    Cui, Yangjie
    Dong, Xin
    Li, Daochun
    Tu, Zhan
    2022 7TH INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION ENGINEERING, ICRAE, 2022, : 100 - 104
  • [33] End-to-end Deep Reinforcement Learning for Multi-agent Collaborative Exploration
    Chen, Zichen
    Subagdja, Budhitama
    Tan, Ah-Hwee
    2019 IEEE INTERNATIONAL CONFERENCE ON AGENTS (ICA), 2019, : 99 - 102
  • [34] Interpretable End-to-End Urban Autonomous Driving With Latent Deep Reinforcement Learning
    Chen, Jianyu
    Li, Shengbo Eben
    Tomizuka, Masayoshi
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (06) : 5068 - 5078
  • [35] End-to-end deep reinforcement learning and control with multimodal perception for planetary robotic dual peg-in-hole assembly
    Li, Boxin
    Wang, Zhaokui
    ADVANCES IN SPACE RESEARCH, 2024, 74 (11) : 5860 - 5873
  • [36] Enhancing scene understanding based on deep learning for end-to-end autonomous driving
    Hu, Jie
    Kong, Huifang
    Zhang, Qian
    Liu, Runwu
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2022, 116
  • [37] Model-Based: End-to-End Molecular Communication System Through Deep Reinforcement Learning Auto Encoder
    Mohamed, Soha
    Dong, Jian
    Junejo, A. R.
    Zuo, De Cheng
    IEEE ACCESS, 2019, 7 : 70279 - 70286
  • [38] Eagle: End-to-end Deep Reinforcement Learning based Autonomous Control of PTZ Cameras
    Sandha, Sandeep Singh
    Balaji, Bharathan
    Garcia, Luis
    Srivastava, Mani
    PROCEEDINGS 8TH ACM/IEEE CONFERENCE ON INTERNET OF THINGS DESIGN AND IMPLEMENTATION, IOTDI 2023, 2023, : 144 - 157
  • [39] End-to-End AUV Local Motion Planning Method Based on Deep Reinforcement Learning
    Lyu, Xi
    Sun, Yushan
    Wang, Lifeng
    Tan, Jiehui
    Zhang, Liwen
    JOURNAL OF MARINE SCIENCE AND ENGINEERING, 2023, 11 (09)
  • [40] End-to-End Autonomous Navigation Based on Deep Reinforcement Learning with a Survival Penalty Function
    Jeng, Shyr-Long
    Chiang, Chienhsun
    SENSORS, 2023, 23 (20)