High-speed quadrupedal locomotion by imitation-relaxation reinforcement learning

被引:26
|
作者
Jin, Yongbin [1 ,2 ,3 ,4 ]
Liu, Xianwei [1 ]
Shao, Yecheng [1 ,4 ]
Wang, Hongtao [1 ,2 ,3 ,4 ]
Yang, Wei [1 ,2 ,3 ,4 ]
机构
[1] Zhejiang Univ, Ctr X Mech, Hangzhou, Peoples R China
[2] Hangzhou Global Sci & Technol Innovat Ctr, ZJU, Hangzhou, Peoples R China
[3] Zhejiang Univ, State Key Lab Fluid Power & Mechatron Syst, Hangzhou, Peoples R China
[4] Zhejiang Univ, Inst Appl Mech, Hangzhou, Peoples R China
关键词
ENTROPY STABILITY; DYNAMICS; DESIGN; ROBOT; MODEL;
D O I
10.1038/s42256-022-00576-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fast and stable locomotion of legged robots involves demanding and contradictory requirements, in particular rapid control frequency as well as an accurate dynamics model. Benefiting from universal approximation ability and offline optimization of neural networks, reinforcement learning has been used to solve various challenging problems in legged robot locomotion; however, the optimal control of quadruped robot requires optimizing multiple objectives such as keeping balance, improving efficiency, realizing periodic gait and following commands. These objectives cannot always be achieved simultaneously, especially at high speed. Here, we introduce an imitation-relaxation reinforcement learning (IRRL) method to optimize the objectives in stages. To bridge the gap between simulation and reality, we further introduce the concept of stochastic stability into system robustness analysis. The state space entropy decreasing rate is a quantitative metric and can sharply capture the occurrence of period-doubling bifurcation and possible chaos. By employing IRRL in training and the stochastic stability analysis, we are able to demonstrate a stable running speed of 5.0 m s(-1) for a MIT-MiniCheetah-like robot.
引用
收藏
页码:1198 / 1208
页数:11
相关论文
共 50 条
  • [31] Hierarchical Terrain-Aware Control for Quadrupedal Locomotion by Combining Deep Reinforcement Learning and Optimal Control
    Yao, Qingfeng
    Wang, Jilong
    Wang, Donglin
    Yang, Shuyu
    Zhang, Hongyin
    Wang, Yinuo
    Wu, Zhengqing
    2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 4546 - 4551
  • [32] CPG-Based Hierarchical Locomotion Control for Modular Quadrupedal Robots Using Deep Reinforcement Learning
    Wang, Jiayu
    Hu, Chuxiong
    Zhu, Yu
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) : 7193 - 7200
  • [33] Learning Advanced Locomotion for Quadrupedal Robots: A Distributed Multi-Agent Reinforcement Learning Framework with Riemannian Motion Policies
    Wang, Yuliu
    Sagawa, Ryusuke
    Yoshiyasu, Yusuke
    ROBOTICS, 2024, 13 (06)
  • [34] Robust High-Speed Running for Quadruped Robots via Deep Reinforcement Learning
    Bellegarda, Guillaume
    Chen, Yiyu
    Liu, Zhuochen
    Quan Nguyen
    2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 10364 - 10370
  • [35] Closing the Dynamics Gap via Adversarial and Reinforcement Learning for High-Speed Racing
    Niu, Jingyu
    Hu, Yu
    Li, Wei
    Huang, Guangyan
    Han, Yinhe
    Li, Xiaowei
    2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [36] Two-Stage Safe Reinforcement Learning for High-Speed Autonomous Racing
    Niu, Jingyu
    Hu, Yu
    Jin, Beibei
    Han, Yinhe
    Li, Xiaowei
    2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 3934 - 3941
  • [37] Interactive Force Analysis of High-speed Locomotion of Quadruped
    Huang, Senwei
    Zhang, Xiuli
    2022 INTERNATIONAL CONFERENCE ON INDUSTRIAL AUTOMATION, ROBOTICS AND CONTROL ENGINEERING, IARCE, 2022, : 9 - 14
  • [38] Potential Game Based Task Offloading in the High-Speed Railway With Reinforcement Learning
    Wu, Wei
    Song, Haifeng
    Wang, Hongwei
    Dong, Hairong
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (11) : 12671 - 12685
  • [39] Altitude control for high-speed vehicles in the cruise phase based on reinforcement learning
    Chi H.
    Yu F.
    Guo Z.
    Harbin Gongcheng Daxue Xuebao/Journal of Harbin Engineering University, 2021, 42 (09): : 1340 - 1346and1362
  • [40] A parallel actuated pantograph leg for high-speed locomotion
    Wei Guo
    Changrong Cai
    Mantian Li
    Fusheng Zha
    Pengfei Wang
    Kenan Wang
    Journal of Bionic Engineering, 2017, 14 : 202 - 217