High-speed quadrupedal locomotion by imitation-relaxation reinforcement learning

被引:26
|
作者
Jin, Yongbin [1 ,2 ,3 ,4 ]
Liu, Xianwei [1 ]
Shao, Yecheng [1 ,4 ]
Wang, Hongtao [1 ,2 ,3 ,4 ]
Yang, Wei [1 ,2 ,3 ,4 ]
机构
[1] Zhejiang Univ, Ctr X Mech, Hangzhou, Peoples R China
[2] Hangzhou Global Sci & Technol Innovat Ctr, ZJU, Hangzhou, Peoples R China
[3] Zhejiang Univ, State Key Lab Fluid Power & Mechatron Syst, Hangzhou, Peoples R China
[4] Zhejiang Univ, Inst Appl Mech, Hangzhou, Peoples R China
关键词
ENTROPY STABILITY; DYNAMICS; DESIGN; ROBOT; MODEL;
D O I
10.1038/s42256-022-00576-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Fast and stable locomotion of legged robots involves demanding and contradictory requirements, in particular rapid control frequency as well as an accurate dynamics model. Benefiting from universal approximation ability and offline optimization of neural networks, reinforcement learning has been used to solve various challenging problems in legged robot locomotion; however, the optimal control of quadruped robot requires optimizing multiple objectives such as keeping balance, improving efficiency, realizing periodic gait and following commands. These objectives cannot always be achieved simultaneously, especially at high speed. Here, we introduce an imitation-relaxation reinforcement learning (IRRL) method to optimize the objectives in stages. To bridge the gap between simulation and reality, we further introduce the concept of stochastic stability into system robustness analysis. The state space entropy decreasing rate is a quantitative metric and can sharply capture the occurrence of period-doubling bifurcation and possible chaos. By employing IRRL in training and the stochastic stability analysis, we are able to demonstrate a stable running speed of 5.0 m s(-1) for a MIT-MiniCheetah-like robot.
引用
收藏
页码:1198 / 1208
页数:11
相关论文
共 50 条
  • [1] High-speed quadrupedal locomotion by imitation-relaxation reinforcement learning
    Yongbin Jin
    Xianwei Liu
    Yecheng Shao
    Hongtao Wang
    Wei Yang
    Nature Machine Intelligence, 2022, 4 : 1198 - 1208
  • [2] High speed locomotion for a quadrupedal microrobot
    Baisch, Andrew T.
    Ozcan, Onur
    Goldberg, Benjamin
    Ithier, Daniel
    Wood, Robert J.
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2014, 33 (08): : 1063 - 1082
  • [3] Learning Quadrupedal High-Speed Running on Uneven Terrain
    Han, Xinyu
    Zhao, Mingguo
    Chen, Xuechao
    Ma, Gan
    BIOMIMETICS, 2024, 9 (01)
  • [4] Policy gradient reinforcement learning for fast quadrupedal locomotion
    Kohl, N
    Stone, P
    2004 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1- 5, PROCEEDINGS, 2004, : 2619 - 2624
  • [5] Automated Hyperparameter Tuning in Reinforcement Learning for Quadrupedal Robot Locomotion
    Kim, Myeongseop
    Kim, Jung-Su
    Park, Jae-Han
    ELECTRONICS, 2024, 13 (01)
  • [6] A Leg Design Method for High Speed Quadrupedal Locomotion
    Dallas, Spyridon
    Machairas, Konstantinos
    Koutsoukis, Konstantinos
    Papadopoulos, Evangelos
    2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017, : 4877 - 4882
  • [7] Reinforcement Learning for High-Speed Quadrupedal Locomotion With Motor Operating Region Constraints: Mitigating Motor Model Discrepancies through Torque Clipping in Realistic Motor Operating Region
    Shin, Young-Ha
    Song, Tae-Gyu
    Ji, Gwanghyeon
    Park, Hae-Won
    IEEE ROBOTICS & AUTOMATION MAGAZINE, 2024,
  • [8] Learning Multiple-Gait Quadrupedal Locomotion via Hierarchical Reinforcement Learning
    Wei, Lang
    Li, Yunxiang
    Ai, Yunfei
    Wu, Yuze
    Xu, Hao
    Wang, Wei
    INTERNATIONAL JOURNAL OF PRECISION ENGINEERING AND MANUFACTURING, 2023, 24 (9) : 1599 - 1613
  • [9] Learning Risk-Aware Quadrupedal Locomotion using Distributional Reinforcement Learning
    Schneider, Lukas
    Frey, Jonas
    Miki, Takahiro
    Hutter, Marco
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024), 2024, : 11451 - 11458
  • [10] Learning Multiple-Gait Quadrupedal Locomotion via Hierarchical Reinforcement Learning
    Lang Wei
    Yunxiang Li
    Yunfei Ai
    Yuze Wu
    Hao Xu
    Wei Wang
    Guoming Hu
    International Journal of Precision Engineering and Manufacturing, 2023, 24 : 1599 - 1613