High-speed quadrupedal locomotion by imitation-relaxation reinforcement learning

Cited by: 26

Authors:

Jin, Yongbin [1,2,3,4]

Liu, Xianwei [1]

Shao, Yecheng [1,4]

Wang, Hongtao [1,2,3,4]

Yang, Wei [1,2,3,4]

Affiliations:

[1] Zhejiang Univ, Ctr X Mech, Hangzhou, Peoples R China

[2] Hangzhou Global Sci & Technol Innovat Ctr, ZJU, Hangzhou, Peoples R China

[3] Zhejiang Univ, State Key Lab Fluid Power & Mechatron Syst, Hangzhou, Peoples R China

[4] Zhejiang Univ, Inst Appl Mech, Hangzhou, Peoples R China

Source:

NATURE MACHINE INTELLIGENCE | 2022 / Vol. 4 / Issue 12

Keywords:

ENTROPY STABILITY; DYNAMICS; DESIGN; ROBOT; MODEL;

DOI:

10.1038/s42256-022-00576-3

Chinese Library Classification (CLC) number:

TP18 [Theory of artificial intelligence];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Fast and stable locomotion of legged robots involves demanding and contradictory requirements, in particular rapid control frequency as well as an accurate dynamics model. Benefiting from universal approximation ability and offline optimization of neural networks, reinforcement learning has been used to solve various challenging problems in legged robot locomotion; however, the optimal control of a quadruped robot requires optimizing multiple objectives such as keeping balance, improving efficiency, realizing periodic gait and following commands. These objectives cannot always be achieved simultaneously, especially at high speed. Here, we introduce an imitation-relaxation reinforcement learning (IRRL) method to optimize the objectives in stages. To bridge the gap between simulation and reality, we further introduce the concept of stochastic stability into system robustness analysis. The state space entropy decreasing rate is a quantitative metric and can sharply capture the occurrence of period-doubling bifurcation and possible chaos. By employing IRRL in training and the stochastic stability analysis, we are able to demonstrate a stable running speed of 5.0 m s^{-1} for an MIT-MiniCheetah-like robot.

Pages: 1198 - 1208

Page count: 11

50 entries in total

[1] High-speed quadrupedal locomotion by imitation-relaxation reinforcement learning
Yongbin Jin
Xianwei Liu
Yecheng Shao
Hongtao Wang
Wei Yang
Nature Machine Intelligence, 2022, 4 : 1198 - 1208
[2] High speed locomotion for a quadrupedal microrobot
Baisch, Andrew T.
Ozcan, Onur
Goldberg, Benjamin
Ithier, Daniel
Wood, Robert J.
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2014, 33 (08) : 1063 - 1082
[3] Learning Quadrupedal High-Speed Running on Uneven Terrain
Han, Xinyu
Zhao, Mingguo
Chen, Xuechao
Ma, Gan
BIOMIMETICS, 2024, 9 (01)
[4] Policy gradient reinforcement learning for fast quadrupedal locomotion
Kohl, N
Stone, P
2004 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-5, PROCEEDINGS, 2004 : 2619 - 2624
[5] Automated Hyperparameter Tuning in Reinforcement Learning for Quadrupedal Robot Locomotion
Kim, Myeongseop
Kim, Jung-Su
Park, Jae-Han
ELECTRONICS, 2024, 13 (01)
[6] A Leg Design Method for High Speed Quadrupedal Locomotion
Dallas, Spyridon
Machairas, Konstantinos
Koutsoukis, Konstantinos
Papadopoulos, Evangelos
2017 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2017 : 4877 - 4882
[7] Reinforcement Learning for High-Speed Quadrupedal Locomotion With Motor Operating Region Constraints: Mitigating Motor Model Discrepancies through Torque Clipping in Realistic Motor Operating Region
Shin, Young-Ha
Song, Tae-Gyu
Ji, Gwanghyeon
Park, Hae-Won
IEEE ROBOTICS & AUTOMATION MAGAZINE, 2024
[8] Learning Multiple-Gait Quadrupedal Locomotion via Hierarchical Reinforcement Learning
Wei, Lang
Li, Yunxiang
Ai, Yunfei
Wu, Yuze
Xu, Hao
Wang, Wei
INTERNATIONAL JOURNAL OF PRECISION ENGINEERING AND MANUFACTURING, 2023, 24 (9) : 1599 - 1613
[9] Learning Risk-Aware Quadrupedal Locomotion using Distributional Reinforcement Learning
Schneider, Lukas
Frey, Jonas
Miki, Takahiro
Hutter, Marco
2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024), 2024 : 11451 - 11458
[10] Learning Multiple-Gait Quadrupedal Locomotion via Hierarchical Reinforcement Learning
Lang Wei
Yunxiang Li
Yunfei Ai
Yuze Wu
Hao Xu
Wei Wang
Guoming Hu
International Journal of Precision Engineering and Manufacturing, 2023, 24 : 1599 - 1613
