High-speed quadrupedal locomotion by imitation-relaxation reinforcement learning

被引：26

作者：

Jin, Yongbin ^{[1
,2
,3
,4
]}

Liu, Xianwei ^{[1
]}

Shao, Yecheng ^{[1
,4
]}

Wang, Hongtao ^{[1
,2
,3
,4
]}

Yang, Wei ^{[1
,2
,3
,4
]}

机构：

[1] Zhejiang Univ, Ctr X Mech, Hangzhou, Peoples R China

[2] Hangzhou Global Sci & Technol Innovat Ctr, ZJU, Hangzhou, Peoples R China

[3] Zhejiang Univ, State Key Lab Fluid Power & Mechatron Syst, Hangzhou, Peoples R China

[4] Zhejiang Univ, Inst Appl Mech, Hangzhou, Peoples R China

来源：

NATURE MACHINE INTELLIGENCE | 2022年 / 4卷 / 12期

关键词：

ENTROPY STABILITY; DYNAMICS; DESIGN; ROBOT; MODEL;

D O I：

10.1038/s42256-022-00576-3

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Fast and stable locomotion of legged robots involves demanding and contradictory requirements, in particular rapid control frequency as well as an accurate dynamics model. Benefiting from universal approximation ability and offline optimization of neural networks, reinforcement learning has been used to solve various challenging problems in legged robot locomotion; however, the optimal control of quadruped robot requires optimizing multiple objectives such as keeping balance, improving efficiency, realizing periodic gait and following commands. These objectives cannot always be achieved simultaneously, especially at high speed. Here, we introduce an imitation-relaxation reinforcement learning (IRRL) method to optimize the objectives in stages. To bridge the gap between simulation and reality, we further introduce the concept of stochastic stability into system robustness analysis. The state space entropy decreasing rate is a quantitative metric and can sharply capture the occurrence of period-doubling bifurcation and possible chaos. By employing IRRL in training and the stochastic stability analysis, we are able to demonstrate a stable running speed of 5.0 m s(-1) for a MIT-MiniCheetah-like robot.

引用

页码：1198 / 1208

页数：11

共 50 条

[31] Hierarchical Terrain-Aware Control for Quadrupedal Locomotion by Combining Deep Reinforcement Learning and Optimal Control
Yao, Qingfeng
Wang, Jilong
Wang, Donglin
Yang, Shuyu
Zhang, Hongyin
Wang, Yinuo
Wu, Zhengqing
2021 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2021, : 4546 - 4551
[32] CPG-Based Hierarchical Locomotion Control for Modular Quadrupedal Robots Using Deep Reinforcement Learning
Wang, Jiayu
Hu, Chuxiong
Zhu, Yu
IEEE ROBOTICS AND AUTOMATION LETTERS, 2021, 6 (04) : 7193 - 7200
[33] Learning Advanced Locomotion for Quadrupedal Robots: A Distributed Multi-Agent Reinforcement Learning Framework with Riemannian Motion Policies
Wang, Yuliu
Sagawa, Ryusuke
Yoshiyasu, Yusuke
ROBOTICS, 2024, 13 (06)
[34] Robust High-Speed Running for Quadruped Robots via Deep Reinforcement Learning
Bellegarda, Guillaume
Chen, Yiyu
Liu, Zhuochen
Quan Nguyen
2022 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS), 2022, : 10364 - 10370
[35] Closing the Dynamics Gap via Adversarial and Reinforcement Learning for High-Speed Racing
Niu, Jingyu
Hu, Yu
Li, Wei
Huang, Guangyan
Han, Yinhe
Li, Xiaowei
2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
[36] Two-Stage Safe Reinforcement Learning for High-Speed Autonomous Racing
Niu, Jingyu
Hu, Yu
Jin, Beibei
Han, Yinhe
Li, Xiaowei
2020 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC), 2020, : 3934 - 3941
[37] Interactive Force Analysis of High-speed Locomotion of Quadruped
Huang, Senwei
Zhang, Xiuli
2022 INTERNATIONAL CONFERENCE ON INDUSTRIAL AUTOMATION, ROBOTICS AND CONTROL ENGINEERING, IARCE, 2022, : 9 - 14
[38] Potential Game Based Task Offloading in the High-Speed Railway With Reinforcement Learning
Wu, Wei
Song, Haifeng
Wang, Hongwei
Dong, Hairong
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 24 (11) : 12671 - 12685
[39] Altitude control for high-speed vehicles in the cruise phase based on reinforcement learning
Chi H.
Yu F.
Guo Z.
Harbin Gongcheng Daxue Xuebao/Journal of Harbin Engineering University, 2021, 42 (09): : 1340 - 1346and1362
[40] A parallel actuated pantograph leg for high-speed locomotion
Wei Guo
Changrong Cai
Mantian Li
Fusheng Zha
Pengfei Wang
Kenan Wang
Journal of Bionic Engineering, 2017, 14 : 202 - 217

← 1 2 3 4 5 →