High-speed quadrupedal locomotion by imitation-relaxation reinforcement learning

Cited by: 26
Authors
Jin, Yongbin [1, 2, 3, 4]
Liu, Xianwei [1]
Shao, Yecheng [1, 4]
Wang, Hongtao [1, 2, 3, 4]
Yang, Wei [1, 2, 3, 4]
Affiliations
[1] Zhejiang Univ, Ctr X Mech, Hangzhou, Peoples R China
[2] Hangzhou Global Sci & Technol Innovat Ctr, ZJU, Hangzhou, Peoples R China
[3] Zhejiang Univ, State Key Lab Fluid Power & Mechatron Syst, Hangzhou, Peoples R China
[4] Zhejiang Univ, Inst Appl Mech, Hangzhou, Peoples R China
Keywords
ENTROPY STABILITY; DYNAMICS; DESIGN; ROBOT; MODEL;
DOI
10.1038/s42256-022-00576-3
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Fast and stable locomotion of legged robots involves demanding and contradictory requirements, in particular rapid control frequency as well as an accurate dynamics model. Benefiting from universal approximation ability and offline optimization of neural networks, reinforcement learning has been used to solve various challenging problems in legged robot locomotion; however, the optimal control of a quadruped robot requires optimizing multiple objectives such as keeping balance, improving efficiency, realizing periodic gait and following commands. These objectives cannot always be achieved simultaneously, especially at high speed. Here, we introduce an imitation-relaxation reinforcement learning (IRRL) method to optimize the objectives in stages. To bridge the gap between simulation and reality, we further introduce the concept of stochastic stability into system robustness analysis. The state space entropy decreasing rate is a quantitative metric and can sharply capture the occurrence of period-doubling bifurcation and possible chaos. By employing IRRL in training and the stochastic stability analysis, we are able to demonstrate a stable running speed of 5.0 m s⁻¹ for an MIT-MiniCheetah-like robot.
Pages: 1198 - 1208
Number of pages: 11
Related papers
50 in total
  • [41] Locomotion control of unmanned high-speed AWIDAWIS vehicle
    Ruan, Jiuhong
    Li, Yibin
    Rong, Xuewen
    Song, Rui
    Nongye Jixie Xuebao/Transactions of the Chinese Society of Agricultural Machinery, 2009, 40 (12) : 37 - 42
  • [42] A Parallel Actuated Pantograph Leg for High-speed Locomotion
    Guo, Wei
    Cai, Changrong
    Li, Mantian
    Zha, Fusheng
    Wang, Pengfei
    Wang, Kenan
    JOURNAL OF BIONIC ENGINEERING, 2017, 14 (02) : 202 - 217
  • [43] HINDLIMB DOMINANCE DURING PRIMATE HIGH-SPEED LOCOMOTION
    KIMURA, T
    PRIMATES, 1992, 33 (04) : 465 - 476
  • [44] Speed Regulation of Overhead Catenary System Inspection Robot for High-Speed Railway through Reinforcement Learning
    Li, Siqi
    Xu, Cheng
    Chen, Lipei
    Liu, Zhenmin
    2018 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTING, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2018 : 1378 - 1383
  • [45] Deep Reinforcement Learning Based Active Pantograph Control Strategy in High-Speed Railway
    Wang, Hui
    Han, Zhiwei
    Liu, Zhigang
    Wu, Yanbo
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 72 (01) : 227 - 238
  • [46] Reinforcement Learning-Based High-Speed Path Following Control for Autonomous Vehicles
    Liu, Jia
    Cui, Yunduan
    Duan, Jianghua
    Jiang, Zhengmin
    Pan, Zhongming
    Xu, Kun
    Li, Huiyun
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2024, 73 (06) : 7603 - 7615
  • [47] High-Speed Autonomous Racing Using Trajectory-Aided Deep Reinforcement Learning
    Evans, Benjamin David
    Engelbrecht, Herman Arnold
    Jordaan, Hendrik Willem
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 8 (09) : 5353 - 5359
  • [48] High-speed Railway Timetable Rescheduling Under Random Interruptions Based on Reinforcement Learning
    Pang Z.-S.
    Wang L.-W.
    Peng Q.-Y.
    Jiaotong Yunshu Xitong Gongcheng Yu Xinxi/Journal of Transportation Systems Engineering and Information Technology, 2023, 23 (05) : 279 - 289
  • [49] A Policy-based Reinforcement Learning Approach for High-speed Railway Timetable Rescheduling
    Wang, Yin
    Lv, Yisheng
    Zhou, Jianying
    Yuan, Zhiming
    Zhang, Qi
    Zhou, Min
    2021 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2021 : 2362 - 2367
  • [50] A Deep Reinforcement Learning Approach to High-speed Train Timetable Rescheduling under Disturbances
    Ning, Lingbin
    Li, Yidong
    Zhou, Min
    Song, Haifeng
    Dong, Hairong
    2019 IEEE INTELLIGENT TRANSPORTATION SYSTEMS CONFERENCE (ITSC), 2019 : 3469 - 3474