Understanding the stability of deep control policies for biped locomotion

被引:3
|
作者
Park, Hwangpil [1 ,3 ]
Yu, Ri [1 ]
Lee, Yoonsang [4 ]
Lee, Kyungho [5 ]
Lee, Jehee [2 ]
机构
[1] Seoul Natl Univ, Seoul, South Korea
[2] Seoul Natl Univ, Dept Comp Sci & Engn, Seoul, South Korea
[3] Samsung Elect, Suwon, South Korea
[4] Hanyang Univ, Comp Sci, Seoul, South Korea
[5] NC Soft, Sungnam, South Korea
来源
VISUAL COMPUTER | 2023年 / 39卷 / 01期
关键词
Biped locomotion; Deep reinforcement learning; Gait analysis; Physically based simulation; Push-recovery stability; RECOVERY;
D O I
10.1007/s00371-021-02342-9
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Achieving stability and robustness is the primary goal of biped locomotion control. Recently, deep reinforcement learning (DRL) has attracted great attention as a general methodology for constructing biped control policies and demonstrated significant improvements over the previous state-of-the-art control methods. Although deep control policies are more advantageous compared with previous controller design approaches, many questions remain: Are deep control policies as robust as human walking? Does simulated walking involve strategies similar to human walking for maintaining balance? Does a particular gait pattern affect human and simulated walking similarly? What do deep policies learn to achieve improved gait stability? The goal of this study is to address these questions by evaluating the push-recovery stability of deep policies compared with those of human subjects and a previous feedback controller. Furthermore, we conducted experiments to evaluate the effectiveness of variants of DRL algorithms.
引用
收藏
页码:473 / 487
页数:15
相关论文
共 50 条
  • [31] Balance and impedance control for biped humanoid robot locomotion
    Lim, H
    Setiawan, SA
    Takanishi, A
    IROS 2001: PROCEEDINGS OF THE 2001 IEEE/RJS INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-4: EXPANDING THE SOCIETAL ROLE OF ROBOTICS IN THE NEXT MILLENNIUM, 2001, : 494 - 499
  • [32] Reflex control of biped robot locomotion on a slippery surface
    Park, JH
    Kwon, O
    2001 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS I-IV, PROCEEDINGS, 2001, : 4134 - 4139
  • [33] CONTROL OF A BIPED LOCOMOTION SYSTEM IN A DOUBLE SUPPORT PHASE
    NARIKIYO, T
    ITO, M
    ROBOTICA, 1985, 3 (APR-) : 73 - 77
  • [34] Relationship between ZMP and stability on dynamic locomotion of biped robots
    Liu, Zhiyuan
    Dai, Shaoan
    Harbin Gongye Daxue Xuebao/Journal of Harbin Institute of Technology, 1994, 26 (01):
  • [35] BIPED LOCOMOTION ROBOTS
    ARIMOTO, S
    MIYAZAKI, F
    JAPAN ANNUAL REVIEWS IN ELECTRONICS COMPUTERS & TELECOMMUNICATIONS, 1984, 12 : 194 - 205
  • [36] BIPED LOCOMOTION.
    Johnson, Curtis D.
    Journal of Engineering Technology, 1985, 2 (01) : 6 - 12
  • [37] Experiments with nontraditional hybrid control technique of biped locomotion robots
    Vukobratovic, M
    Timcenko, O
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 1996, 16 (01) : 25 - 43
  • [38] Locomotion control of biped robots on irregularly protruded uneven surface
    Kim, Eung Seo
    Yeon, Je Sung
    Park, Jong Hyeon
    PROCEEDINGS OF THE 26TH IASTED INTERNATIONAL CONFERENCE ON MODELLING, IDENTIFICATION, AND CONTROL, 2007, : 51 - 56
  • [39] Sideward Locomotion Control of Biped Robots Based on Dynamics Morphing
    Atsuta, Hiroshi
    Sugihara, Tomomichi
    2014 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2014), 2014, : 959 - 964
  • [40] Sensor-based locomotion control system of biped robot
    Zhang, Y.X.
    Ma, L.
    Qiang, W.Y.
    Gaojishu Tongxin/High Technology Letters, 2001, 11 (06):