Understanding the stability of deep control policies for biped locomotion

Cited by: 3
Authors
Park, Hwangpil [1, 3]
Yu, Ri [1]
Lee, Yoonsang [4]
Lee, Kyungho [5]
Lee, Jehee [2]
Affiliations
[1] Seoul Natl Univ, Seoul, South Korea
[2] Seoul Natl Univ, Dept Comp Sci & Engn, Seoul, South Korea
[3] Samsung Elect, Suwon, South Korea
[4] Hanyang Univ, Comp Sci, Seoul, South Korea
[5] NC Soft, Sungnam, South Korea
Source
VISUAL COMPUTER | 2023 / Vol. 39 / Issue 01
Keywords
Biped locomotion; Deep reinforcement learning; Gait analysis; Physically based simulation; Push-recovery stability; RECOVERY;
DOI
10.1007/s00371-021-02342-9
Chinese Library Classification number
TP31 [Computer Software];
Subject classification code
081202 ; 0835 ;
Abstract
Achieving stability and robustness is the primary goal of biped locomotion control. Recently, deep reinforcement learning (DRL) has attracted great attention as a general methodology for constructing biped control policies and demonstrated significant improvements over the previous state-of-the-art control methods. Although deep control policies are more advantageous compared with previous controller design approaches, many questions remain: Are deep control policies as robust as human walking? Does simulated walking involve strategies similar to human walking for maintaining balance? Does a particular gait pattern affect human and simulated walking similarly? What do deep policies learn to achieve improved gait stability? The goal of this study is to address these questions by evaluating the push-recovery stability of deep policies compared with those of human subjects and a previous feedback controller. Furthermore, we conducted experiments to evaluate the effectiveness of variants of DRL algorithms.
Pages: 473-487
Page count: 15