Understanding the stability of deep control policies for biped locomotion

被引:3
|
作者
Park, Hwangpil [1 ,3 ]
Yu, Ri [1 ]
Lee, Yoonsang [4 ]
Lee, Kyungho [5 ]
Lee, Jehee [2 ]
机构
[1] Seoul Natl Univ, Seoul, South Korea
[2] Seoul Natl Univ, Dept Comp Sci & Engn, Seoul, South Korea
[3] Samsung Elect, Suwon, South Korea
[4] Hanyang Univ, Comp Sci, Seoul, South Korea
[5] NC Soft, Sungnam, South Korea
来源
VISUAL COMPUTER | 2023年 / 39卷 / 01期
关键词
Biped locomotion; Deep reinforcement learning; Gait analysis; Physically based simulation; Push-recovery stability; RECOVERY;
D O I
10.1007/s00371-021-02342-9
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Achieving stability and robustness is the primary goal of biped locomotion control. Recently, deep reinforcement learning (DRL) has attracted great attention as a general methodology for constructing biped control policies and demonstrated significant improvements over the previous state-of-the-art control methods. Although deep control policies are more advantageous compared with previous controller design approaches, many questions remain: Are deep control policies as robust as human walking? Does simulated walking involve strategies similar to human walking for maintaining balance? Does a particular gait pattern affect human and simulated walking similarly? What do deep policies learn to achieve improved gait stability? The goal of this study is to address these questions by evaluating the push-recovery stability of deep policies compared with those of human subjects and a previous feedback controller. Furthermore, we conducted experiments to evaluate the effectiveness of variants of DRL algorithms.
引用
收藏
页码:473 / 487
页数:15
相关论文
共 50 条
  • [41] Dynamic control of biped locomotion robot using optimal regulator
    Sano, Akihito
    Furusho, Junji
    Nippon Kikai Gakkai Ronbunshu, C Hen/Transactions of the Japan Society of Mechanical Engineers, Part C, 1988, 54 (504): : 1804 - 1811
  • [42] Programmable central pattern generators: an application to biped locomotion control
    Righetti, Ludovic
    Ijspeert, Auke Jan
    2006 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-10, 2006, : 1585 - +
  • [43] Fuzzy Logic Velocity Control of a Biped Robot Locomotion and Simulation
    Ankarali, Arif
    INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS, 2012, 9
  • [44] A Truncated Fourier Series with Genetic Algorithm for the control of Biped Locomotion
    Shafii, Nima
    Javadi, Mohammad H. Seyed
    Kimiaghalam, Bahram
    2009 IEEE/ASME INTERNATIONAL CONFERENCE ON ADVANCED INTELLIGENT MECHATRONICS, VOLS 1-3, 2009, : 1770 - 1774
  • [45] Optimal gait control for a biped locomotion using genetic algorithm
    Kim, JG
    Choi, SH
    Park, KH
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2004, PT 4, 2004, 3046 : 29 - 38
  • [46] Turning control of a biped locomotion robot using nonlinear oscillators
    Aoi, S
    Tsuchiya, K
    Tsujita, K
    2004 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1- 5, PROCEEDINGS, 2004, : 3043 - 3048
  • [47] Variable stiffness control of series elastic actuated biped locomotion
    Luo, Jianwen
    Wang, Shuguo
    Zhao, Ye
    Fu, Yili
    INTELLIGENT SERVICE ROBOTICS, 2018, 11 (03) : 225 - 235
  • [48] Model Reference Adaptive Control for Actuators of a Biped Robot Locomotion
    Vempaty, Pavan K.
    Cheok, Ka C.
    Loh, Robert N. K.
    WCECS 2009: WORLD CONGRESS ON ENGINEERING AND COMPUTER SCIENCE, VOLS I AND II, 2009, : 983 - 988
  • [49] Variable stiffness control of series elastic actuated biped locomotion
    Jianwen Luo
    Shuguo Wang
    Ye Zhao
    Yili Fu
    Intelligent Service Robotics, 2018, 11 : 225 - 235
  • [50] CONTROL OF ATTITUDE OF BIPED LOCOMOTION BY LINEAR-MOTION BALANCE
    FUNABASHI, H
    WATANABE, K
    OGAWA, K
    BULLETIN OF THE JSME-JAPAN SOCIETY OF MECHANICAL ENGINEERS, 1979, 22 (172): : 1499 - 1506