Learning CPG-based biped locomotion with a policy gradient method

被引:0
|
作者
Matsubara, T [1 ]
Morimoto, J [1 ]
Nakanishi, J [1 ]
Sato, M [1 ]
Doya, K [1 ]
机构
[1] Nara Inst Sci & Technol, Nara, Japan
关键词
reinforcement learning; policy gradient; biped locomotion; central pattern generator; WALKING;
D O I
暂无
中图分类号
TP24 [机器人技术];
学科分类号
080202 ; 1405 ;
摘要
Recently, CPG-based controllers have been widely explored to achieve robust biped locomotion. However, this approach has difficulties in tuning open parameters in the controller. In this paper, we present a learning framework for CPG-based biped locomotion with a policy gradient method. We demonstrate that appropriate sensory feedback in the CPG-based control architecture can he acquired using the proposed method within a thousand trials by numerical simulations. We analyze linear stability of a periodic orbit of the acquired biped walking considering a return map. Furthermore, we apply the learned controllers in numerical simulations to our physical 5-link robot in order to empirically evaluate the effectiveness of the proposed framework. Experimental results suggest the robustness of the acquired controllers against environmental changes and variations in the mass properties of the robot.
引用
收藏
页码:208 / 213
页数:6
相关论文
共 50 条
  • [1] Learning CPG-based biped locomotion with a policy gradient method
    Matsubara, T. (takam-m@atr.jp), (Inst. of Elec. and Elec. Eng. Computer Society, 445 Hoes Lane - P.O.Box 1331, Piscataway, NJ 08855-1331, United States):
  • [2] Learning CPG-based biped locomotion with a policy gradient method
    Matsubara, Takamitsu
    Morimoto, Jun
    Nakanishi, Jun
    Sato, Masa-aki
    Doya, Kenji
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2006, 54 (11) : 911 - 920
  • [3] Learning CPG-based biped locomotion with a policy gradient method: Application to a humanoid robot
    Endo, Gen
    Morimoto, Jun
    Matsubara, Takamitsu
    Nakanishi, Jun
    Cheng, Gordon
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2008, 27 (02): : 213 - 228
  • [4] Learning sensory feedback to CPG with policy gradient for biped locomotion
    Matsubara, T
    Morimoto, J
    Nakanishi, J
    Sato, MA
    Doya, K
    2005 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-4, 2005, : 4164 - 4169
  • [5] Biped Locomotion Control through a Biomimetic CPG-based Controller
    Cristina P. Santos
    Nuno Alves
    Juan C. Moreno
    Journal of Intelligent & Robotic Systems, 2017, 85 : 47 - 70
  • [6] Biped Locomotion Control through a Biomimetic CPG-based Controller
    Santos, Cristina P.
    Alves, Nuno
    Moreno, Juan C.
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2017, 85 (01) : 47 - 70
  • [7] A CPG-based control method for the rolling locomotion of a desert spider
    Shi, Ruidong
    Zhang, Xiuli
    Tian, Yaobin
    Dong, Shouyang
    Yao, Yan'an
    2016 IEEE WORKSHOP ON ADVANCED ROBOTICS AND ITS SOCIAL IMPACTS (ARSO), 2016, : 243 - 248
  • [8] Development of Semi-Passive Biped Walking Robot Embedded with CPG-based Locomotion Control
    Suzuki, Hirotatsu
    Lee, Jae Hoon
    Okamoto, Shingo
    2017 14TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2017, : 75 - 78
  • [9] A CPG-based Sensory Feedback Control Method for Robotic Fish Locomotion
    Wang Ming
    Yu Junzhi
    Tan Min
    Zhang Guiqing
    2011 30TH CHINESE CONTROL CONFERENCE (CCC), 2011, : 4115 - 4120
  • [10] A stochastic optimization method of CPG-based motion control for humanoid locomotion
    Itoh, Y
    Taki, K
    Kato, S
    Itoh, H
    2004 IEEE CONFERENCE ON ROBOTICS, AUTOMATION AND MECHATRONICS, VOLS 1 AND 2, 2004, : 347 - 351