Learning CPG-based biped locomotion with a policy gradient method

Cited by: 0

Authors:

Matsubara, T [1]

Morimoto, J [1]

Nakanishi, J [1]

Sato, M [1]

Doya, K [1]

Affiliations:

[1] Nara Inst Sci & Technol, Nara, Japan

Source:

2005 5TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS | 2005

Keywords:

reinforcement learning; policy gradient; biped locomotion; central pattern generator; WALKING;

DOI:

Not available

Chinese Library Classification (CLC) number:

TP24 [Robotics];

Discipline classification codes:

080202 ; 1405 ;

Abstract:

Recently, CPG-based controllers have been widely explored to achieve robust biped locomotion. However, this approach has difficulties in tuning open parameters in the controller. In this paper, we present a learning framework for CPG-based biped locomotion with a policy gradient method. We demonstrate that appropriate sensory feedback in the CPG-based control architecture can be acquired using the proposed method within a thousand trials by numerical simulations. We analyze linear stability of a periodic orbit of the acquired biped walking considering a return map. Furthermore, we apply the learned controllers in numerical simulations to our physical 5-link robot in order to empirically evaluate the effectiveness of the proposed framework. Experimental results suggest the robustness of the acquired controllers against environmental changes and variations in the mass properties of the robot.


Pages: 208-213

Number of pages: 6

50 items in total

[1] Learning CPG-based biped locomotion with a policy gradient method
Matsubara, T. (takam-m@atr.jp), (Inst. of Elec. and Elec. Eng. Computer Society, 445 Hoes Lane - P.O.Box 1331, Piscataway, NJ 08855-1331, United States):
[2] Learning CPG-based biped locomotion with a policy gradient method
Matsubara, Takamitsu
Morimoto, Jun
Nakanishi, Jun
Sato, Masa-aki
Doya, Kenji
ROBOTICS AND AUTONOMOUS SYSTEMS, 2006, 54 (11) : 911 - 920
[3] Learning CPG-based biped locomotion with a policy gradient method: Application to a humanoid robot
Endo, Gen
Morimoto, Jun
Matsubara, Takamitsu
Nakanishi, Jun
Cheng, Gordon
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2008, 27 (02) : 213 - 228
[4] Learning sensory feedback to CPG with policy gradient for biped locomotion
Matsubara, T
Morimoto, J
Nakanishi, J
Sato, MA
Doya, K
2005 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-4, 2005, : 4164 - 4169
[5] Biped Locomotion Control through a Biomimetic CPG-based Controller
Cristina P. Santos
Nuno Alves
Juan C. Moreno
Journal of Intelligent & Robotic Systems, 2017, 85 : 47 - 70
[6] Biped Locomotion Control through a Biomimetic CPG-based Controller
Santos, Cristina P.
Alves, Nuno
Moreno, Juan C.
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2017, 85 (01) : 47 - 70
[7] A CPG-based control method for the rolling locomotion of a desert spider
Shi, Ruidong
Zhang, Xiuli
Tian, Yaobin
Dong, Shouyang
Yao, Yan'an
2016 IEEE WORKSHOP ON ADVANCED ROBOTICS AND ITS SOCIAL IMPACTS (ARSO), 2016, : 243 - 248
[8] Development of Semi-Passive Biped Walking Robot Embedded with CPG-based Locomotion Control
Suzuki, Hirotatsu
Lee, Jae Hoon
Okamoto, Shingo
2017 14TH INTERNATIONAL CONFERENCE ON UBIQUITOUS ROBOTS AND AMBIENT INTELLIGENCE (URAI), 2017, : 75 - 78
[9] A CPG-based Sensory Feedback Control Method for Robotic Fish Locomotion
Wang Ming
Yu Junzhi
Tan Min
Zhang Guiqing
2011 30TH CHINESE CONTROL CONFERENCE (CCC), 2011, : 4115 - 4120
[10] A stochastic optimization method of CPG-based motion control for humanoid locomotion
Itoh, Y
Taki, K
Kato, S
Itoh, H
2004 IEEE CONFERENCE ON ROBOTICS, AUTOMATION AND MECHATRONICS, VOLS 1 AND 2, 2004, : 347 - 351
