Realizing the quadruped robot walking by using reinforcement learning

被引:0
|
作者
Murao, H [1 ]
Tamaki, H [1 ]
Kitamura, S [1 ]
机构
[1] Kobe Univ, Kobe, Hyogo 6578501, Japan
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We apply the reinforcement leashing to acquire a gait pattern of a quadruped locomotive robot. Where, no prescribed teaching signals as for supervised learnings but only scalar reinforcement signals are assumed. Tile advantage of tile reinforcement learning for such a problem is that we need no exact robot models for calculating prescribed teaching signals, but we simply need to evaluate results of trials and generate reinforcement signals. It is expected as a result that tile robot can acquire a walking pattern suitable to its structure, dynamics and environment by itself. We use here a well-known Actor-Critic learning method. It is applied to determine the value of amplitude and argument of assumed sinusoidal curves for angles of every joints. The computer simulations showed that it could generate various stable walking pattern suitable to the environment and dynamics of the robot. We also applied the proposed method to an experimental real robot and could ascertain the learning process for getting the walking pattern. As a conclusion, we have proposed a method of the reinforcement learning scheme which assumed the sinusoidal curve for each joint angle trajectory, and we have showed, by simulation and experiment, its effectiveness for generating the stable walking patterns for the quadruped robot.
引用
收藏
页码:240 / 243
页数:4
相关论文
共 50 条
  • [1] Walking pattern acquisition for quadruped robot by using modular reinforcement learning
    Murao, H
    Tamaki, H
    Kitamura, S
    2001 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS, VOLS 1-5: E-SYSTEMS AND E-MAN FOR CYBERNETICS IN CYBERSPACE, 2002, : 1402 - 1405
  • [2] Reinforcement Learning based Control of a Quadruped Robot
    Ancy, A.
    Jisha, V. R.
    2022 IEEE 19TH INDIA COUNCIL INTERNATIONAL CONFERENCE, INDICON, 2022,
  • [3] Continual Reinforcement Learning for Quadruped Robot Locomotion
    Gai, Sibo
    Lyu, Shangke
    Zhang, Hongyin
    Wang, Donglin
    ENTROPY, 2024, 26 (01)
  • [4] Quadruped robot obstacle negotiation via reinforcement learning
    Lee, Honglak
    Shen, Yirong
    Yu, Chih-Han
    Singh, Gurjeet
    Ng, Andrew Y.
    2006 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), VOLS 1-10, 2006, : 3003 - +
  • [5] Synthesizing the optimal gait of a quadruped robot with soft actuators using deep reinforcement learning
    Ji, Qinglei
    Fu, Shuo
    Tan, Kaige
    Muralidharan, Seshagopalan Thorapalli
    Lagrelius, Karin
    Danelia, David
    Andrikopoulos, Georgios
    Wang, Xi Vincent
    Wang, Lihui
    Feng, Lei
    ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 2022, 78
  • [6] Model Predictive Control of Quadruped Robot Based on Reinforcement Learning
    Zhang, Zhitong
    Chang, Xu
    Ma, Hongxu
    An, Honglei
    Lang, Lin
    APPLIED SCIENCES-BASEL, 2023, 13 (01):
  • [7] A strategy for push recovery in quadruped Robot based on reinforcement Learning
    Chen, Yang-zhen
    Hou, Wen-Qi
    Wang, Jian
    Wang, Jian-Wen
    Ma, Hong-xu
    2015 34TH CHINESE CONTROL CONFERENCE (CCC), 2015, : 3145 - 3151
  • [8] A Reinforcement Learning Method for Humanoid Robot Walking
    Liu, Yunda
    Bi, Sheng
    Dong, Min
    Zhang, Yingjie
    Huang, Jialing
    Zhang, Jiawei
    2018 IEEE 8TH ANNUAL INTERNATIONAL CONFERENCE ON CYBER TECHNOLOGY IN AUTOMATION, CONTROL, AND INTELLIGENT SYSTEMS (IEEE-CYBER), 2018, : 623 - 628
  • [9] Reinforcement learning for passive dynamic walking robot
    Mao, Yong
    Li, Shi
    Wang, Jiaxin
    Jia, Peifa
    Yang, Zehong
    Qiu, Zhen
    Qinghua Daxue Xuebao/Journal of Tsinghua University, 2008, 48 (01): : 92 - 96
  • [10] Using a controller based on reinforcement learning for a passive dynamic walking robot
    Schuitema, E
    Hobbelen, DGE
    Jonker, PP
    Wisse, M
    Karssen, JGD
    2005 5TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS, 2005, : 232 - 237