Teaching Humanoid Robot Reaching Motion by Imitation and Reinforcement Learning

被引:1
|
作者
Savevska, Kristina [1 ,2 ]
Ude, Ales [1 ]
机构
[1] Jozef Stefan Inst, Dept Automat Biocybernet & Robot, Humanoid & Cognit Robot Lab, Jamova Cesta 39, Ljubljana 1000, Slovenia
[2] Int Postgrad Sch Jozef Stefan, Jamova Cesta 39, Ljubljana 1000, Slovenia
关键词
Humanoids; Imitation Learning; Reinforcement learning;
D O I
10.1007/978-3-031-32606-6_7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a user-friendly method for programming humanoid robots without the need for expert knowledge. We propose a combination of imitation learning and reinforcement learning to teach and optimize demonstrated trajectories. An initial trajectory for reinforcement learning is generated using a stable whole-body motion imitation system. The acquired motion is then refined using a stochastic optimal control-based reinforcement learning algorithm called Policy Improvement with Path Integrals with Covariance Matrix Adaptation (PI2-CMA). We tested the approach for programming humanoid robot reaching motion. Our experimental results show that the proposed approach is successful at learning reaching motions while preserving the postural balance of the robot. We also show how a stable humanoid robot trajectory learned in simulation can be effectively adapted to different dynamic environments, e.g. a different simulator or a real robot. The resulting learning methodology allows for quick and efficient optimization of the demonstrated trajectories while also taking into account the constraints of the desired task. The learning methodology was tested in a simulated environment and on the real humanoid robot TALOS.
引用
收藏
页码:53 / 61
页数:9
相关论文
共 50 条
  • [31] Learning by Imitation and Implementation Of Sign Language Gestures by a Humanoid Robot
    Kivrak, Hasan
    Kose, Hatice
    2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 1993 - 1996
  • [32] Reinforcement learning for motion control of humanoid robots
    Iida, S. (iida@ics.nitech.ac.jp), 2004, Institute of Electrical and Electronics Engineers, IEEE; Robotics Society of Japan, RSJ (Institute of Electrical and Electronics Engineers Inc.):
  • [33] Autonomous learning of 3D reaching in a humanoid robot
    Nori, Francesco
    Natale, Lorenzo
    Sandini, Giulio
    Metta, Giorgio
    2007 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-9, 2007, : 1148 - 1153
  • [34] Learning precise 3D reaching in a humanoid robot
    Natale, Lorenzo
    Nori, Francesco
    Sandini, Giulio
    Metta, Giorgio
    2007 IEEE 6TH INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, 2007, : 217 - 222
  • [35] Learning for a Robot: Deep Reinforcement Learning, Imitation Learning, Transfer Learning
    Hua, Jiang
    Zeng, Liangcai
    Li, Gongfa
    Ju, Zhaojie
    SENSORS, 2021, 21 (04) : 1 - 21
  • [36] Learning for a robot: Deep reinforcement learning, imitation learning, transfer learning
    Hua, Jiang
    Zeng, Liangcai
    Li, Gongfa
    Ju, Zhaojie
    Sensors (Switzerland), 2021, 21 (04): : 1 - 21
  • [37] Deep Reinforcement Learning for a Humanoid Robot Soccer Player
    Isaac Jesus da Silva
    Danilo Hernani Perico
    Thiago Pedro Donadon Homem
    Reinaldo Augusto da Costa Bianchi
    Journal of Intelligent & Robotic Systems, 2021, 102
  • [38] Deep Reinforcement Learning for a Humanoid Robot Soccer Player
    da Silva, Isaac Jesus
    Perico, Danilo Hernani
    Donadon Homem, Thiago Pedro
    da Costa Bianchi, Reinaldo Augusto
    JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 102 (03)
  • [39] Analysis of Cost Functions for Reinforcement Learning of Reaching Tasks in Humanoid Robots
    Savevska, Kristina
    Ude, Ales
    APPLIED SCIENCES-BASEL, 2024, 14 (01):
  • [40] Real-time full body motion imitation on the COMAN humanoid robot
    Gams, Andrej
    van den Kieboom, Jesse
    Dzeladini, Florin
    Ude, Ales
    Ijspeert, Auke Jan
    ROBOTICA, 2015, 33 (05) : 1049 - 1061