Teaching Humanoid Robot Reaching Motion by Imitation and Reinforcement Learning

被引:1
|
作者
Savevska, Kristina [1 ,2 ]
Ude, Ales [1 ]
机构
[1] Jozef Stefan Inst, Dept Automat Biocybernet & Robot, Humanoid & Cognit Robot Lab, Jamova Cesta 39, Ljubljana 1000, Slovenia
[2] Int Postgrad Sch Jozef Stefan, Jamova Cesta 39, Ljubljana 1000, Slovenia
关键词
Humanoids; Imitation Learning; Reinforcement learning;
D O I
10.1007/978-3-031-32606-6_7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper presents a user-friendly method for programming humanoid robots without the need for expert knowledge. We propose a combination of imitation learning and reinforcement learning to teach and optimize demonstrated trajectories. An initial trajectory for reinforcement learning is generated using a stable whole-body motion imitation system. The acquired motion is then refined using a stochastic optimal control-based reinforcement learning algorithm called Policy Improvement with Path Integrals with Covariance Matrix Adaptation (PI2-CMA). We tested the approach for programming humanoid robot reaching motion. Our experimental results show that the proposed approach is successful at learning reaching motions while preserving the postural balance of the robot. We also show how a stable humanoid robot trajectory learned in simulation can be effectively adapted to different dynamic environments, e.g. a different simulator or a real robot. The resulting learning methodology allows for quick and efficient optimization of the demonstrated trajectories while also taking into account the constraints of the desired task. The learning methodology was tested in a simulated environment and on the real humanoid robot TALOS.
引用
收藏
页码:53 / 61
页数:9
相关论文
共 50 条
  • [41] Imitation of Human Motion on a Humanoid Robot using Inverse Kinematics and Path Optimization
    Pandey, Padmakar
    Kumar, Krishan
    Nandi, G. C.
    PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2018, : 1863 - 1868
  • [42] Imitation of Human Motion on a Humanoid Robot using Non-Linear Optimization
    Do, Martin
    Azad, Pedram
    Asfour, Tamim
    Dillmann, Ruediger
    2008 8TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS 2008), 2008, : 693 - 700
  • [43] Robust Regression-Based Motion Perception for Online Imitation on Humanoid Robot
    Zhu, Tehao
    Zhao, Qunfei
    Wan, Weibing
    Xia, Zeyang
    INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS, 2017, 9 (05) : 705 - 725
  • [44] Robust Regression-Based Motion Perception for Online Imitation on Humanoid Robot
    Tehao Zhu
    Qunfei Zhao
    Weibing Wan
    Zeyang Xia
    International Journal of Social Robotics, 2017, 9 : 705 - 725
  • [45] Development of Imitation Learning Through Physical Therapy Using a Humanoid Robot
    Malik, Norjasween Abdul
    Yussof, Hanafiah
    Hanapiah, Fazah Akhtar
    MEDICAL AND REHABILITATION ROBOTICS AND INSTRUMENTATION (MRRI2013), 2014, 42 : 191 - 197
  • [46] Learning of Gestures by Imitation using a Monocular Vision System on a Humanoid Robot
    Sabbaghi, Elaheh
    Bahrami, Mohsen
    Ghidary, Saeed Shiry
    2014 SECOND RSI/ISM INTERNATIONAL CONFERENCE ON ROBOTICS AND MECHATRONICS (ICROM), 2014, : 588 - 594
  • [47] TeachMe: Three-phase learning framework for robotic motion imitation based on interactive teaching and reinforcement learning
    Kim, Taewoo
    Lee, Joo-Haeng
    2019 28TH IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2019,
  • [48] LEARNING (GOOD HANDWRITING IN GREEK) BY TEACHING (A HUMANOID ROBOT)
    Ioannou, C.
    Neophytou, C.
    Asselborn, T.
    Johal, W.
    Hadzilacos, T.
    14TH INTERNATIONAL TECHNOLOGY, EDUCATION AND DEVELOPMENT CONFERENCE (INTED2020), 2020, : 7287 - 7296
  • [49] Humanoid Robot Gait on Sloping Floors Using Reinforcement Learning
    Silva, Isaac J.
    Perico, Danilo H.
    Homem, Thiago P. D.
    Vilao, Claudio O., Jr.
    Tonidandel, Flavio
    Bianchi, Reinaldo A. C.
    ROBOTICS, 2016, 619 : 228 - 246
  • [50] Push Recovery Control for Humanoid Robot using Reinforcement Learning
    Seo, Donghyeon
    Kim, Harin
    Kim, Donghan
    2019 THIRD IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC 2019), 2019, : 488 - 492