Teaching Humanoid Robot Reaching Motion by Imitation and Reinforcement Learning

Cited by: 1
Authors
Savevska, Kristina [1, 2]
Ude, Ales [1]
Affiliations
[1] Jozef Stefan Inst, Dept Automat Biocybernet & Robot, Humanoid & Cognit Robot Lab, Jamova Cesta 39, Ljubljana 1000, Slovenia
[2] Int Postgrad Sch Jozef Stefan, Jamova Cesta 39, Ljubljana 1000, Slovenia
Keywords
Humanoids; Imitation learning; Reinforcement learning
DOI
10.1007/978-3-031-32606-6_7
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
This paper presents a user-friendly method for programming humanoid robots without the need for expert knowledge. We propose a combination of imitation learning and reinforcement learning to teach and optimize demonstrated trajectories. An initial trajectory for reinforcement learning is generated using a stable whole-body motion imitation system. The acquired motion is then refined using a stochastic optimal control-based reinforcement learning algorithm called Policy Improvement with Path Integrals with Covariance Matrix Adaptation (PI2-CMA). We tested the approach for programming humanoid robot reaching motion. Our experimental results show that the proposed approach is successful at learning reaching motions while preserving the postural balance of the robot. We also show how a stable humanoid robot trajectory learned in simulation can be effectively adapted to different dynamic environments, e.g. a different simulator or a real robot. The resulting learning methodology allows for quick and efficient optimization of the demonstrated trajectories while also taking into account the constraints of the desired task. The learning methodology was tested in a simulated environment and on the real humanoid robot TALOS.
Pages: 53 - 61
Number of pages: 9
Related papers
50 in total
  • [1] On human motion imitation by humanoid robot
    Suleiman, Wael
    Yoshida, Eiichi
    Kanehiro, Fumio
    Laumond, Jean-Paul
    Monin, Andre
    2008 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-9, 2008, : 2697 - +
  • [2] A humanoid robot with motion imitation ability
    Chou, Li-Po
    Wang, Wen-June
    PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 2031 - 2036
  • [3] Humanoid Robot Motion Imitation Using Kinect
    Lin, Hsien-I
    Chou, Chan-Ching
    2015 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND INTELLIGENT SYSTEMS (ARIS), 2015,
  • [4] Dynamic Imitation of Human Motion for Humanoid Robot
    Chiang, Shu-Yin
    Kuo, Shih-Chuan
    Lin, Jau-Bi
    Chen, Ching-Hui
    2017 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTED, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2017,
  • [5] Human Arm Motion Imitation by a Humanoid Robot
    Filiatrault, Sylvain
    Cretu, Ana-Maria
    2014 IEEE INTERNATIONAL SYMPOSIUM ON ROBOTIC AND SENSORS ENVIRONMENTS (ROSE 2014), 2014,
  • [6] Visual Imitation of Humanoid Robot for 3-D Motion of Different Humanoid Robot
    Hwang, Chih-Lyang
    Lan, Chien-Wu
    Wang, Chao-Kuei
    Hao, Shu-Sheng
    JOURNAL OF THE CHINESE SOCIETY OF MECHANICAL ENGINEERS, 2013, 34 (02): : 109 - 119
  • [7] Teleoperation of a Humanoid Robot with Motion Imitation and Legged Locomotion
    Sripada, Aditya
    Asokan, Harish
    Warrier, Abhishek
    Kapoor, Arpit
    Gaur, Harshit
    Patel, Rahil
    Sridhar, R.
    2018 3RD IEEE INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (IEEE ICARM), 2018, : 375 - 379
  • [8] Teaching a humanoid robot to walk faster through Safe Reinforcement Learning
    Garcia, Javier
    Shafie, Diogo
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 88
  • [9] Motion Imitation of a Humanoid Robot via Pose Estimation
    Meng, Shuyu
    Qiu, Suo
    Liang, Tianhao
    Ren, Qinyuan
    2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 1526 - 1532
  • [10] Motion Segmentation and Recognition for Imitation Learning and Influence of Bias for Learning Walking Motion of Humanoid Robot Based on Human Demonstrated Motion
    Takahashi, Yasutake
    Hatano, Hiroki
    Maida, Yosuke
    Usui, Kazuyuki
    Maeda, Yoichiro
    JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2015, 19 (04) : 532 - 543