Teaching Humanoid Robot Reaching Motion by Imitation and Reinforcement Learning

被引：1

作者：

Savevska, Kristina ^{[1
,2
]}

Ude, Ales ^{[1
]}

机构：

[1] Jozef Stefan Inst, Dept Automat Biocybernet & Robot, Humanoid & Cognit Robot Lab, Jamova Cesta 39, Ljubljana 1000, Slovenia

[2] Int Postgrad Sch Jozef Stefan, Jamova Cesta 39, Ljubljana 1000, Slovenia

来源：

ADVANCES IN SERVICE AND INDUSTRIAL ROBOTICS, RAAD 2023 | 2023年 / 135卷

关键词：

Humanoids; Imitation Learning; Reinforcement learning;

D O I：

10.1007/978-3-031-32606-6_7

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents a user-friendly method for programming humanoid robots without the need for expert knowledge. We propose a combination of imitation learning and reinforcement learning to teach and optimize demonstrated trajectories. An initial trajectory for reinforcement learning is generated using a stable whole-body motion imitation system. The acquired motion is then refined using a stochastic optimal control-based reinforcement learning algorithm called Policy Improvement with Path Integrals with Covariance Matrix Adaptation (PI2-CMA). We tested the approach for programming humanoid robot reaching motion. Our experimental results show that the proposed approach is successful at learning reaching motions while preserving the postural balance of the robot. We also show how a stable humanoid robot trajectory learned in simulation can be effectively adapted to different dynamic environments, e.g. a different simulator or a real robot. The resulting learning methodology allows for quick and efficient optimization of the demonstrated trajectories while also taking into account the constraints of the desired task. The learning methodology was tested in a simulated environment and on the real humanoid robot TALOS.

引用

页码：53 / 61

页数：9

共 50 条

[31] Learning by Imitation and Implementation Of Sign Language Gestures by a Humanoid Robot
Kivrak, Hasan
Kose, Hatice
2015 23RD SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2015, : 1993 - 1996
[32] Reinforcement learning for motion control of humanoid robots
Iida, S. (iida@ics.nitech.ac.jp), 2004, Institute of Electrical and Electronics Engineers, IEEE; Robotics Society of Japan, RSJ (Institute of Electrical and Electronics Engineers Inc.):
[33] Autonomous learning of 3D reaching in a humanoid robot
Nori, Francesco
Natale, Lorenzo
Sandini, Giulio
Metta, Giorgio
2007 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-9, 2007, : 1148 - 1153
[34] Learning precise 3D reaching in a humanoid robot
Natale, Lorenzo
Nori, Francesco
Sandini, Giulio
Metta, Giorgio
2007 IEEE 6TH INTERNATIONAL CONFERENCE ON DEVELOPMENT AND LEARNING, 2007, : 217 - 222
[35] Learning for a Robot: Deep Reinforcement Learning, Imitation Learning, Transfer Learning
Hua, Jiang
Zeng, Liangcai
Li, Gongfa
Ju, Zhaojie
SENSORS, 2021, 21 (04) : 1 - 21
[36] Learning for a robot: Deep reinforcement learning, imitation learning, transfer learning
Hua, Jiang
Zeng, Liangcai
Li, Gongfa
Ju, Zhaojie
Sensors (Switzerland), 2021, 21 (04): : 1 - 21
[37] Deep Reinforcement Learning for a Humanoid Robot Soccer Player
Isaac Jesus da Silva
Danilo Hernani Perico
Thiago Pedro Donadon Homem
Reinaldo Augusto da Costa Bianchi
Journal of Intelligent & Robotic Systems, 2021, 102
[38] Deep Reinforcement Learning for a Humanoid Robot Soccer Player
da Silva, Isaac Jesus
Perico, Danilo Hernani
Donadon Homem, Thiago Pedro
da Costa Bianchi, Reinaldo Augusto
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2021, 102 (03)
[39] Analysis of Cost Functions for Reinforcement Learning of Reaching Tasks in Humanoid Robots
Savevska, Kristina
Ude, Ales
APPLIED SCIENCES-BASEL, 2024, 14 (01):
[40] Real-time full body motion imitation on the COMAN humanoid robot
Gams, Andrej
van den Kieboom, Jesse
Dzeladini, Florin
Ude, Ales
Ijspeert, Auke Jan
ROBOTICA, 2015, 33 (05) : 1049 - 1061

← 1 2 3 4 5 →