Teaching Humanoid Robot Reaching Motion by Imitation and Reinforcement Learning

被引：1

作者：

Savevska, Kristina ^{[1
,2
]}

Ude, Ales ^{[1
]}

机构：

[1] Jozef Stefan Inst, Dept Automat Biocybernet & Robot, Humanoid & Cognit Robot Lab, Jamova Cesta 39, Ljubljana 1000, Slovenia

[2] Int Postgrad Sch Jozef Stefan, Jamova Cesta 39, Ljubljana 1000, Slovenia

来源：

ADVANCES IN SERVICE AND INDUSTRIAL ROBOTICS, RAAD 2023 | 2023年 / 135卷

关键词：

Humanoids; Imitation Learning; Reinforcement learning;

D O I：

10.1007/978-3-031-32606-6_7

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents a user-friendly method for programming humanoid robots without the need for expert knowledge. We propose a combination of imitation learning and reinforcement learning to teach and optimize demonstrated trajectories. An initial trajectory for reinforcement learning is generated using a stable whole-body motion imitation system. The acquired motion is then refined using a stochastic optimal control-based reinforcement learning algorithm called Policy Improvement with Path Integrals with Covariance Matrix Adaptation (PI2-CMA). We tested the approach for programming humanoid robot reaching motion. Our experimental results show that the proposed approach is successful at learning reaching motions while preserving the postural balance of the robot. We also show how a stable humanoid robot trajectory learned in simulation can be effectively adapted to different dynamic environments, e.g. a different simulator or a real robot. The resulting learning methodology allows for quick and efficient optimization of the demonstrated trajectories while also taking into account the constraints of the desired task. The learning methodology was tested in a simulated environment and on the real humanoid robot TALOS.

引用

页码：53 / 61

页数：9

共 50 条

[1] On human motion imitation by humanoid robot
Suleiman, Wael
Yoshida, Eiichi
Kanehiro, Fumio
Laumond, Jean-Paul
Monin, Andre
2008 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, VOLS 1-9, 2008, : 2697 - +
[2] A humanoid robot with motion imitation ability
Chou, Li-Po
Wang, Wen-June
PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2007, : 2031 - 2036
[3] Humanoid Robot Motion Imitation Using Kinect
Lin, Hsien-I
Chou, Chan-Ching
2015 INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND INTELLIGENT SYSTEMS (ARIS), 2015,
[4] Dynamic Imitation of Human Motion for Humanoid Robot
Chiang, Shu-Yin
Kuo, Shih-Chuan
Lin, Jau-Bi
Chen, Ching-Hui
2017 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTED, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2017,
[5] Human Arm Motion Imitation by a Humanoid Robot
Fifiatrault, Sylvain
Cretu, Ana-Maria
2014 IEEE INTERNATIONAL SYMPOSIUM ON ROBOTIC AND SENSORS ENVIRONMENTS (ROSE 2014), 2014,
[6] Visual Imitation of Humanoid Robot for 3-D Motion of Different Humanoid Robot
Hwang, Chih-Lyang
Lan, Chien-Wu
Wang, Chao-Kuei
Hao, Shu-Sheng
JOURNAL OF THE CHINESE SOCIETY OF MECHANICAL ENGINEERS, 2013, 34 (02): : 109 - 119
[7] Tcleoperation of a Humanoid Robot with Motion Imitation and Legged Locomotion
Sripada, Aditya
Asokan, Harish
Warrier, Abhishek
Kapoor, Arpit
Gaur, Harshit
Patel, Rahil
Sridhar, R.
2018 3RD IEEE INTERNATIONAL CONFERENCE ON ADVANCED ROBOTICS AND MECHATRONICS (IEEE ICARM), 2018, : 375 - 379
[8] Teaching a humanoid robot to walk faster through Safe Reinforcement Learning
Garcia, Javier
Shafie, Diogo
ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 88
[9] Motion Imitation of a Humanoid Robot via Pose Estimation
Meng, Shuyu
Qiu, Suo
Liang, Tianhao
Ren, Qinyuan
2023 35TH CHINESE CONTROL AND DECISION CONFERENCE, CCDC, 2023, : 1526 - 1532
[10] Motion Segmentation and Recognition for Imitation Learning and Influence of Bias for Learning Walking Motion of Humanoid Robot Based on Human Demonstrated Motion
Takahashi, Yasutake
Hatano, Hiroki
Maida, Yosuke
Usui, Kazuyuki
Maeda, Yoichiro
JOURNAL OF ADVANCED COMPUTATIONAL INTELLIGENCE AND INTELLIGENT INFORMATICS, 2015, 19 (04) : 532 - 543

← 1 2 3 4 5 →