Teaching Humanoid Robot Reaching Motion by Imitation and Reinforcement Learning

被引：1

作者：

Savevska, Kristina ^{[1
,2
]}

Ude, Ales ^{[1
]}

机构：

[1] Jozef Stefan Inst, Dept Automat Biocybernet & Robot, Humanoid & Cognit Robot Lab, Jamova Cesta 39, Ljubljana 1000, Slovenia

[2] Int Postgrad Sch Jozef Stefan, Jamova Cesta 39, Ljubljana 1000, Slovenia

来源：

ADVANCES IN SERVICE AND INDUSTRIAL ROBOTICS, RAAD 2023 | 2023年 / 135卷

关键词：

Humanoids; Imitation Learning; Reinforcement learning;

D O I：

10.1007/978-3-031-32606-6_7

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper presents a user-friendly method for programming humanoid robots without the need for expert knowledge. We propose a combination of imitation learning and reinforcement learning to teach and optimize demonstrated trajectories. An initial trajectory for reinforcement learning is generated using a stable whole-body motion imitation system. The acquired motion is then refined using a stochastic optimal control-based reinforcement learning algorithm called Policy Improvement with Path Integrals with Covariance Matrix Adaptation (PI2-CMA). We tested the approach for programming humanoid robot reaching motion. Our experimental results show that the proposed approach is successful at learning reaching motions while preserving the postural balance of the robot. We also show how a stable humanoid robot trajectory learned in simulation can be effectively adapted to different dynamic environments, e.g. a different simulator or a real robot. The resulting learning methodology allows for quick and efficient optimization of the demonstrated trajectories while also taking into account the constraints of the desired task. The learning methodology was tested in a simulated environment and on the real humanoid robot TALOS.

引用

页码：53 / 61

页数：9

共 50 条

[41] Imitation of Human Motion on a Humanoid Robot using Inverse Kinematics and Path Optimization
Pandey, Padmakar
Kumar, Krishan
Nandi, G. C.
PROCEEDINGS OF THE 2018 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTING AND CONTROL SYSTEMS (ICICCS), 2018, : 1863 - 1868
[42] Imitation of Human Motion on a Humanoid Robot using Non-Linear Optimization
Do, Martin
Azad, Pedram
Asfour, Tamim
Dillmann, Ruediger
2008 8TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS 2008), 2008, : 693 - 700
[43] Robust Regression-Based Motion Perception for Online Imitation on Humanoid Robot
Zhu, Tehao
Zhao, Qunfei
Wan, Weibing
Xia, Zeyang
INTERNATIONAL JOURNAL OF SOCIAL ROBOTICS, 2017, 9 (05) : 705 - 725
[44] Robust Regression-Based Motion Perception for Online Imitation on Humanoid Robot
Tehao Zhu
Qunfei Zhao
Weibing Wan
Zeyang Xia
International Journal of Social Robotics, 2017, 9 : 705 - 725
[45] Development of Imitation Learning Through Physical Therapy Using a Humanoid Robot
Malik, Norjasween Abdul
Yussof, Hanafiah
Hanapiah, Fazah Akhtar
MEDICAL AND REHABILITATION ROBOTICS AND INSTRUMENTATION (MRRI2013), 2014, 42 : 191 - 197
[46] Learning of Gestures by Imitation using a Monocular Vision System on a Humanoid Robot
Sabbaghi, Elaheh
Bahrami, Mohsen
Ghidary, Saeed Shiry
2014 SECOND RSI/ISM INTERNATIONAL CONFERENCE ON ROBOTICS AND MECHATRONICS (ICROM), 2014, : 588 - 594
[47] TeachMe: Three-phase learning framework for robotic motion imitation based on interactive teaching and reinforcement learning
Kim, Taewoo
Lee, Joo-Haeng
2019 28TH IEEE INTERNATIONAL CONFERENCE ON ROBOT AND HUMAN INTERACTIVE COMMUNICATION (RO-MAN), 2019,
[48] LEARNING (GOOD HANDWRITING IN GREEK) BY TEACHING (A HUMANOID ROBOT)
Ioannou, C.
Neophytou, C.
Asselborn, T.
Johal, W.
Hadzilacos, T.
14TH INTERNATIONAL TECHNOLOGY, EDUCATION AND DEVELOPMENT CONFERENCE (INTED2020), 2020, : 7287 - 7296
[49] Humanoid Robot Gait on Sloping Floors Using Reinforcement Learning
Silva, Isaac J.
Perico, Danilo H.
Homem, Thiago P. D.
Vilao, Claudio O., Jr.
Tonidandel, Flavio
Bianchi, Reinaldo A. C.
ROBOTICS, 2016, 619 : 228 - 246
[50] Push Recovery Control for Humanoid Robot using Reinforcement Learning
Seo, Donghyeon
Kim, Harin
Kim, Donghan
2019 THIRD IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING (IRC 2019), 2019, : 488 - 492

← 1 2 3 4 5 →