Primitive Skill-based Robot Learning from Human Evaluative Feedback

Cited by: 1

Authors:

Hiranaka, Ayano [1]

Hwang, Minjune [2]

Lee, Sharon [2]

Wang, Chen [2]

Li, Fei-Fei [2]

Wu, Jiajun [2]

Zhang, Ruohan [2]

Affiliations:

[1] Stanford Univ, Dept Mech Engn, Stanford, CA 94305 USA

[2] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA

Source:

2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS) | 2023

Keywords:

DOI:

10.1109/IROS55552.2023.10341912

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104; 0812; 0835; 1405

Abstract:

Reinforcement learning (RL) algorithms face significant challenges when dealing with long-horizon robot manipulation tasks in real-world environments due to sample inefficiency and safety issues. To overcome these challenges, we propose a novel framework, SEED, which leverages two approaches: reinforcement learning from human feedback (RLHF) and primitive skill-based reinforcement learning. Both approaches are particularly effective in addressing sparse reward issues and the complexities involved in long-horizon tasks. By combining them, SEED reduces the human effort required in RLHF and increases safety in training robot manipulation with RL in real-world settings. Additionally, parameterized skills provide a clear view of the agent's high-level intentions, allowing humans to evaluate skill choices before they are executed. This feature makes the training process even safer and more efficient. To evaluate the performance of SEED, we conducted extensive experiments on five manipulation tasks with varying levels of complexity. Our results show that SEED significantly outperforms state-of-the-art RL algorithms in sample efficiency and safety. SEED also exhibits a substantial reduction of human effort compared to other RLHF methods. Further details and video results can be found at https://seediros23.github.io/.


Pages: 7817-7824

Page count: 8
