Primitive Skill-based Robot Learning from Human Evaluative Feedback

被引:1
|
作者
Hiranaka, Ayano [1 ]
Hwang, Minjune [2 ]
Lee, Sharon [2 ]
Wang, Chen [2 ]
Fei-Fei, Li [2 ]
Wu, Jiajun [2 ]
Zhang, Ruohan [2 ]
机构
[1] Stanford Univ, Dept Mech Engn, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
关键词
D O I
10.1109/IROS55552.2023.10341912
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Reinforcement learning (RL) algorithms face significant challenges when dealing with long-horizon robot manipulation tasks in real-world environments due to sample inefficiency and safety issues. To overcome these challenges, we propose a novel framework, SEED, which leverages two approaches: reinforcement learning from human feedback (RLHF) and primitive skill-based reinforcement learning. Both approaches are particularly effective in addressing sparse reward issues and the complexities involved in long-horizon tasks. By combining them, SEED reduces the human effort required in RLHF and increases safety in training robot manipulation with RL in real-world settings. Additionally, parameterized skills provide a clear view of the agent's high-level intentions, allowing humans to evaluate skill choices before they are executed. This feature makes the training process even safer and more efficient. To evaluate the performance of SEED, we conducted extensive experiments on five manipulation tasks with varying levels of complexity. Our results show that SEED significantly outperforms state-of-the-art RL algorithms in sample efficiency and safety. In addition, SEED also exhibits a substantial reduction of human effort compared to other RLHF methods. Further details and video results can be found at https: //seediros23.github.io/.
引用
收藏
页码:7817 / 7824
页数:8
相关论文
共 50 条
  • [1] On a primitive skill-based supervisory robot control architecture
    Milighetti, G
    Kuntze, HB
    Frey, CW
    Diestel-Feddersen, B
    Balzer, J
    2005 12th International Conference on Advanced Robotics, 2005, : 141 - 147
  • [2] Learning Skill-based Industrial Robot Tasks with User Priors Density
    Mayr, Matthias
    Hvarfner, Carl
    Chatzilygeroudis, Konstantinos
    Nardi, Luigi
    Krueger, Volker
    2022 IEEE 18TH INTERNATIONAL CONFERENCE ON AUTOMATION SCIENCE AND ENGINEERING (CASE), 2022, : 1485 - 1492
  • [3] Skill-based learning of an assembly process
    Skill-basiertes Lernen für Montageprozesse
    Akkaladevi, Sharath Chandra (sharath.akkaladevi@profactor.at), 1600, Springer-Verlag Wien (134):
  • [4] Skill-based human–robot cooperation in tele-operated path tracking
    Nima Enayati
    Giancarlo Ferrigno
    Elena De Momi
    Autonomous Robots, 2018, 42 : 997 - 1009
  • [5] Learning and Retrieval from Prior Data for Skill-based Imitation Learning
    Nasiriany, Soroush
    Gao, Tian
    Mandlekar, Ajay
    Zhu, Yuke
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 2181 - 2204
  • [6] Flexible skill-based control for robot cells in manufacturing
    Wiese, Torben
    Abicht, Johannes
    Friedrich, Christian
    Hellmich, Arvid
    Ihlenfeldt, Steffen
    FRONTIERS IN ROBOTICS AND AI, 2022, 9
  • [7] Skill-based human-robot cooperation in tele-operated path tracking
    Enayati, Nima
    Ferrigno, Giancarlo
    De Momi, Elena
    AUTONOMOUS ROBOTS, 2018, 42 (05) : 997 - 1009
  • [8] A Skill-Based MILP Model in Cellular Manufacturing Systems with Human-Robot Collaboration
    Yetkin, Busra Nur
    Ulutas, Berna Haktanirlar
    IFAC PAPERSONLINE, 2022, 55 (10): : 1728 - 1733
  • [9] Skill-based Model-based Reinforcement Learning
    Shi, Lucy Xiaoyang
    Lim, Joseph J.
    Lee, Youngwoon
    CONFERENCE ON ROBOT LEARNING, VOL 205, 2022, 205 : 2262 - 2272
  • [10] Learning to Form Skill-based Teams of Experts
    Rad, Radin Hamidi
    Fani, Hossein
    Kargar, Mehdi
    Szlichta, Jaroslaw
    Bagheri, Ebrahim
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 2049 - 2052