Scaling Up and Distilling Down: Language-Guided Robot Skill Acquisition

被引:0
|
作者
Ha, Huy [1 ]
Florence, Pete [2 ]
Song, Shuran [1 ]
机构
[1] Columbia Univ, New York, NY 10027 USA
[2] Google DeepMind, San Francisco, CA USA
来源
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We present a framework for robot skill acquisition, which 1) efficiently scale up data generation of language-labelled robot data and 2) effectively distills this data down into a robust multi-task language-conditioned visuo-motor policy. For (1), we use a large language model (LLM) to guide high-level planning, and sampling-based robot planners (e.g. motion or grasp samplers) for generating diverse and rich manipulation trajectories. To robustify this data-collection process, the LLM also infers a code-snippet for the success condition of each task, simultaneously enabling the data-collection process to detect failure and retry as well as the automatic labeling of trajectories with success/failure. For (2), we extend the diffusion policy single-task behavior-cloning approach to multi-task settings with language conditioning. Finally, we propose a new multi-task benchmark with 18 tasks across five domains to test long-horizon behavior, common-sense reasoning, tool-use, and intuitive physics. We find that our distilled policy successfully learned the robust retrying behavior in its data collection procedure, while improving absolute success rates by 33:2% on average across five domains. All code, data, and qualitative policy results are available at our project website.
引用
收藏
页数:12
相关论文
共 3 条
  • [1] Language-guided Robot Grasping: CLIP-based Referring Grasp Synthesis in Clutter
    Tziafas, Georgios
    Xu, Yucheng
    Goel, Arushi
    Kasaei, Mohammadreza
    Li, Zhibin
    Kasaei, Hamidreza
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [2] Compound Heuristic Information Guided Policy Improvement for Robot Motor Skill Acquisition
    Fu, Jian
    Li, Cong
    Teng, Xiang
    Luo, Fan
    Li, Boqun
    APPLIED SCIENCES-BASEL, 2020, 10 (15):
  • [3] Constructing a Language From Scratch: Combining Bottom-Up and Top-Down Learning Processes in a Computational Model of Language Acquisition
    Gaspers, Judith
    Cimiano, Philipp
    Rohlfing, Katharina
    Wrede, Britta
    IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2017, 9 (02) : 183 - 196