PROGPROMPT: program generation for situated robot task planning using large language models

被引:8
|
作者
Singh, Ishika [1 ]
Blukis, Valts [2 ]
Mousavian, Arsalan [2 ]
Goyal, Ankit [2 ]
Xu, Danfei [2 ]
Tremblay, Jonathan [2 ]
Fox, Dieter [2 ,3 ]
Thomason, Jesse [1 ]
Garg, Animesh [2 ,4 ]
机构
[1] Univ Southern Calif, Comp Sci, Los Angeles, CA 90089 USA
[2] NVIDIA, Seattle Robot Lab, Seattle, WA 98105 USA
[3] Univ Washington, Comp Sci & Engn, Seattle, WA 98195 USA
[4] Georgia Inst Technol, Sch Interact Comp, Atlanta, GA 30308 USA
关键词
Robot task planning; LLM code generation; Planning domain generalization; Symbolic planning;
D O I
10.1007/s10514-023-10135-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Task planning can require defining myriad domain knowledge about the world in which a robot needs to act. To ameliorate that effort, large language models (LLMs) can be used to score potential next actions during task planning, and even generate action sequences directly, given an instruction in natural language with no additional domain information. However, such methods either require enumerating all possible next steps for scoring, or generate free-form text that may contain actions not possible on a given robot in its current context. We present a programmatic LLM prompt structure that enables plan generation functional across situated environments, robot capabilities, and tasks. Our key insight is to prompt the LLM with program-like specifications of the available actions and objects in an environment, as well as with example programs that can be executed. We make concrete recommendations about prompt structure and generation constraints through ablation experiments, demonstrate state of the art success rates in VirtualHome household tasks, and deploy our method on a physical robot arm for tabletop tasks. Website and code at progprompt.github.io
引用
收藏
页码:999 / 1012
页数:14
相关论文
共 50 条
  • [21] Conditionally Combining Robot Skills using Large Language Models
    Zentner, K. R.
    Julian, Ryan
    Ichter, Brian
    Sukhatme, Gaurav S.
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2024), 2024, : 14046 - 14053
  • [22] Generative Expressive Robot Behaviors using Large Language Models
    Mahadevan, Karthik
    Chien, Jonathan
    Brown, Noah
    Xu, Zhuo
    Parada, Carolina
    Xia, Fei
    Zeng, Andy
    Takayama, Leila
    Sadigh, Dorsa
    PROCEEDINGS OF THE 2024 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, HRI 2024, 2024, : 482 - 491
  • [23] Real-time emotion generation in human-robot dialogue using large language models
    Mishra, Chinmaya
    Verdonschot, Rinus
    Hagoort, Peter
    Skantze, Gabriel
    FRONTIERS IN ROBOTICS AND AI, 2023, 10
  • [24] INFORMATION-SYSTEM STRUCTURE FOR A TASK LEVEL ROBOT ASSEMBLY LANGUAGE FOR ONLINE PROGRAM GENERATION
    HOLM, H
    BOSS, HP
    PETURSSON, H
    NIELSEN, JA
    ROBOTICS AND COMPUTER-INTEGRATED MANUFACTURING, 1993, 10 (1-2) : 77 - 88
  • [25] FLTRNN: Faithful Long-Horizon Task Planning for Robotics with Large Language Models
    Zhang, Jiatao
    Tang, Lanling
    Song, Yufan
    Menge, Qiwei
    Qian, Haofu
    Shao, Jun
    Song, Wei
    Zhu, Shiqiang
    Gu, Jason
    2024 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA 2024, 2024, : 6680 - 6686
  • [26] FLTRNN: Faithful Long-Horizon Task Planning for Robotics with Large Language Models
    Song, Wei (songweizju@163.com), 1600, Institute of Electrical and Electronics Engineers Inc.
  • [27] Multi-Robot Task Planning and Sequencing using the SAT-TSP Language
    Imeson, Frank
    Smith, Stephen L.
    2015 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2015, : 5397 - 5402
  • [28] SpecGen: Automated Generation of Formal Program Specifications via Large Language Models
    Ma, Lezhi
    Liu, Shangqing
    Li, Yi
    Xie, Xiaofei
    Bu, Lei
    arXiv, 1600,
  • [29] Automation of Network Configuration Generation using Large Language Models
    Chakraborty, Supratim
    Chitta, Nithin
    Sundaresan, Rajesh
    2024 20TH INTERNATIONAL CONFERENCE ON NETWORK AND SERVICE MANAGEMENT, CNSM 2024, 2024,
  • [30] Opinerium: Subjective Question Generation Using Large Language Models
    Babakhani, Pedram
    Lommatzsch, Andreas
    Brodt, Torben
    Sacker, Doreen
    Sivrikaya, Fikret
    Albayrak, Sahin
    IEEE ACCESS, 2024, 12 : 66085 - 66099