ProgPrompt: program generation for situated robot task planning using large language models

Cited by: 8
Authors
Singh, Ishika [1]
Blukis, Valts [2]
Mousavian, Arsalan [2]
Goyal, Ankit [2]
Xu, Danfei [2]
Tremblay, Jonathan [2]
Fox, Dieter [2, 3]
Thomason, Jesse [1]
Garg, Animesh [2, 4]
Affiliations
[1] Univ Southern Calif, Comp Sci, Los Angeles, CA 90089 USA
[2] NVIDIA, Seattle Robot Lab, Seattle, WA 98105 USA
[3] Univ Washington, Comp Sci & Engn, Seattle, WA 98195 USA
[4] Georgia Inst Technol, Sch Interact Comp, Atlanta, GA 30308 USA
Keywords
Robot task planning; LLM code generation; Planning domain generalization; Symbolic planning
DOI
10.1007/s10514-023-10135-3
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Task planning can require defining a great deal of domain knowledge about the world in which a robot needs to act. To reduce that effort, large language models (LLMs) can be used to score potential next actions during task planning, and even to generate action sequences directly, given an instruction in natural language with no additional domain information. However, such methods either require enumerating all possible next steps for scoring, or generate free-form text that may contain actions not possible on a given robot in its current context. We present a programmatic LLM prompt structure that enables plan generation that is functional across situated environments, robot capabilities, and tasks. Our key insight is to prompt the LLM with program-like specifications of the available actions and objects in an environment, as well as with example programs that can be executed. We make concrete recommendations about prompt structure and generation constraints through ablation experiments, demonstrate state-of-the-art success rates in VirtualHome household tasks, and deploy our method on a physical robot arm for tabletop tasks. Website and code at progprompt.github.io.
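The prompt structure described in the abstract (program-like specifications of the available actions and objects, followed by executable example programs) might be assembled roughly as in the sketch below. This is an illustration only, not the authors' implementation: the function `build_prompt`, the action and object names, and the exact layout of the prompt are all assumptions made for the example.

```python
def build_prompt(actions, objects, examples, task_name):
    """Assemble a program-like prompt (hypothetical layout, not the authors' exact format)."""
    # Available actions are presented to the LLM as an import statement.
    header = "from actions import " + ", ".join(actions)
    # Available objects are presented as a list literal.
    object_line = "objects = " + repr(list(objects))
    # The new task is an open function header that the LLM is asked to complete.
    query = f"def {task_name}():"
    return "\n\n".join([header, object_line, *examples, query]) + "\n"


# Hypothetical action and object names, chosen only for illustration.
ACTIONS = ["find", "grab", "putin", "open", "close"]
OBJECTS = ["salmon", "microwave", "fridge"]

# One example program showing the LLM what an executable plan looks like.
EXAMPLE = '''def put_salmon_in_fridge():
    # 1: get the salmon
    find('salmon')
    grab('salmon')
    # 2: store it in the fridge
    find('fridge')
    open('fridge')
    putin('salmon', 'fridge')
    close('fridge')'''

prompt = build_prompt(ACTIONS, OBJECTS, [EXAMPLE], "microwave_salmon")
print(prompt)
```

The resulting string would be passed to a code-completion LLM, which is expected to continue the body of the final function using only the imported actions and the listed objects.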
Pages: 999-1012
Page count: 14