PROGPROMPT: program generation for situated robot task planning using large language models

Cited by: 8
Authors
Singh, Ishika [1]
Blukis, Valts [2]
Mousavian, Arsalan [2]
Goyal, Ankit [2]
Xu, Danfei [2]
Tremblay, Jonathan [2]
Fox, Dieter [2, 3]
Thomason, Jesse [1]
Garg, Animesh [2, 4]
Affiliations
[1] Univ Southern Calif, Comp Sci, Los Angeles, CA 90089 USA
[2] NVIDIA, Seattle Robot Lab, Seattle, WA 98105 USA
[3] Univ Washington, Comp Sci & Engn, Seattle, WA 98195 USA
[4] Georgia Inst Technol, Sch Interact Comp, Atlanta, GA 30308 USA
Keywords
Robot task planning; LLM code generation; Planning domain generalization; Symbolic planning;
DOI
10.1007/s10514-023-10135-3
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405;
Abstract
Task planning can require defining myriad domain knowledge about the world in which a robot needs to act. To ameliorate that effort, large language models (LLMs) can be used to score potential next actions during task planning, and even generate action sequences directly, given an instruction in natural language with no additional domain information. However, such methods either require enumerating all possible next steps for scoring, or generate free-form text that may contain actions not possible on a given robot in its current context. We present a programmatic LLM prompt structure that enables plan generation functional across situated environments, robot capabilities, and tasks. Our key insight is to prompt the LLM with program-like specifications of the available actions and objects in an environment, as well as with example programs that can be executed. We make concrete recommendations about prompt structure and generation constraints through ablation experiments, demonstrate state-of-the-art success rates in VirtualHome household tasks, and deploy our method on a physical robot arm for tabletop tasks. Website and code are available at progprompt.github.io.
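The prompt structure described in the abstract (program-like specifications of actions and objects, plus example programs) can be illustrated with a minimal Python sketch. This is not the authors' code: the function names `build_prompt` and `unsupported_steps`, the action names, the object names, and the example task are all hypothetical, chosen only to show the general shape of such a prompt and one simple way to flag generated steps that fall outside the declared actions and objects. The actual prompt format is documented at progprompt.github.io.

```python
import ast
from typing import Dict, List


def build_prompt(actions: List[str], objects: List[str],
                 examples: Dict[str, List[str]], task: str) -> str:
    """Assemble a program-like prompt (hypothetical layout): an import line
    listing available actions, a list of available objects, example
    programs, and the header of the function the LLM should complete."""
    lines = [f"from actions import {', '.join(actions)}", ""]
    lines.append(f"objects = {objects!r}")
    lines.append("")
    for name, body in examples.items():
        lines.append(f"def {name}():")
        lines.extend(f"    {step}" for step in body)
        lines.append("")
    lines.append(f"def {task}():")
    return "\n".join(lines)


def unsupported_steps(program: str, actions: List[str],
                      objects: List[str]) -> List[str]:
    """Return a description of every call in `program` that uses an action
    or a string argument not declared in the prompt."""
    problems = []
    for node in ast.walk(ast.parse(program)):
        if isinstance(node, ast.Call) and isinstance(node.func, ast.Name):
            if node.func.id not in actions:
                problems.append(f"unknown action: {node.func.id}")
            for arg in node.args:
                if isinstance(arg, ast.Constant) and arg.value not in objects:
                    problems.append(f"unknown object: {arg.value}")
    return problems


# Hypothetical environment and one example program.
ACTIONS = ["find", "grab", "putin"]
OBJECTS = ["salmon", "lime", "microwave", "garbagecan"]
EXAMPLES = {
    "throw_away_lime": [
        "find('lime')",
        "grab('lime')",
        "find('garbagecan')",
        "putin('lime', 'garbagecan')",
    ]
}

prompt = build_prompt(ACTIONS, OBJECTS, EXAMPLES, "microwave_salmon")

# A made-up completion, standing in for what an LLM might return.
generated = "def microwave_salmon():\n    find('salmon')\n    fly('salmon')\n"
issues = unsupported_steps(generated, ACTIONS, OBJECTS)
```

Because the prompt ends with an open function header, the model is nudged to continue in the same program format as the examples, and because the plan is ordinary Python syntax, it can be parsed and checked against the declared actions and objects before anything is executed.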
Pages: 999 - 1012
Page count: 14
Related papers
50 in total
  • [1] ProgPrompt: program generation for situated robot task planning using large language models
    Ishika Singh
    Valts Blukis
    Arsalan Mousavian
    Ankit Goyal
    Danfei Xu
    Jonathan Tremblay
    Dieter Fox
    Jesse Thomason
    Animesh Garg
    Autonomous Robots, 2023, 47 : 999 - 1012
  • [2] PROGPROMPT: Generating Situated Robot Task Plans using Large Language Models
    Singh, Ishika
    Blukis, Valts
    Mousavian, Arsalan
    Goyal, Ankit
    Xu, Danfei
    Tremblay, Jonathan
    Fox, Dieter
    Thomason, Jesse
    Garg, Animesh
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023 : 11523 - 11530
  • [3] Task Planning for a Factory Robot Using Large Language Model
    Tsushima, Yosuke
    Yamamoto, Shu
    Ravankar, Ankit A.
    Luces, Jose Victorio Salazar
    Hirata, Yasuhisa
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2025, 10 (03) : 2383 - 2390
  • [4] A framework for neurosymbolic robot action planning using large language models
    Capitanelli, Alessio
    Mastrogiovanni, Fulvio
    FRONTIERS IN NEUROROBOTICS, 2024, 18
  • [5] Generation of Robot Manipulation Plans Using Generative Large Language Models
    Toberg, Jan-Philipp
    Cimiano, Philipp
    2023 SEVENTH IEEE INTERNATIONAL CONFERENCE ON ROBOTIC COMPUTING, IRC 2023, 2023 : 190 - 197
  • [6] Enhancing Robot Task Planning and Execution through Multi-Layer Large Language Models
    Luan, Zhirong
    Lai, Yujun
    Huang, Rundong
    Bai, Shuanghao
    Zhang, Yuedi
    Zhang, Haoran
    Wang, Qian
    SENSORS, 2024, 24 (05)
  • [7] SayPlan: Grounding Large Language Models using 3D Scene Graphs for Scalable Robot Task Planning
    Rana, Krishan
    Haviland, Jesse
    Garg, Sourav
    Abou-Chakra, Jad
    Reid, Ian
    Sunderhauf, Niko
    CONFERENCE ON ROBOT LEARNING, VOL 229, 2023, 229
  • [8] Automated planning for situated natural language generation
    Garoufi, Konstantina
    Koller, Alexander
    ACL 2010: 48TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2010 : 1573 - 1582
  • [9] Task and Motion Planning with Large Language Models for Object Rearrangement
    Ding, Yan
    Zhang, Xiaohan
    Paxton, Chris
    Zhang, Shiqi
    2023 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, IROS, 2023 : 2086 - 2092
  • [10] Action Contextualization: Adaptive Task Planning and Action Tuning Using Large Language Models
    Gupta, Sthithpragya
    Yao, Kunpeng
    Niederhauser, Loic
    Billard, Aude
    IEEE ROBOTICS AND AUTOMATION LETTERS, 2024, 9 (11) : 9407 - 9414