PROGPROMPT: program generation for situated robot task planning using large language models

被引:8
|
作者
Singh, Ishika [1 ]
Blukis, Valts [2 ]
Mousavian, Arsalan [2 ]
Goyal, Ankit [2 ]
Xu, Danfei [2 ]
Tremblay, Jonathan [2 ]
Fox, Dieter [2 ,3 ]
Thomason, Jesse [1 ]
Garg, Animesh [2 ,4 ]
机构
[1] Univ Southern Calif, Comp Sci, Los Angeles, CA 90089 USA
[2] NVIDIA, Seattle Robot Lab, Seattle, WA 98105 USA
[3] Univ Washington, Comp Sci & Engn, Seattle, WA 98195 USA
[4] Georgia Inst Technol, Sch Interact Comp, Atlanta, GA 30308 USA
关键词
Robot task planning; LLM code generation; Planning domain generalization; Symbolic planning;
D O I
10.1007/s10514-023-10135-3
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Task planning can require defining myriad domain knowledge about the world in which a robot needs to act. To ameliorate that effort, large language models (LLMs) can be used to score potential next actions during task planning, and even generate action sequences directly, given an instruction in natural language with no additional domain information. However, such methods either require enumerating all possible next steps for scoring, or generate free-form text that may contain actions not possible on a given robot in its current context. We present a programmatic LLM prompt structure that enables plan generation functional across situated environments, robot capabilities, and tasks. Our key insight is to prompt the LLM with program-like specifications of the available actions and objects in an environment, as well as with example programs that can be executed. We make concrete recommendations about prompt structure and generation constraints through ablation experiments, demonstrate state of the art success rates in VirtualHome household tasks, and deploy our method on a physical robot arm for tabletop tasks. Website and code at progprompt.github.io
引用
收藏
页码:999 / 1012
页数:14
相关论文
共 50 条
  • [31] FireRobBrain: Planning for a Firefighting Robot using Knowledge Graph and Large Language Model
    Zhang, Jialei
    Cai, Shubin
    Jiang, Zhiwei
    Xiao, Jian
    Ming, Zhong
    PROCEEDINGS OF THE 2024 IEEE 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT DATA AND SECURITY, IDS 2024, 2024, : 37 - 41
  • [32] A Survey of Robot Intelligence with Large Language Models
    Jeong, Hyeongyo
    Lee, Haechan
    Kim, Changwon
    Shin, Sungtae
    APPLIED SCIENCES-BASEL, 2024, 14 (19):
  • [33] Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models
    Ma, Ziqiao
    Sansom, Jacob
    Peng, Run
    Chai, Joyce
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 1011 - 1031
  • [34] Robot task planning using semantic maps
    Galindo, Cipriano
    Fernandez-Madrigal, Juan-Antonio
    Gonzalez, Javier
    Saffiotti, Alessandro
    ROBOTICS AND AUTONOMOUS SYSTEMS, 2008, 56 (11) : 955 - 966
  • [35] Complex Motion Planning for Quadruped Robots Using Large Language Models
    Zhang, Xiang
    He, Run
    Tong, Kai
    Man, Shuquan
    Tong, Jingyu
    Li, Haodong
    Zhuang, Huiping
    2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
  • [36] Interactive Symbol Generation of Task Planning for Daily Assistive Robot
    Mikita, Hiroyuki
    Azuma, Haseru
    Kakiuchi, Yohei
    Okada, Kei
    Inaba, Masayuki
    2012 12TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2012, : 698 - 703
  • [37] Assembly Task Modelling Method for Automatic Robot Program Generation
    Sartori, Alberto
    Mathiesen, Simon Faarvang
    Sorensen, Lars Caroe
    Waspe, Ralf
    Schlette, Christian
    2023 11TH INTERNATIONAL CONFERENCE ON CONTROL, MECHATRONICS AND AUTOMATION, ICCMA, 2023, : 245 - 252
  • [38] Using large language models to evaluate alternative uses task flexibility score
    Hadas, Eran
    Hershkovitz, Arnon
    THINKING SKILLS AND CREATIVITY, 2024, 52
  • [39] Learning compositional models of robot skills for task and motion planning
    Wang, Zi
    Garretti, Caelan Reed
    Kaelbling, Leslie Pack
    Lozano-Perez, Tomas
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2021, 40 (6-7): : 866 - 894
  • [40] On the Reliability and Explainability of Language Models for Program Generation
    Liu, Yue
    Tantithamthavorn, Chakkrit
    Liu, Yonghui
    Li, Li
    ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2024, 33 (05)