PROGPROMPT: program generation for situated robot task planning using large language models

被引：8

作者：

Singh, Ishika ^{[1
]}

Blukis, Valts ^{[2
]}

Mousavian, Arsalan ^{[2
]}

Goyal, Ankit ^{[2
]}

Xu, Danfei ^{[2
]}

Tremblay, Jonathan ^{[2
]}

Fox, Dieter ^{[2
,3
]}

Thomason, Jesse ^{[1
]}

Garg, Animesh ^{[2
,4
]}

机构：

[1] Univ Southern Calif, Comp Sci, Los Angeles, CA 90089 USA

[2] NVIDIA, Seattle Robot Lab, Seattle, WA 98105 USA

[3] Univ Washington, Comp Sci & Engn, Seattle, WA 98195 USA

[4] Georgia Inst Technol, Sch Interact Comp, Atlanta, GA 30308 USA

来源：

AUTONOMOUS ROBOTS | 2023年 / 47卷 / 08期

关键词：

Robot task planning; LLM code generation; Planning domain generalization; Symbolic planning;

D O I：

10.1007/s10514-023-10135-3

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Task planning can require defining myriad domain knowledge about the world in which a robot needs to act. To ameliorate that effort, large language models (LLMs) can be used to score potential next actions during task planning, and even generate action sequences directly, given an instruction in natural language with no additional domain information. However, such methods either require enumerating all possible next steps for scoring, or generate free-form text that may contain actions not possible on a given robot in its current context. We present a programmatic LLM prompt structure that enables plan generation functional across situated environments, robot capabilities, and tasks. Our key insight is to prompt the LLM with program-like specifications of the available actions and objects in an environment, as well as with example programs that can be executed. We make concrete recommendations about prompt structure and generation constraints through ablation experiments, demonstrate state of the art success rates in VirtualHome household tasks, and deploy our method on a physical robot arm for tabletop tasks. Website and code at progprompt.github.io

引用

页码：999 / 1012

页数：14

共 50 条

[31] FireRobBrain: Planning for a Firefighting Robot using Knowledge Graph and Large Language Model
Zhang, Jialei
Cai, Shubin
Jiang, Zhiwei
Xiao, Jian
Ming, Zhong
PROCEEDINGS OF THE 2024 IEEE 10TH INTERNATIONAL CONFERENCE ON INTELLIGENT DATA AND SECURITY, IDS 2024, 2024, : 37 - 41
[32] A Survey of Robot Intelligence with Large Language Models
Jeong, Hyeongyo
Lee, Haechan
Kim, Changwon
Shin, Sungtae
APPLIED SCIENCES-BASEL, 2024, 14 (19):
[33] Towards A Holistic Landscape of Situated Theory of Mind in Large Language Models
Ma, Ziqiao
Sansom, Jacob
Peng, Run
Chai, Joyce
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 1011 - 1031
[34] Robot task planning using semantic maps
Galindo, Cipriano
Fernandez-Madrigal, Juan-Antonio
Gonzalez, Javier
Saffiotti, Alessandro
ROBOTICS AND AUTONOMOUS SYSTEMS, 2008, 56 (11) : 955 - 966
[35] Complex Motion Planning for Quadruped Robots Using Large Language Models
Zhang, Xiang
He, Run
Tong, Kai
Man, Shuquan
Tong, Jingyu
Li, Haodong
Zhuang, Huiping
2024 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, ISCAS 2024, 2024,
[36] Interactive Symbol Generation of Task Planning for Daily Assistive Robot
Mikita, Hiroyuki
Azuma, Haseru
Kakiuchi, Yohei
Okada, Kei
Inaba, Masayuki
2012 12TH IEEE-RAS INTERNATIONAL CONFERENCE ON HUMANOID ROBOTS (HUMANOIDS), 2012, : 698 - 703
[37] Assembly Task Modelling Method for Automatic Robot Program Generation
Sartori, Alberto
Mathiesen, Simon Faarvang
Sorensen, Lars Caroe
Waspe, Ralf
Schlette, Christian
2023 11TH INTERNATIONAL CONFERENCE ON CONTROL, MECHATRONICS AND AUTOMATION, ICCMA, 2023, : 245 - 252
[38] Using large language models to evaluate alternative uses task flexibility score
Hadas, Eran
Hershkovitz, Arnon
THINKING SKILLS AND CREATIVITY, 2024, 52
[39] Learning compositional models of robot skills for task and motion planning
Wang, Zi
Garretti, Caelan Reed
Kaelbling, Leslie Pack
Lozano-Perez, Tomas
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2021, 40 (6-7): : 866 - 894
[40] On the Reliability and Explainability of Language Models for Program Generation
Liu, Yue
Tantithamthavorn, Chakkrit
Liu, Yonghui
Li, Li
ACM TRANSACTIONS ON SOFTWARE ENGINEERING AND METHODOLOGY, 2024, 33 (05)

← 1 2 3 4 5 →