Grounding natural language instructions to semantic goal representations for abstraction and generalization

被引：0

作者：

Dilip Arumugam

Siddharth Karamcheti

Nakul Gopalan

Edward C. Williams

Mina Rhee

Lawson L. S. Wong

Stefanie Tellex

机构：

[1] Brown University,

来源：

Autonomous Robots | 2019年 / 43卷

关键词：

Natural Language Commands; Language Grounding Models; Argument Binding; Callable Units; Reward Function;

D O I：

暂无

中图分类号：

学科分类号：

摘要：

Language grounding is broadly defined as the problem of mapping natural language instructions to robot behavior. To truly be effective, these language grounding systems must be accurate in their selection of behavior, efficient in the robot’s realization of that selected behavior, and capable of generalizing beyond commands and environment configurations only seen at training time. One choice that is crucial to the success of a language grounding model is the choice of representation used to capture the objective specified by the input command. Prior work has been varied in its use of explicit goal representations, with some approaches lacking a representation altogether, resulting in models that infer whole sequences of robot actions, while other approaches map to carefully constructed logical form representations. While many of the models in either category are reasonably accurate, they fail to offer either efficient execution or any generalization without requiring a large amount of manual specification. In this work, we take a first step towards language grounding models that excel across accuracy, efficiency, and generalization through the construction of simple, semantic goal representations within Markov decision processes. We propose two related semantic goal representations that take advantage of the hierarchical structure of tasks and the compositional nature of language respectively, and present multiple grounding models for each. We validate these ideas empirically with results collected from following text instructions within a simulated mobile-manipulator domain, as well as demonstrations of a physical robot responding to spoken instructions in real time. Our grounding models tie abstraction in language commands to a hierarchical planner for the robot’s execution, enabling a response-time speed-up of several orders of magnitude over baseline planners within sufficiently large domains. Concurrently, our grounding models for generalization infer elements of the semantic representation that are subsequently combined to form a complete goal description, enabling the interpretation of commands involving novel combinations never seen during training. Taken together, our results show that the design of semantic goal representation has powerful implications for the accuracy, efficiency, and generalization capabilities of language grounding models.

引用

页码：449 / 468

页数：19

共 50 条

[1] Grounding natural language instructions to semantic goal representations for abstraction and generalization
Arumugam, Dilip
Karamcheti, Siddharth
Gopalan, Nakul
Williams, Edward C.
Rhee, Mina
Wong, Lawson L. S.
Tellex, Stefanie
AUTONOMOUS ROBOTS, 2019, 43 (02) : 449 - 468
[2] Evaluation of Word Representations in Grounding Natural Language Instructions through Computational Human-Robot Interaction
Roesler, Oliver
Aly, Amir
Taniguchi, Tadahiro
Hayashi, Yoshikatsu
HRI '19: 2019 14TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2019, : 307 - 316
[3] Draw Me a Flower: Processing and Grounding Abstraction in Natural Language
Lachmy, Royi
Pyatkin, Valentina
Manevich, Avshalom
Tsarfaty, Reut
TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 : 1341 - 1356
[4] A Model for Verifiable Grounding and Execution of Complex Natural Language Instructions
Boteanu, Adrian
Howard, Thomas
Arkin, Jacob
Kress-Gazit, Hadas
2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 2649 - 2654
[5] Integration of Natural Language and Vision Processing: Grounding representations
McKevitt, P
ARTIFICIAL INTELLIGENCE REVIEW, 1996, 10 (1-2) : 7 - 13
[6] Semantic Representations for Multilingual Natural Language Processing
Kozerenko, Elena B.
2019 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2019), 2019, : 433 - 438
[7] Differentiable Parsing and Visual Grounding of Natural Language Instructions for Object Placement
Zhao, Zirui
Lee, Wee Sun
Hsu, David
2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 11546 - 11553
[8] Natural language instructions induce compositional generalization in networks of neurons
Riveland, Reidar
Pouget, Alexandre
NATURE NEUROSCIENCE, 2024, 27 (05) : 988 - 999
[9] SEMANTIC GENERALIZATION - IAR LOCUS AND INSTRUCTIONS
CRAMER, P
JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1970, 83 (02): : 266 - &
[10] Learning Structured Natural Language Representations for Semantic Parsing
Cheng, Jianpeng
Reddy, Siva
Saraswat, Vijay
Lapata, Mirella
PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 44 - 55

← 1 2 3 4 5 →