Grounding natural language instructions to semantic goal representations for abstraction and generalization

Cited by: 0
Authors
Dilip Arumugam
Siddharth Karamcheti
Nakul Gopalan
Edward C. Williams
Mina Rhee
Lawson L. S. Wong
Stefanie Tellex
Affiliation
[1] Brown University
Source
Autonomous Robots | 2019 / Vol. 43
Keywords
Natural Language Commands; Language Grounding Models; Argument Binding; Callable Units; Reward Function;
DOI
Not available
Chinese Library Classification number
Subject classification number
Abstract
Language grounding is broadly defined as the problem of mapping natural language instructions to robot behavior. To truly be effective, these language grounding systems must be accurate in their selection of behavior, efficient in the robot’s realization of that selected behavior, and capable of generalizing beyond commands and environment configurations only seen at training time. One choice that is crucial to the success of a language grounding model is the choice of representation used to capture the objective specified by the input command. Prior work has been varied in its use of explicit goal representations, with some approaches lacking a representation altogether, resulting in models that infer whole sequences of robot actions, while other approaches map to carefully constructed logical form representations. While many of the models in either category are reasonably accurate, they fail to offer either efficient execution or any generalization without requiring a large amount of manual specification. In this work, we take a first step towards language grounding models that excel across accuracy, efficiency, and generalization through the construction of simple, semantic goal representations within Markov decision processes. We propose two related semantic goal representations that take advantage of the hierarchical structure of tasks and the compositional nature of language respectively, and present multiple grounding models for each. We validate these ideas empirically with results collected from following text instructions within a simulated mobile-manipulator domain, as well as demonstrations of a physical robot responding to spoken instructions in real time. Our grounding models tie abstraction in language commands to a hierarchical planner for the robot’s execution, enabling a response-time speed-up of several orders of magnitude over baseline planners within sufficiently large domains. 
Concurrently, our grounding models for generalization infer elements of the semantic representation that are subsequently combined to form a complete goal description, enabling the interpretation of commands involving novel combinations never seen during training. Taken together, our results show that the design of semantic goal representation has powerful implications for the accuracy, efficiency, and generalization capabilities of language grounding models.
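The compositional idea in the abstract (inferring separate elements of a goal representation and combining them into a complete goal description) can be illustrated with a small sketch. This is not the authors' implementation: the unit names `agentInRoom` and `blockInRoom`, the room labels, the dictionary-based state, and the helpers `compose_goal` and `reward` are all hypothetical, chosen only to show how a callable unit and a bound argument might define a goal-based reward function.

```python
from dataclasses import dataclass
from typing import Callable, Dict

# Hypothetical world state: maps an object name to the room it is in.
State = Dict[str, str]


@dataclass(frozen=True)
class Goal:
    """A semantic goal: a callable unit plus the argument bound to it."""
    unit: str      # e.g. "agentInRoom" (illustrative name)
    argument: str  # e.g. "red" (illustrative room label)


# Illustrative callable units; each turns a bound argument into a goal test.
CALLABLE_UNITS: Dict[str, Callable[[str], Callable[[State], bool]]] = {
    "agentInRoom": lambda room: lambda s: s["agent"] == room,
    "blockInRoom": lambda room: lambda s: s["block"] == room,
}


def compose_goal(unit: str, argument: str) -> Goal:
    """Combine separately inferred elements into one goal description."""
    if unit not in CALLABLE_UNITS:
        raise KeyError(f"unknown callable unit: {unit}")
    return Goal(unit, argument)


def reward(goal: Goal, state: State) -> float:
    """Goal-based reward: 1.0 when the goal test holds, else 0.0."""
    satisfied = CALLABLE_UNITS[goal.unit](goal.argument)(state)
    return 1.0 if satisfied else 0.0


# Assume training only paired "blockInRoom" with "red" and "agentInRoom"
# with "blue". Predicting the two elements independently still yields
# the unseen pairing below.
novel = compose_goal("blockInRoom", "blue")
print(reward(novel, {"agent": "red", "block": "blue"}))  # 1.0
print(reward(novel, {"agent": "red", "block": "red"}))   # 0.0
```

Because the unit and its argument are chosen independently, a pairing never observed together can still be assembled into a valid goal, which is the kind of generalization the abstract describes. A planner would then be handed the resulting reward function rather than a raw action sequence.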
Pages: 449 - 468
Page count: 19
Related papers
50 in total
  • [1] Grounding natural language instructions to semantic goal representations for abstraction and generalization
    Arumugam, Dilip
    Karamcheti, Siddharth
    Gopalan, Nakul
    Williams, Edward C.
    Rhee, Mina
    Wong, Lawson L. S.
    Tellex, Stefanie
    AUTONOMOUS ROBOTS, 2019, 43 (02) : 449 - 468
  • [2] Evaluation of Word Representations in Grounding Natural Language Instructions through Computational Human-Robot Interaction
    Roesler, Oliver
    Aly, Amir
    Taniguchi, Tadahiro
    Hayashi, Yoshikatsu
    HRI '19: 2019 14TH ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION, 2019, : 307 - 316
  • [3] Draw Me a Flower: Processing and Grounding Abstraction in Natural Language
    Lachmy, Royi
    Pyatkin, Valentina
    Manevich, Avshalom
    Tsarfaty, Reut
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2022, 10 : 1341 - 1356
  • [4] A Model for Verifiable Grounding and Execution of Complex Natural Language Instructions
    Boteanu, Adrian
    Howard, Thomas
    Arkin, Jacob
    Kress-Gazit, Hadas
    2016 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2016), 2016, : 2649 - 2654
  • [6] Semantic Representations for Multilingual Natural Language Processing
    Kozerenko, Elena B.
    2019 6TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI 2019), 2019, : 433 - 438
  • [7] Differentiable Parsing and Visual Grounding of Natural Language Instructions for Object Placement
    Zhao, Zirui
    Lee, Wee Sun
    Hsu, David
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA 2023), 2023, : 11546 - 11553
  • [8] Natural language instructions induce compositional generalization in networks of neurons
    Riveland, Reidar
    Pouget, Alexandre
    NATURE NEUROSCIENCE, 2024, 27 (05) : 988 - 999
  • [9] SEMANTIC GENERALIZATION - IAR LOCUS AND INSTRUCTIONS
    CRAMER, P
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1970, 83 (02) : 266 - &
  • [10] Learning Structured Natural Language Representations for Semantic Parsing
    Cheng, Jianpeng
    Reddy, Siva
    Saraswat, Vijay
    Lapata, Mirella
    PROCEEDINGS OF THE 55TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2017), VOL 1, 2017, : 44 - 55