Compositional Networks Enable Systematic Generalization for Grounded Language Understanding

Cited by: 0
Authors
Kuo, Yen-Ling [1]
Katz, Boris [1]
Barbu, Andrei [1]
Affiliations
[1] MIT, CSAIL & CBMM, Cambridge, MA 02139 USA
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Humans are remarkably flexible when understanding new sentences that include combinations of concepts they have never encountered before. Recent work has shown that while deep networks can mimic some human language abilities when presented with novel sentences, systematic variation uncovers the limitations in the language-understanding abilities of networks. We demonstrate that these limitations can be overcome by addressing the generalization challenges in the gSCAN dataset, which explicitly measures how well an agent is able to interpret novel linguistic commands grounded in vision, e.g., novel pairings of adjectives and nouns. The key principle we employ is compositionality: that the compositional structure of networks should reflect the compositional structure of the problem domain they address, while allowing other parameters to be learned end-to-end. We build a general-purpose mechanism that enables agents to generalize their language understanding to compositional domains. Crucially, our network has the same state-of-the-art performance as prior work while generalizing its knowledge when prior work does not. Our network also provides a level of interpretability that enables users to inspect what each part of the network learns. Robust grounded language understanding without dramatic failures and without corner cases is critical to building safe and fair robots; we demonstrate the significant role that compositionality can play in achieving that goal.
Pages: 216-226
Page count: 11
Related papers
50 in total
  • [1] A Benchmark for Systematic Generalization in Grounded Language Understanding
    Ruis, Laura
    Andreas, Jacob
    Baroni, Marco
    Bouchacourt, Diane
    Lake, Brenden M.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS (NEURIPS 2020), 2020, 33
  • [2] Compositional Generalization with Grounded Language Models
    Wold, Sondre
    Simon, Etienne
    Georges, Lucas
    Charpentier, Gabriel
    Kostylev, Egor V.
    Velldal, Erik
    Ovrelid, Lilja
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 3447 - 3460
  • [3] Compositional Generalization in Spoken Language Understanding
    Ray, Avik
    Shen, Yilin
    Jin, Hongxia
    INTERSPEECH 2023, 2023, : 750 - 754
  • [4] Compositional Generalization in Grounded Language Learning via Induced Model Sparsity
    Spilsbury, Sam
    Ilin, Alexander
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2022, : 143 - 155
  • [5] Latent Compositional Representations Improve Systematic Generalization in Grounded Question Answering
    Bogin, Ben
    Subramanian, Sanjay
    Gardner, Matt
    Berant, Jonathan
    TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2021, 9 : 195 - 210
  • [6] Natural language instructions induce compositional generalization in networks of neurons
    Riveland, Reidar
    Pouget, Alexandre
    NATURE NEUROSCIENCE, 2024, 27 (05) : 988 - 999
  • [7] Grounded Compositional Outputs for Adaptive Language Modeling
    Pappas, Nikolaos
    Mulcaire, Phoebe
    Smith, Noah A.
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 1252 - 1267
  • [8] Grounded Graph Decoding Improves Compositional Generalization in Question Answering
    Gai, Yu
    Jain, Paras
    Zhang, Wendi
    Gonzalez, Joseph
    Song, Dawn
    Stoica, Ion
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1829 - 1838
  • [9] Hierarchical Poset Decoding for Compositional Generalization in Language
    Guo, Yinuo
    Lin, Zeqi
    Lou, Jian-Guang
    Zhang, Dongmei
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [10] The emergence of compositional structures in perceptually grounded language games
    Vogt, P
    ARTIFICIAL INTELLIGENCE, 2005, 167 (1-2) : 206 - 242