Compositional Networks Enable Systematic Generalization for Grounded Language Understanding

Cited by: 0

Authors:

Kuo, Yen-Ling [1]

Katz, Boris [1]

Barbu, Andrei [1]

Affiliations:

[1] MIT, CSAIL & CBMM, Cambridge, MA 02139 USA

Source:

FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021 | 2021

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Humans are remarkably flexible when understanding new sentences that include combinations of concepts they have never encountered before. Recent work has shown that while deep networks can mimic some human language abilities when presented with novel sentences, systematic variation uncovers the limitations in the language-understanding abilities of networks. We demonstrate that these limitations can be overcome by addressing the generalization challenges in the gSCAN dataset, which explicitly measures how well an agent is able to interpret novel linguistic commands grounded in vision, e.g., novel pairings of adjectives and nouns. The key principle we employ is compositionality: that the compositional structure of networks should reflect the compositional structure of the problem domain they address, while allowing other parameters to be learned end-to-end. We build a general-purpose mechanism that enables agents to generalize their language understanding to compositional domains. Crucially, our network has the same state-of-the-art performance as prior work while generalizing its knowledge when prior work does not. Our network also provides a level of interpretability that enables users to inspect what each part of networks learns. Robust grounded language understanding without dramatic failures and without corner cases is critical to building safe and fair robots; we demonstrate the significant role that compositionality can play in achieving that goal.

Pages: 216-226

Number of pages: 11

50 items in total

[1] A Benchmark for Systematic Generalization in Grounded Language Understanding
Ruis, Laura
Andreas, Jacob
Baroni, Marco
Bouchacourt, Diane
Lake, Brenden M.
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS (NEURIPS 2020), 2020, 33
[2] Compositional Generalization with Grounded Language Models
Wold, Sondre
Simon, Etienne
Georges, Lucas
Charpentier, Gabriel
Kostylev, Egor V.
Velldal, Erik
Ovrelid, Lilja
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 3447 - 3460
[3] Compositional Generalization in Spoken Language Understanding
Ray, Avik
Shen, Yilin
Jin, Hongxia
INTERSPEECH 2023, 2023, : 750 - 754
[4] Compositional Generalization in Grounded Language Learning via Induced Model Sparsity
Spilsbury, Sam
Ilin, Alexander
NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES: PROCEEDINGS OF THE STUDENT RESEARCH WORKSHOP, 2022, : 143 - 155
[5] Latent Compositional Representations Improve Systematic Generalization in Grounded Question Answering
Bogin, Ben
Subramanian, Sanjay
Gardner, Matt
Berant, Jonathan
TRANSACTIONS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, 2021, 9 : 195 - 210
[6] Natural language instructions induce compositional generalization in networks of neurons
Riveland, Reidar
Pouget, Alexandre
NATURE NEUROSCIENCE, 2024, 27 (05) : 988 - 999
[7] Grounded Compositional Outputs for Adaptive Language Modeling
Pappas, Nikolaos
Mulcaire, Phoebe
Smith, Noah A.
PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 1252 - 1267
[8] Grounded Graph Decoding Improves Compositional Generalization in Question Answering
Gai, Yu
Jain, Paras
Zhang, Wendi
Gonzalez, Joseph
Song, Dawn
Stoica, Ion
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021, 2021, : 1829 - 1838
[9] Hierarchical Poset Decoding for Compositional Generalization in Language
Guo, Yinuo
Lin, Zeqi
Lou, Jian-Guang
Zhang, Dongmei
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
[10] The emergence of compositional structures in perceptually grounded language games
Vogt, P
ARTIFICIAL INTELLIGENCE, 2005, 167 (1-2) : 206 - 242