Compositional Networks Enable Systematic Generalization for Grounded Language Understanding

被引：0

作者：

Kuo, Yen-Ling ^{[1
]}

Katz, Boris ^{[1
]}

Barbu, Andrei ^{[1
]}

机构：

[1] MIT, CSAIL & CBMM, Cambridge, MA 02139 USA

来源：

FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2021 | 2021年

关键词：

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Humans are remarkably flexible when understanding new sentences that include combinations of concepts they have never encountered before. Recent work has shown that while deep networks can mimic some human language abilities when presented with novel sentences, systematic variation uncovers the limitations in the language-understanding abilities of networks. We demonstrate that these limitations can be overcome by addressing the generalization challenges in the gSCAN dataset, which explicitly measures how well an agent is able to interpret novel linguistic commands grounded in vision, e.g., novel pairings of adjectives and nouns. The key principle we employ is compositionality: that the compositional structure of networks should reflect the compositional structure of the problem domain they address, while allowing other parameters to be learned end-to-end. We build a general-purpose mechanism that enables agents to generalize their language understanding to compositional domains. Crucially, our network has the same state-of-theart performance as prior work while generalizing its knowledge when prior work does not. Our network also provides a level of interpretability that enables users to inspect what each part of networks learns. Robust grounded language understanding without dramatic failures and without corner cases is critical to building safe and fair robots; we demonstrate the significant role that compositionality can play in achieving that goal.

引用

页码：216 / 226

页数：11

共 50 条

[41] Improving Grounded Natural Language Understanding through Human-Robot Dialog
Thomason, Jesse
Padmakumar, Aishwarya
Sinapov, Jivko
Walker, Nick
Jiang, Yuqian
Yedidsion, Harel
Hart, Justin
Stone, Peter
Mooney, Raymond J.
2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 6934 - 6941
[42] GROUNDED LANGUAGE UNDERSTANDING FOR MANIPULATION INSTRUCTIONS USING GAN-BASED CLASSIFICATION
Sugiura, Komei
Kawai, Hisashi
2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 519 - 524
[43] Understanding Generalization in Neural Networks for Robustness against Adversarial Vulnerabilities
Chaudhury, Subhajit
THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13714 - 13715
[44] Understanding Domain-Size Generalization in Markov Logic Networks
Chen, Florian
Weitkaemper, Felix
Malhotra, Sagar
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT VII, ECML PKDD 2024, 2024, 14947 : 297 - 314
[45] Networks are Slacking Off: Understanding Generalization Problem in Image Deraining
Gu, Jinjin
Ma, Xianzheng
Kong, Xiangtao
Qiao, Yu
Dong, Chao
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
[46] HiddenCut: Simple Data Augmentation for Natural Language Understanding with Better Generalization
Chen, Jiaao
Shen, Dinghan
Chen, Weizhu
Yang, Diyi
59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4380 - 4390
[47] Understanding phase transition in communication networks to enable robust and resilient control
Sarkar, Soumik
Mukherjee, Kushal
Srivastav, Abhishek
Ray, Asok
2009 AMERICAN CONTROL CONFERENCE, VOLS 1-9, 2009, : 1549 - 1554
[48] Gestures Orchestrate Brain Networks for Language Understanding
Skipper, Jeremy I.
Goldin-Meadow, Susan
Nusbaum, Howard C.
Small, Steven L.
CURRENT BIOLOGY, 2009, 19 (08) : 661 - 667
[49] QUATERNION NEURAL NETWORKS FOR SPOKEN LANGUAGE UNDERSTANDING
Parcollet, Titouan
Morchid, Mohamed
Bousquet, Pierre-Michel
Dufour, Richard
Linares, Georges
De Mori, Renato
2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 362 - 368
[50] Design considerations for a hierarchical semantic compositional framework for medical natural language understanding
Taira, Ricky K.
Garlid, Anders O.
Speier, William
PLOS ONE, 2023, 18 (03):

← 1 2 3 4 5 →