Compositional Networks Enable Systematic Generalization for Grounded Language Understanding

被引:0
|
作者
Kuo, Yen-Ling [1 ]
Katz, Boris [1 ]
Barbu, Andrei [1 ]
机构
[1] MIT, CSAIL & CBMM, Cambridge, MA 02139 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Humans are remarkably flexible when understanding new sentences that include combinations of concepts they have never encountered before. Recent work has shown that while deep networks can mimic some human language abilities when presented with novel sentences, systematic variation uncovers the limitations in the language-understanding abilities of networks. We demonstrate that these limitations can be overcome by addressing the generalization challenges in the gSCAN dataset, which explicitly measures how well an agent is able to interpret novel linguistic commands grounded in vision, e.g., novel pairings of adjectives and nouns. The key principle we employ is compositionality: that the compositional structure of networks should reflect the compositional structure of the problem domain they address, while allowing other parameters to be learned end-to-end. We build a general-purpose mechanism that enables agents to generalize their language understanding to compositional domains. Crucially, our network has the same state-of-theart performance as prior work while generalizing its knowledge when prior work does not. Our network also provides a level of interpretability that enables users to inspect what each part of networks learns. Robust grounded language understanding without dramatic failures and without corner cases is critical to building safe and fair robots; we demonstrate the significant role that compositionality can play in achieving that goal.
引用
收藏
页码:216 / 226
页数:11
相关论文
共 50 条
  • [41] Improving Grounded Natural Language Understanding through Human-Robot Dialog
    Thomason, Jesse
    Padmakumar, Aishwarya
    Sinapov, Jivko
    Walker, Nick
    Jiang, Yuqian
    Yedidsion, Harel
    Hart, Justin
    Stone, Peter
    Mooney, Raymond J.
    2019 INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2019, : 6934 - 6941
  • [42] GROUNDED LANGUAGE UNDERSTANDING FOR MANIPULATION INSTRUCTIONS USING GAN-BASED CLASSIFICATION
    Sugiura, Komei
    Kawai, Hisashi
    2017 IEEE AUTOMATIC SPEECH RECOGNITION AND UNDERSTANDING WORKSHOP (ASRU), 2017, : 519 - 524
  • [43] Understanding Generalization in Neural Networks for Robustness against Adversarial Vulnerabilities
    Chaudhury, Subhajit
    THIRTY-FOURTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THE THIRTY-SECOND INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE AND THE TENTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2020, 34 : 13714 - 13715
  • [44] Understanding Domain-Size Generalization in Markov Logic Networks
    Chen, Florian
    Weitkaemper, Felix
    Malhotra, Sagar
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: RESEARCH TRACK, PT VII, ECML PKDD 2024, 2024, 14947 : 297 - 314
  • [45] Networks are Slacking Off: Understanding Generalization Problem in Image Deraining
    Gu, Jinjin
    Ma, Xianzheng
    Kong, Xiangtao
    Qiao, Yu
    Dong, Chao
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 36 (NEURIPS 2023), 2023,
  • [46] HiddenCut: Simple Data Augmentation for Natural Language Understanding with Better Generalization
    Chen, Jiaao
    Shen, Dinghan
    Chen, Weizhu
    Yang, Diyi
    59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (ACL-IJCNLP 2021), VOL 1, 2021, : 4380 - 4390
  • [47] Understanding phase transition in communication networks to enable robust and resilient control
    Sarkar, Soumik
    Mukherjee, Kushal
    Srivastav, Abhishek
    Ray, Asok
    2009 AMERICAN CONTROL CONFERENCE, VOLS 1-9, 2009, : 1549 - 1554
  • [48] Gestures Orchestrate Brain Networks for Language Understanding
    Skipper, Jeremy I.
    Goldin-Meadow, Susan
    Nusbaum, Howard C.
    Small, Steven L.
    CURRENT BIOLOGY, 2009, 19 (08) : 661 - 667
  • [49] QUATERNION NEURAL NETWORKS FOR SPOKEN LANGUAGE UNDERSTANDING
    Parcollet, Titouan
    Morchid, Mohamed
    Bousquet, Pierre-Michel
    Dufour, Richard
    Linares, Georges
    De Mori, Renato
    2016 IEEE WORKSHOP ON SPOKEN LANGUAGE TECHNOLOGY (SLT 2016), 2016, : 362 - 368
  • [50] Design considerations for a hierarchical semantic compositional framework for medical natural language understanding
    Taira, Ricky K.
    Garlid, Anders O.
    Speier, William
    PLOS ONE, 2023, 18 (03):