Improving Compositional Generalization using Iterated Learning and Simplicial Embeddings

被引:0
|
作者
Ren, Yi [1 ]
Lavoie, Samuel [2 ,3 ]
Galkin, Mikhail [4 ]
Sutherland, Danica J. [1 ,5 ]
Courville, Aaron [2 ,3 ]
机构
[1] Univ British Columbia, Vancouver, BC, Canada
[2] Univ Montreal, Montreal, PQ, Canada
[3] Mila, Montreal, PQ, Canada
[4] Intel AI Lab, San Diego, CA USA
[5] Amii, Edmonton, AB, Canada
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Compositional generalization, the ability of an agent to generalize to unseen combinations of latent factors, is easy for humans but hard for deep neural networks. A line of research in cognitive science has hypothesized a process, "iterated learning," to help explain how human language developed this ability; the theory rests on simultaneous pressures towards compressibility (when an ignorant agent learns from an informed one) and expressivity (when it uses the representation for downstream tasks). Inspired by this process, we propose to improve the compositional generalization of deep networks by using iterated learning on models with simplicial embeddings, which can approximately discretize representations. This approach is further motivated by an analysis of compositionality based on Kolmogorov complexity. We show that this combination of changes improves compositional generalization over other approaches, demonstrating these improvements both on vision tasks with well-understood latent factors and on real molecular graph prediction tasks where the latent structure is unknown.
引用
收藏
页数:26
相关论文
共 50 条
  • [1] Learning to Substitute Spans towards Improving Compositional Generalization
    Li, Zhaoyi
    Wei, Ying
    Lian, Defu
    PROCEEDINGS OF THE 61ST ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, VOL 1, 2023, : 2791 - 2811
  • [2] Improving Compositional Generalization in Semantic Parsing
    Oren, Inbar
    Herzig, Jonathan
    Gupta, Nitish
    Gardner, Matt
    Berant, Jonathan
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EMNLP 2020, 2020, : 2482 - 2495
  • [3] Learning to Compose Representations of Different Encoder Layers towards Improving Compositional Generalization
    Lin, Lei
    Li, Shuangtao
    Zheng, Yafang
    Fu, Biao
    Liu, Shan
    Chen, Yidong
    Shi, Xiaodong
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 1599 - 1614
  • [4] Curriculum learning for human compositional generalization
    Dekker, Ronald B.
    Otto, Fabian
    Summerfield, Christopher
    PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2022, 119 (41)
  • [5] Learning Algebraic Recombination for Compositional Generalization
    Liu, Chenyao
    An, Shengnan
    Lin, Zeqi
    Liu, Qian
    Chen, Bei
    Lou, Jian-Guang
    Wen, Lijie
    Zheng, Nanning
    Zhang, Dongmei
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL-IJCNLP 2021, 2021, : 1129 - 1144
  • [6] Adaptive Joint Learning of Compositional and Non-Compositional Phrase Embeddings
    Hashimoto, Kazuma
    Tsuruoka, Yoshimasa
    PROCEEDINGS OF THE 54TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 1, 2016, : 205 - 215
  • [7] Using Compositional Embeddings for Fact Checking
    da Silva, Ana Alexandra Morim
    Roeder, Michael
    Ngomo, Axel-Cyrille Ngonga
    SEMANTIC WEB - ISWC 2021, 2021, 12922 : 270 - 286
  • [8] Learning Graph Embeddings for Compositional Zero-shot Learning
    Naeem, Muhammad Ferjad
    Xian, Yongqin
    Tombari, Federico
    Akata, Zeynep
    2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 953 - 962
  • [9] Improving Compositional Generalization with Latent Structure and Data Augmentation
    Qiu, Linlu
    Shaw, Peter
    Pasupat, Panupong
    Nowak, Pawel Krzysztof
    Linzen, Tal
    Sha, Fei
    Toutanova, Kristina
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 4341 - 4362
  • [10] Disentangled Sequence to Sequence Learning for Compositional Generalization
    Zheng, Hao
    Lapata, Mirella
    PROCEEDINGS OF THE 60TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS (ACL 2022), VOL 1: (LONG PAPERS), 2022, : 4256 - 4268