Examining the Inductive Bias of Neural Language Models with Artificial Languages

被引:0
|
作者
White, Jennifer C. [1 ]
Cotterell, Ryan [1 ,2 ]
机构
[1] Univ Cambridge, Cambridge, England
[2] Swiss Fed Inst Technol, Zurich, Switzerland
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Since language models are used to model a wide variety of languages, it is natural to ask whether the neural architectures used for the task have inductive biases towards modeling particular types of languages. Investigation of these biases has proved complicated due to the many variables that appear in the experimental setup. Languages vary in many typological dimensions, and it is difficult to single out one or two to investigate without the others acting as confounders. We propose a novel method for investigating the inductive biases of language models using artificial languages. These languages are constructed to allow us to create parallel corpora across languages that differ only in the typological feature being investigated, such as word order. We then use them to train and test language models. This constitutes a fully controlled causal framework, and demonstrates how grammar engineering can serve as a useful tool for analyzing neural models. Using this method, we find that commonly used neural architectures exhibit different inductive biases: LSTMs display little preference with respect to word ordering, while transformers display a clear preference for some orderings over others. Further, we find that neither the inductive bias of the LSTM nor that of the transformer appears to reflect any tendencies that we see in attested natural languages.
引用
收藏
页码:454 / 463
页数:10
相关论文
共 50 条
  • [1] Gender Bias in Masked Language Models for Multiple Languages
    Kaneko, Masahiro
    Imankulova, Aizhan
    Bollegala, Danushka
    Okazaki, Naoaki
    NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022, : 2740 - 2750
  • [2] Gender Bias in Masked Language Models for Multiple Languages
    Kaneko, Masahiro
    Imankulova, Aizhan
    Bollegala, Danushka
    Okazaki, Naoaki
    arXiv, 2022,
  • [3] Speaking Multiple Languages Affects the Moral Bias of Language Models
    Haemmerl, Katharina
    Deiseroth, Bjoern
    Schramowski, Patrick
    Libovicky, Rich
    Rothkopf, Constantin A.
    Fraser, Alexander
    Kersting, Kristian
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023, : 2137 - 2156
  • [4] Leveraging the Inductive Bias of Large Language Models for Abstract Textual Reasoning
    Rytting, Christopher Michael
    Wingate, David
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [5] Compositional Neural Network Language Models for Agglutinative Languages
    Arisoy, Ebru
    Saraclar, Murat
    17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016, : 3494 - 3498
  • [6] Investigating representations of verb bias in neural language models
    Hawkins, Robert D.
    Yamakoshi, Takateru
    Griffiths, Thomas L.
    Goldberg, Adele E.
    PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020, : 4653 - 4663
  • [7] NEURAL NETWORK BASED LANGUAGE MODELS FOR HIGHLY INFLECTIVE LANGUAGES
    Mikolov, Tomas
    Kopecky, Jiri
    Burget, Lukas
    Glembek, Ondrej
    Cernocky, Jan Honza
    2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1- 8, PROCEEDINGS, 2009, : 4725 - 4728
  • [8] Examining Gender Bias in Languages with Grammatical Gender
    Zhou, Pei
    Shi, Weijia
    Zhao, Jieyu
    Huang, Kuan-Hao
    Chen, Muhao
    Cotterell, Ryan
    Chang, Kai-Wei
    2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019, : 5276 - 5284
  • [9] On the Inductive Bias of Neural Tangent Kernels
    Bietti, Alberto
    Mairal, Julien
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [10] Grammatical acquisition: Inductive bias and coevolution of language and the language acquisition device
    Briscoe, T
    LANGUAGE, 2000, 76 (02) : 245 - 296