Examining the Inductive Bias of Neural Language Models with Artificial Languages

Cited by: 0

Authors:

White, Jennifer C. [1]

Cotterell, Ryan [1, 2]

Affiliations:

[1] Univ Cambridge, Cambridge, England

[2] Swiss Fed Inst Technol, Zurich, Switzerland

Source:

59TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS AND THE 11TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING, VOL 1 (ACL-IJCNLP 2021) | 2021

Keywords:

DOI:

Not available

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Subject classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Since language models are used to model a wide variety of languages, it is natural to ask whether the neural architectures used for the task have inductive biases towards modeling particular types of languages. Investigation of these biases has proved complicated due to the many variables that appear in the experimental setup. Languages vary in many typological dimensions, and it is difficult to single out one or two to investigate without the others acting as confounders. We propose a novel method for investigating the inductive biases of language models using artificial languages. These languages are constructed to allow us to create parallel corpora across languages that differ only in the typological feature being investigated, such as word order. We then use them to train and test language models. This constitutes a fully controlled causal framework, and demonstrates how grammar engineering can serve as a useful tool for analyzing neural models. Using this method, we find that commonly used neural architectures exhibit different inductive biases: LSTMs display little preference with respect to word ordering, while transformers display a clear preference for some orderings over others. Further, we find that neither the inductive bias of the LSTM nor that of the transformer appears to reflect any tendencies that we see in attested natural languages.
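The parallel-corpus idea in the abstract can be illustrated with a minimal sketch. This is a toy under stated assumptions, not the authors' grammars or code: the function names (`realize`, `parallel_corpora`), the single `head_final` switch, and the example words are all hypothetical. It only shows how two corpora can carry identical content while differing in one word-order feature.

```python
# Toy illustration only: the switch name, functions, and vocabulary below are
# hypothetical and are not taken from the paper.

def realize(tree, head_final):
    """Linearize a (subject, verb, object) triple under one word-order switch."""
    subj, verb, obj = tree
    # The switch controls only the order of verb and object inside the verb phrase.
    vp = [obj, verb] if head_final else [verb, obj]
    return " ".join([subj] + vp)

def parallel_corpora(trees):
    """Return two corpora with the same content that differ only in verb-object order."""
    head_initial = [realize(t, head_final=False) for t in trees]
    head_final = [realize(t, head_final=True) for t in trees]
    return head_initial, head_final

trees = [("dog", "sees", "cat"), ("cat", "chases", "bird")]
svo_corpus, sov_corpus = parallel_corpora(trees)
```

Because both corpora are generated from the same underlying triples, any difference in how well a model fits them can be attributed to the switched feature rather than to vocabulary or content.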


Pages: 454-463

Page count: 10

50 items in total

[1] Gender Bias in Masked Language Models for Multiple Languages
Kaneko, Masahiro
Imankulova, Aizhan
Bollegala, Danushka
Okazaki, Naoaki
NAACL 2022: THE 2022 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES, 2022: 2740-2750
[2] Gender Bias in Masked Language Models for Multiple Languages
Kaneko, Masahiro
Imankulova, Aizhan
Bollegala, Danushka
Okazaki, Naoaki
arXiv, 2022
[3] Speaking Multiple Languages Affects the Moral Bias of Language Models
Haemmerl, Katharina
Deiseroth, Bjoern
Schramowski, Patrick
Libovicky, Rich
Rothkopf, Constantin A.
Fraser, Alexander
Kersting, Kristian
FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023, 2023: 2137-2156
[4] Leveraging the Inductive Bias of Large Language Models for Abstract Textual Reasoning
Rytting, Christopher Michael
Wingate, David
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[5] Compositional Neural Network Language Models for Agglutinative Languages
Arisoy, Ebru
Saraclar, Murat
17TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2016), VOLS 1-5: UNDERSTANDING SPEECH PROCESSING IN HUMANS AND MACHINES, 2016: 3494-3498
[6] Investigating representations of verb bias in neural language models
Hawkins, Robert D.
Yamakoshi, Takateru
Griffiths, Thomas L.
Goldberg, Adele E.
PROCEEDINGS OF THE 2020 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING (EMNLP), 2020: 4653-4663
[7] NEURAL NETWORK BASED LANGUAGE MODELS FOR HIGHLY INFLECTIVE LANGUAGES
Mikolov, Tomas
Kopecky, Jiri
Burget, Lukas
Glembek, Ondrej
Cernocky, Jan Honza
2009 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-8, PROCEEDINGS, 2009: 4725-4728
[8] Examining Gender Bias in Languages with Grammatical Gender
Zhou, Pei
Shi, Weijia
Zhao, Jieyu
Huang, Kuan-Hao
Chen, Muhao
Cotterell, Ryan
Chang, Kai-Wei
2019 CONFERENCE ON EMPIRICAL METHODS IN NATURAL LANGUAGE PROCESSING AND THE 9TH INTERNATIONAL JOINT CONFERENCE ON NATURAL LANGUAGE PROCESSING (EMNLP-IJCNLP 2019): PROCEEDINGS OF THE CONFERENCE, 2019: 5276-5284
[9] On the Inductive Bias of Neural Tangent Kernels
Bietti, Alberto
Mairal, Julien
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
[10] Grammatical acquisition: Inductive bias and coevolution of language and the language acquisition device
Briscoe, T
LANGUAGE, 2000, 76(2): 245-296
