The Grammar-Learning Trajectories of Neural Language Models

被引:0
|
作者
Choshen, Leshem [1 ]
Hacohen, Guy [1 ,2 ]
Weinshall, Daphna [1 ]
Abend, Omri [1 ]
机构
[1] Hebrew Univ Jerusalem, Dept Comp Sci, Jerusalem, Israel
[2] Hebrew Univ Jerusalem, Dept Brain Sci, Jerusalem, Israel
基金
以色列科学基金会;
关键词
ACQUISITION; PERCEPTION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The learning trajectories of linguistic phenomena in humans provide insight into linguistic representation, beyond what can be gleaned from inspecting the behavior of an adult speaker. To apply a similar approach to analyze neural language models (NLM), it is first necessary to establish that different models are similar enough in the generalizations they make. In this paper, we show that NLMs with different initialization, architecture, and training data acquire linguistic phenomena in a similar order, despite their different end performance. These findings suggest that there is some mutual inductive bias that underlies these models' learning of linguistic phenomena. Taking inspiration from psycholinguistics, we argue that studying this inductive bias is an opportunity to study the linguistic representation implicit in NLMs. Leveraging these findings, we compare the relative performance on different phenomena at varying learning stages with simpler reference models. Results suggest that NLMs exhibit consistent "developmental" stages. Moreover, we find the learning trajectory to be approximately one-dimensional: given an NLM with a certain overall performance, it is possible to predict what linguistic generalizations it has already acquired. Initial analysis of these stages presents phenomena clusters (notably morphological ones), whose performance progresses in unison, suggesting a potential link between the generalizations behind them.
引用
收藏
页码:8281 / 8297
页数:17
相关论文
共 50 条
  • [41] LEARNING FOREIGN LANGUAGE GRAMMAR BASED ON ARTISTIC TEXT
    Hierkierova, O. M.
    Burlak, Y., V
    SCIENCE AND EDUCATION, 2007, (3-4): : 122 - 124
  • [42] CONTRASTIVE GRAMMAR AS LEARNING STRATEGY IN RUSSIAN LANGUAGE MANUALS
    Ciesielkiewicz, Monika
    CUADERNOS DE RUSISTICA ESPANOLA, 2009, 5 : 157 - 167
  • [43] Evaluation of spoken language grammar learning in the ATIS domain
    Wang, YY
    Acero, A
    2002 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS I-IV, PROCEEDINGS, 2002, : 41 - 44
  • [44] The association between statistical learning and the development of second language grammar learning
    Chen, Yao
    Li, Li
    Wang, Mengxing
    Wang, Ruiming
    APPLIED COGNITIVE PSYCHOLOGY, 2023, 37 (05) : 1027 - 1036
  • [45] The Castilian Grammar of Antonio de Nebrija: Grammar of a language, language of a grammar
    Ridruejo, Emilio
    REVUE DE LINGUISTIQUE ROMANE, 2015, 79 (315): : 542 - 548
  • [46] A-Grammar: Mobile Learning Foundation of Arabic Grammar Language with Multimedia Aided Approach
    Yusof, Suhailah Mohd
    Shalan, Siti Nur Shuhada
    Almuddin, Syahirah
    Primsuwan, Phaveena
    Tahir, Norlizawati Md
    Abd Ghani, Rosmaiza
    2015 INTERNATIONAL SYMPOSIUM ON MATHEMATICAL SCIENCES AND COMPUTING RESEARCH (ISMSC), 2015, : 30 - 35
  • [47] Joint pairwise learning and masked language models for neural machine translation of English
    Yang, Shuhan
    Yang, Qun
    ARTIFICIAL LIFE AND ROBOTICS, 2025,
  • [48] Recurrent Neural Networks and Machine Learning Models Applied in Sign Language Recognition
    Novillo Quinde, Esteban Gustavo
    Saldana Torres, Juan Pablo
    Alvarez Valdez, Michael Andres
    Llivicota Leon, John Santiago
    Hurtado Ortiz, Remigio Ismael
    PROCEEDINGS OF NINTH INTERNATIONAL CONGRESS ON INFORMATION AND COMMUNICATION TECHNOLOGY, ICICT 2024, VOL 3, 2024, 1013 : 615 - 624
  • [49] Neural Language Models and Few Shot Learning for Systematic Requirements Processing in MDSE
    Bertram, Vincent
    Boss, Miriam
    Kusmenko, Evgeny
    Nachmann, Imke Helene
    Rumpe, Bernhard
    Trotta, Danilo
    Wachtmeister, Louis
    PROCEEDINGS OF THE 15TH ACM SIGPLAN INTERNATIONAL CONFERENCE ON SOFTWARE LANGUAGE ENGINEERING, SLE 2022, 2022, : 260 - 265
  • [50] Learning Trajectories of Hamiltonian Systems with Neural Networks
    Haitsiukevich, Katsiaryna
    Ilin, Alexander
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING - ICANN 2022, PT I, 2022, 13529 : 562 - 573