Dissecting Recall of Factual Associations in Auto-Regressive Language Models

被引:0
|
作者
Geval, Mor [1 ]
Bastings, Jasmijn [1 ]
Filippoval, Katja [1 ]
Globerson, Amir [2 ,3 ]
机构
[1] Google DeepMind, London, England
[2] Tel Aviv Univ, Tel Aviv, Israel
[3] Google Res, Mountain View, CA USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Transformer-based language models (LMs) are known to capture factual knowledge in their parameters. While previous work looked into where factual associations are stored, only little is known about how they are retrieved internally during inference. We investigate this question through the lens of information flow. Given a subject-relation query, we study how the model aggregates information about the subject and relation to predict the correct attribute. With interventions on attention edges, we first identify two critical points where information propagates to the prediction: one from the relation positions followed by another from the subject positions. Next, by analyzing the information at these points, we unveil a three-step internal mechanism for attribute extraction. First, the representation at the last-subject position goes through an enrichment process, driven by the early MLP sublayers, to encode many subject-related attributes. Second, information from the relation propagates to the prediction. Third, the prediction representation "queries" the enriched subject to extract the attribute. Perhaps surprisingly, this extraction is typically done via attention heads, which often encode subject-attribute mappings in their parameters. Overall, our findings introduce a comprehensive view of how factual associations are stored and extracted internally in LMs, facilitating future research on knowledge localization and editing.1
引用
收藏
页码:12216 / 12235
页数:20
相关论文
共 50 条
  • [41] ONLINE BAYESIAN APNEA-BRADYCARDIA DETECTION USING AUTO-REGRESSIVE MODELS
    Ge, D.
    Carrault, G.
    Hernandez, A. I.
    2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2014,
  • [42] Auto-Regressive Analysis and Simulation of Speech Signal
    Fei, Wan-Chun
    Lu, Xing-Xing
    Jiang, Xiao-Chen
    TEXTILE BIOENGINEERING AND INFORMATICS SYMPOSIUM PROCEEDINGS, VOLS 1 AND 2, 2012, : 887 - 891
  • [43] An auto-regressive model for battery voltage prediction
    Vilsen, Soren B.
    Stroe, Daniel-Ioan
    2021 THIRTY-SIXTH ANNUAL IEEE APPLIED POWER ELECTRONICS CONFERENCE AND EXPOSITION (APEC 2021), 2021, : 2673 - 2680
  • [44] Identification and control of nonlinear systems using PieceWise Auto-Regressive eXogenous models
    Lassoued, Zeineb
    Abderrahim, Kamel
    TRANSACTIONS OF THE INSTITUTE OF MEASUREMENT AND CONTROL, 2019, 41 (14) : 4050 - 4062
  • [45] Fuzzy auto-regressive model and its applications
    Ozawa, K
    Watanabe, T
    Kanke, M
    FIRST INTERNATIONAL CONFERENCE ON KNOWLEDGE-BASED INTELLIGENT ELECTRONIC SYSTEMS, PROCEEDINGS 1997 - KES '97, VOLS 1 AND 2, 1997, : 112 - 117
  • [46] Computerized Wrist Pulse Signal Diagnosis Using Modified Auto-Regressive Models
    Chen, Yinghui
    Zhang, Lei
    Zhang, David
    Zhang, Dongyu
    JOURNAL OF MEDICAL SYSTEMS, 2011, 35 (03) : 321 - 328
  • [47] On the Modeling of Discrete Time Auto-Regressive Representations
    Moysis, Lazaros
    Karampetakis, Nicholas P.
    2014 INTERNATIONAL CONFERENCE ON CONTROL, DECISION AND INFORMATION TECHNOLOGIES (CODIT), 2014, : 381 - 386
  • [48] On the use of Auto-Regressive Modeling for Arrhythmia Detection
    Adnane, Mourad
    Belouchrani, Adel
    2016 24TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO), 2016, : 2410 - 2414
  • [49] Auto-Regressive Models of Non-Stationary Time Series with Finite Length
    费万春
    白伦
    Tsinghua Science and Technology, 2005, (02) : 162 - 168
  • [50] Tracking an Auto-Regressive Process with Limited Communication
    Jinan, Rooji
    Parag, Parimal
    Tyagi, Himanshu
    2020 IEEE INTERNATIONAL SYMPOSIUM ON INFORMATION THEORY (ISIT), 2020, : 2462 - 2467