How Can We Know What Language Models Know?

被引:552
|
作者
Jiang, Zhengbao [1 ]
Xu, Frank F. [1 ]
Araki, Jun [2 ]
Neubig, Graham [1 ]
机构
[1] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
[2] Bosch Res North Amer, Palo Alto, CA USA
基金
美国国家科学基金会;
关键词
Recent work has presented intriguing results examining the knowledge contained in language models (LMs) by having the LM fill in the blanks of prompts such as ‘‘Obama is a by profession’’. These prompts are usually manually created; and quite possibly sub-optimal; another prompt such as ‘‘Obama worked as a ’’ may result in more accurately predicting the correct profession. Because of this; given an inappropriate prompt; we might fail to retrieve facts that the LM does know; and thus any given prompt only provides a lower bound estimate of the knowledge contained in an LM. In this paper; we attempt to more accurately estimate the knowledge contained in LMs by automatically discovering better prompts to use in this querying process. Specifically; we propose mining-based and paraphrasing-based methods to automatically generate high-quality and diverse prompts; as well as ensemble methods to combine answers from different prompts. Extensive experiments on the LAMA benchmark for extracting relational knowledge from LMs demonstrate that our methods can improve accuracy from 31.1% to 39.6%; providing a tighter lower bound on what LMs know. We have released the code and the resulting LM Prompt And Query Archive (LPAQA) at https://github.com/jzbjyb/LPAQA. © 2020 Association for Computational Linguistics;
D O I
10.1162/tacl_a_00324
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Recent work has presented intriguing results examining the knowledge contained in language models (LMs) by having the LM fill in the blanks of prompts such as "Obama is a _ by profession''. These prompts are usually manually created, and quite possibly sub-optimal; another prompt such as "Obama worked as a_'' may result in more accurately predicting the correct profession. Because of this, given an inappropriate prompt, we might fail to retrieve facts that the LM does know, and thus any given prompt only provides a lower bound estimate of the knowledge contained in an LM. In this paper, we attempt to more accurately estimate the knowledge contained in LMs by automatically discovering better prompts to use in this querying process. Specifically, we propose mining-based and paraphrasing-based methods to automatically generate high-quality and diverse prompts, as well as ensemble methods to combine answers from different prompts. Extensive experiments on the LAMA benchmark for extracting relational knowledge from LMs demonstrate that our methods can improve accuracy from 31.1% to 39.6%, providing a tighter lower bound on what LMs know. We have released the code and the resulting LM Prompt And Query Archive (LPAQA) at https://github.com/jzbjyb/LPAQA.
引用
收藏
页码:423 / 438
页数:16
相关论文
共 50 条
  • [31] Psychological universals: What are they and how can we know?
    Norenzayan, A
    Heine, SJ
    PSYCHOLOGICAL BULLETIN, 2005, 131 (05) : 763 - 784
  • [32] How can we know what we need to know? Reflections on clinical judgment formation
    Busch, F
    Schmidt-Hellerau, C
    JOURNAL OF THE AMERICAN PSYCHOANALYTIC ASSOCIATION, 2004, 52 (03) : 689 - 707
  • [33] Newborn screening conditions: What we know, what we do not know, and how we will know it
    Levy, Harvey L.
    GENETICS IN MEDICINE, 2010, 12 (12) : S213 - S214
  • [34] WHAT WE KNOW ABOUT CHINA AND HOW WE KNOW IT
    STREGE, PH
    WORLDVIEW, 1976, 19 (12) : 41 - 41
  • [35] EDITORIAL: HOW DO WE KNOW WHAT WE KNOW?
    Fox, Christopher
    TEMPO, 2020, 74 (292) : 3 - 4
  • [36] Signs of sex: what we know and how we know it
    Schurko, Andrew M.
    Neiman, Maurine
    Logsdon, John M., Jr.
    TRENDS IN ECOLOGY & EVOLUTION, 2009, 24 (04) : 208 - 217
  • [37] What can we know?
    Welz, A.
    ZEITSCHRIFT FUR HERZ THORAX UND GEFASSCHIRURGIE, 2019, 33 (04): : 229 - 230
  • [38] Quasicrystals. what do we know - what can we know?
    Steurer, Walter
    ACTA CRYSTALLOGRAPHICA A-FOUNDATION AND ADVANCES, 2017, 73 : C142 - C142
  • [39] How to Succeed as an Athlete: What We Know, What We Need to Know
    Foster, Carl
    Barroso, Renato
    Beneke, Ralph
    Bok, Daniel
    Boullosa, Daniel
    Casado, Arturo
    Chamari, Karim
    Cortis, Cristina
    de Koning, Jos
    Fusco, Andrea
    Haugen, Thomas
    Lucia, Alejandro
    Mujika, Inigo
    Pyne, David
    Rodriguez-Marroyo, Jose A.
    Sandbakk, Oyvind
    Seiler, Stephen
    INTERNATIONAL JOURNAL OF SPORTS PHYSIOLOGY AND PERFORMANCE, 2022, 17 (03) : 333 - 334
  • [40] Assays and applications in warfarin metabolism: what we know, how we know it and what we need to know
    Jones, Drew R.
    Miller, Grover P.
    EXPERT OPINION ON DRUG METABOLISM & TOXICOLOGY, 2011, 7 (07) : 857 - 874