Mind the Biases: Quantifying Cognitive Biases in Language Model Prompting

Cited by: 0
Authors
Lin, Ruixi [1 ]
Ng, Hwee Tou [1 ]
Affiliations
[1] Natl Univ Singapore, Dept Comp Sci, Singapore, Singapore
Keywords
AVAILABILITY;
DOI
Not available
Chinese Library Classification number
TP18 [Theory of artificial intelligence];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
We advocate the importance of exposing uncertainty on results of language model prompting which display bias modes resembling cognitive biases, and propose to help users grasp the level of uncertainty via simple quantifying metrics. Cognitive biases in the human decision-making process can lead to flawed responses when we face uncertainty. Not surprisingly, we have seen biases in language models resembling cognitive biases as a result of training on biased text, raising dangers in downstream tasks that are centered around people's lives if users trust their results too much. In this work, we reveal two bias modes leveraging cognitive biases when we prompt BERT, accompanied by two bias metrics. On a drug-drug interaction extraction task, our bias measurements reveal an error pattern similar to the availability bias when the labels for training prompts are imbalanced, and show that a toning-down transformation of the drug-drug description in a prompt can elicit a bias similar to the framing effect, warning users to distrust when prompting language models for answers.
Pages: 5269-5281
Number of pages: 13