Mind the Biases: Quantifying Cognitive Biases in Language Model Prompting

被引：0

作者：

Lin, Ruixi ^{[1
]}

Ng, Hwee Tou ^{[1
]}

机构：

[1] Natl Univ Singapore, Dept Comp Sci, Singapore, Singapore

来源：

FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, ACL 2023 | 2023年

关键词：

AVAILABILITY;

D O I：

暂无

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We advocate the importance of exposing uncertainty on results of language model prompting which display bias modes resembling cognitive biases, and propose to help users grasp the level of uncertainty via simple quantifying metrics. Cognitive biases in the human decision making process can lead to flawed responses when we face uncertainty. Not surprisingly, we have seen biases in language models resembling cognitive biases as a result of training on biased text, raising dangers in downstream tasks that are centered around people's lives if users trust their results too much. In this work, we reveal two bias modes leveraging cognitive biases when we prompt BERT, accompanied by two bias metrics. On a drug-drug interaction extraction task, our bias measurements reveal an error pattern similar to the availability bias when the labels for training prompts are imbalanced, and show that a toning-down transformation of the drug-drug description in a prompt can elicit a bias similar to the framing effect, warning users to distrust when prompting language models for answers.(1)

引用

页码：5269 / 5281

页数：13

共 50 条

[11] Cognitive biases in children
Pry, Rene
ENFANCE, 2022, (02) : 291 - 296
[12] UNDERSTANDING COGNITIVE BIASES
TEIGEN, KH
INTERNATIONAL JOURNAL OF PSYCHOLOGY, 1992, 27 (3-4) : 172 - 172
[13] Cognitive biases and mindfulness
Maymin, Philip Z.
Langer, Ellen J.
HUMANITIES & SOCIAL SCIENCES COMMUNICATIONS, 2021, 8 (01):
[14] Cognitive biases and mindfulness
Philip Z. Maymin
Ellen J. Langer
Humanities and Social Sciences Communications, 8
[15] COGNITIVE BIASES AND DEPRESSION
DOHR, KB
RUSH, AJ
BERNSTEIN, IH
JOURNAL OF ABNORMAL PSYCHOLOGY, 1989, 98 (03) : 263 - 267
[16] Measuring and mitigating language model biases in abusive language detection
Song, Rui
Giunchiglia, Fausto
Li, Yingji
Shi, Lida
Xu, Hao
INFORMATION PROCESSING & MANAGEMENT, 2023, 60 (03)
[17] A cognitive modeling approach to learning and using reference biases in language
Toth, Abigail G. G.
Hendriks, Petra
Taatgen, Niels A. A.
van Rij, Jacolien
FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2022, 5
[18] Quantifying participation biases on social media
Pokhriyal, Neeti
Valentino, Benjamin A.
Vosoughi, Soroush
EPJ DATA SCIENCE, 2023, 12 (01)
[19] Quantifying Biases in Online Information Exposure
Nikolov, Dimitar
Lalmas, Mounia
Flammini, Alessandro
Menczer, Filippo
JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2019, 70 (03) : 218 - 229
[20] Quantifying participation biases on social media
Neeti Pokhriyal
Benjamin A. Valentino
Soroush Vosoughi
EPJ Data Science, 12

← 1 2 3 4 5 →