Mind the Biases: Quantifying Cognitive Biases in Language Model Prompting

被引:0
|
作者
Lin, Ruixi [1 ]
Ng, Hwee Tou [1 ]
机构
[1] Natl Univ Singapore, Dept Comp Sci, Singapore, Singapore
关键词
AVAILABILITY;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We advocate the importance of exposing uncertainty on results of language model prompting which display bias modes resembling cognitive biases, and propose to help users grasp the level of uncertainty via simple quantifying metrics. Cognitive biases in the human decision making process can lead to flawed responses when we face uncertainty. Not surprisingly, we have seen biases in language models resembling cognitive biases as a result of training on biased text, raising dangers in downstream tasks that are centered around people's lives if users trust their results too much. In this work, we reveal two bias modes leveraging cognitive biases when we prompt BERT, accompanied by two bias metrics. On a drug-drug interaction extraction task, our bias measurements reveal an error pattern similar to the availability bias when the labels for training prompts are imbalanced, and show that a toning-down transformation of the drug-drug description in a prompt can elicit a bias similar to the framing effect, warning users to distrust when prompting language models for answers.(1)
引用
收藏
页码:5269 / 5281
页数:13
相关论文
共 50 条
  • [1] The cognitive biases of cognitive biases
    Douros, George
    EMERGENCY MEDICINE AUSTRALASIA, 2021, 33 (02) : 372 - 374
  • [2] Quantifying cognitive biases in analyst earnings forecasts
    Friesen, Geoffrey
    Weller, Paul A.
    JOURNAL OF FINANCIAL MARKETS, 2006, 9 (04) : 333 - 365
  • [3] Biases in Predicting the Human Language Model
    Fine, Alex B.
    Frank, Austin F.
    Jaeger, T. Florian
    Van Durme, Benjamin
    PROCEEDINGS OF THE 52ND ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, VOL 2, 2014, : 7 - 12
  • [4] Benchmarking Cognitive Biases in Large Language Models as Evaluators
    Koo, Ryan
    Lee, Minhwa
    Raheja, Vipul
    Park, Jongin
    Kim, Zae Myung
    Kang, Dongyeop
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: ACL 2024, 2024, : 517 - 545
  • [5] (Ir)rationality and cognitive biases in large language models
    Macmillan-Scott, Olivia
    Musolesi, Mirco
    ROYAL SOCIETY OPEN SCIENCE, 2024, 11 (06):
  • [6] Evaluation and mitigation of cognitive biases in medical language models
    Schmidgall, Samuel
    Harris, Carl
    Essien, Ime
    Olshvang, Daniel
    Rahman, Tawsifur
    Kim, Ji Woong
    Ziaei, Rojin
    Eshraghian, Jason
    Abadir, Peter
    Chellappa, Rama
    NPJ DIGITAL MEDICINE, 2024, 7 (01):
  • [7] The Computations of hostile biases (CHB) model: Grounding hostility biases in a unified cognitive framework
    Smeijers, Danique
    Bulten, Erik B. H.
    Brazil, Inti A.
    CLINICAL PSYCHOLOGY REVIEW, 2019, 73
  • [8] Cognitive Biases in Search A Review and Reflection of Cognitive Biases in Information Retrieval
    Azzopardi, Leif
    CHIIR '21: PROCEEDINGS OF THE 2021 CONFERENCE ON HUMAN INFORMATION INTERACTION AND RETRIEVAL, 2021, : 27 - 37
  • [9] Burning biases: Mitigating cognitive biases in fire engineering
    Kinsey, Michael J.
    Kinateder, Max
    Gwynne, Steven M., V
    Hopkin, Danny
    FIRE AND MATERIALS, 2021, 45 (04) : 543 - 552
  • [10] Cognitive Biases in Crowdsourcing
    Eickhoff, Carsten
    WSDM'18: PROCEEDINGS OF THE ELEVENTH ACM INTERNATIONAL CONFERENCE ON WEB SEARCH AND DATA MINING, 2018, : 162 - 170