Influence of prior probability information on large language model performance in radiological diagnosis

Authors
Fukushima, Takahiro [1]
Kurokawa, Ryo [1]
Hagiwara, Akifumi [1]
Sonoda, Yuki [1]
Asari, Yusuke [1]
Kurokawa, Mariko [1]
Kanzawa, Jun [1]
Gonoi, Wataru [1]
Abe, Osamu [1]
Affiliation
[1] Univ Tokyo, Grad Sch Med, Dept Radiol, 7-3-1 Hongo,Bunkyo Ku, Tokyo 1138655, Japan
Keywords
Large language model; Artificial intelligence; Claude 3.5 Sonnet; Bayes' theorem
DOI
10.1007/s11604-025-01743-3
Chinese Library Classification
R8 [Special medicine]; R445 [Diagnostic imaging]
Subject classification codes
1002; 100207; 1009
Abstract
Purpose: Large language models (LLMs) show promise in radiological diagnosis, but their performance may be affected by the context in which cases are presented. This study investigates how providing information about prior probabilities influences the diagnostic performance of an LLM on radiological quiz cases.

Materials and methods: We analyzed 322 consecutive cases from Radiology's "Diagnosis Please" quiz using Claude 3.5 Sonnet under three conditions: without context (Condition 1), informed that the cases were quiz cases (Condition 2), and presented as primary care cases (Condition 3). Diagnostic accuracy was compared using McNemar's test.

Results: Overall accuracy improved significantly in Condition 2 compared with Condition 1 (70.2% vs. 64.9%, p = 0.029). Conversely, accuracy decreased significantly in Condition 3 compared with Condition 1 (59.9% vs. 64.9%, p = 0.027).

Conclusions: Providing information that may influence prior probabilities significantly affects the diagnostic performance of the LLM in radiological cases. This suggests that LLMs may incorporate Bayesian-like principles and adjust the weighting of their diagnostic responses based on prior information, highlighting the potential for optimizing LLM performance in clinical settings by providing relevant contextual information.
Pages: 6
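
The abstract states that accuracy was compared across conditions with McNemar's test, which applies because the same cases were answered under each condition and only the discordant pairs carry information. The following is a minimal Python sketch of the exact (binomial) form of that test. It is an illustration only: the record does not say whether the authors used the exact or the chi-square variant, the function names `discordant_counts` and `mcnemar_exact` are hypothetical, and the toy answer lists are invented for demonstration and are not the study's data.

```python
from math import comb


def discordant_counts(correct_a, correct_b):
    """Count discordant pairs between two paired lists of booleans.

    Returns (b, c): b = correct under A only, c = correct under B only.
    """
    b = sum(1 for x, y in zip(correct_a, correct_b) if x and not y)
    c = sum(1 for x, y in zip(correct_a, correct_b) if not x and y)
    return b, c


def mcnemar_exact(b, c):
    """Two-sided exact McNemar p-value from the discordant counts.

    Under the null hypothesis, each discordant pair is equally likely
    to fall in either direction, so min(b, c) follows Binomial(b + c, 0.5).
    """
    n = b + c
    if n == 0:
        return 1.0  # no discordant pairs: no evidence of a difference
    k = min(b, c)
    tail = sum(comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)


# Toy data (hypothetical, NOT from the study): per-case correctness
# for the same eight cases under two prompting conditions.
no_context = [True, True, False, False, True, False, False, False]
quiz_context = [True, True, True, True, True, True, False, True]

b, c = discordant_counts(no_context, quiz_context)
p_value = mcnemar_exact(b, c)
print(b, c, p_value)
```

Applying this to the study would require the per-case correctness under each condition; the reported accuracy rates alone do not determine the discordant counts.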