Influence of prior probability information on large language model performance in radiological diagnosis

被引:0
|
作者
Fukushima, Takahiro [1 ]
Kurokawa, Ryo [1 ]
Hagiwara, Akifumi [1 ]
Sonoda, Yuki [1 ]
Asari, Yusuke [1 ]
Kurokawa, Mariko [1 ]
Kanzawa, Jun [1 ]
Gonoi, Wataru [1 ]
Abe, Osamu [1 ]
机构
[1] Univ Tokyo, Grad Sch Med, Dept Radiol, 7-3-1 Hongo,Bunkyo Ku, Tokyo 1138655, Japan
关键词
Large language model; Artificial intelligence; Claude; 3.5; Sonnet; Bayes' theorem;
D O I
10.1007/s11604-025-01743-3
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
PurposeLarge language models (LLMs) show promise in radiological diagnosis, but their performance may be affected by the context of the cases presented. Our purpose is to investigate how providing information about prior probabilities influences the diagnostic performance of an LLM in radiological quiz cases.Materials and methodsWe analyzed 322 consecutive cases from Radiology's "Diagnosis Please" quiz using Claude 3.5 Sonnet under three conditions: without context (Condition 1), informed as quiz cases (Condition 2), and presented as primary care cases (Condition 3). Diagnostic accuracy was compared using McNemar's test.ResultsThe overall accuracy rate significantly improved in Condition 2 compared to Condition 1 (70.2% vs. 64.9%, p = 0.029). Conversely, the accuracy rate significantly decreased in Condition 3 compared to Condition 1 (59.9% vs. 64.9%, p = 0.027).ConclusionsProviding information that may influence prior probabilities significantly affects the diagnostic performance of the LLM in radiological cases. This suggests that LLMs may incorporate Bayesian-like principles and adjust the weighting of their diagnostic responses based on prior information, highlighting the potential for optimizing LLM's performance in clinical settings by providing relevant contextual information.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Feasibility of Differential Diagnosis Based on Imaging Patterns Using a Large Language Model
    Kottlors, Jonathan
    Bratke, Grischa
    Rauen, Philip
    Kabbasch, Christoph
    Persigehl, Thorsten
    Schlamann, Marc
    Lennartz, Simon
    RADIOLOGY, 2023, 308 (01)
  • [42] Large language model uncertainty proxies: discrimination and calibration for medical diagnosis and treatment
    Savage, Thomas
    Wang, John
    Gallo, Robert
    Boukil, Abdessalem
    Patel, Vishwesh
    Safavi-Naini, Seyed Amir Ahmad
    Soroush, Ali
    Chen, Jonathan H.
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024, 32 (01) : 139 - 149
  • [43] FD-LLM: Large language model for fault diagnosis of complex equipment
    Lin, Lin
    Zhang, Sihao
    Fu, Song
    Liu, Yikun
    ADVANCED ENGINEERING INFORMATICS, 2025, 65
  • [44] Construction and preliminary application of large language model for reservoir performance analysis
    Pan, Huanquan
    Liu, Jianqiao
    Gong, Bin
    Zhu, Yiheng
    Bai, Junhui
    Huang, Hu
    Fang, Zhengbao
    Jing, Hongbin
    Liu, Chen
    Kuang, Tie
    Lan, Yubo
    Wang, Tianzhi
    Xie, Tian
    Cheng, Mingzhe
    Qin, Bin
    Shen, Yujiang
    PETROLEUM EXPLORATION AND DEVELOPMENT, 2024, 51 (05) : 1357 - 1366
  • [45] Construction and preliminary application of large language model for reservoir performance analysis
    PAN Huanquan
    LIU Jianqiao
    GONG Bin
    ZHU Yiheng
    BAI Junhui
    HUANG Hu
    FANG Zhengbao
    JING Hongbin
    LIU Chen
    KUANG Tie
    LAN Yubo
    WANG Tianzhi
    XIE Tian
    CHENG Mingzhe
    QIN Bin
    SHEN Yujiang
    Petroleum Exploration and Development, 2024, 51 (05) : 1357 - 1366
  • [46] Performance of a Large Language Model on Practice Questions for the Neonatal Board Examination
    Beam, Kristyn
    Sharma, Puneet
    Kumar, Bhawesh
    Wang, Cindy
    Brodsky, Dara
    Martin, Camilia R.
    Beam, Andrew
    JAMA PEDIATRICS, 2023, 177 (09) : 977 - 979
  • [47] Response Performance Evaluations of ChatGPT Models on Large Language Model Frameworks
    Kaplan, Alper
    Sayan, Ismail Utku
    Saban, Huseyin
    Begen, Emre
    Bayrak, Ahmet Tugrul
    32ND IEEE SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE, SIU 2024, 2024,
  • [48] Performance of the ChatGPT large language model for decision support in community pharmacy
    Shin, Euibeom
    Hartman, Maggie
    Ramanathan, Murali
    BRITISH JOURNAL OF CLINICAL PHARMACOLOGY, 2024, 90 (12) : 3320 - 3333
  • [49] A Mumford-Shah Functional based Variational Model with Contour, Shape, and Probability Prior information for Prostate Segmentation
    Ghose, S.
    Mitra, J.
    Oliver, A.
    Marti, R.
    Llado, X.
    Freixenet, J.
    Vilanova, J. C.
    Comet, J.
    Sidibe, D.
    Meriaudeau, F.
    2012 21ST INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR 2012), 2012, : 121 - 124
  • [50] MAPO: Boosting Large Language Model Performance with Model-Adaptive Prompt Optimization
    Chen, Yuyan
    Wen, Zhihao
    Fan, Ge
    Chen, Zhengyu
    Wu, Wei
    Liu, Dayiheng
    Li, Zhixu
    Liu, Bang
    Xiao, Yanghua
    FINDINGS OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS - EMNLP 2023, 2023, : 3279 - 3304