Feasibility of GPT-3 and GPT-4 for in-Depth Patient Education Prior to Interventional Radiological Procedures: A Comparative Analysis

被引:17
|
作者
Scheschenja, Michael [1 ]
Viniol, Simon [1 ]
Bastian, Moritz B. [1 ]
Wessendorf, Joel [1 ]
Koenig, Alexander M. [1 ]
Mahnken, Andreas H. [1 ]
机构
[1] Philipps Univ Marburg, Univ Hosp Marburg, Dept Diagnost & Intervent Radiol, Baldingerstr 1, D-35043 Marburg, Germany
关键词
Artificial intelligence; Patient education; Interventional radiology; Chat-GPT; Large language models;
D O I
10.1007/s00270-023-03563-2
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
PurposeThis study explores the utility of the large language models, GPT-3 and GPT-4, for in-depth patient education prior to interventional radiology procedures. Further, differences in answer accuracy between the models were assessed.Materials and methodsA total of 133 questions related to three specific interventional radiology procedures (Port implantation, PTA and TACE) covering general information as well as preparation details, risks and complications and post procedural aftercare were compiled. Responses of GPT-3 and GPT-4 were assessed for their accuracy by two board-certified radiologists using a 5-point Likert scale. The performance difference between GPT-3 and GPT-4 was analyzed.ResultsBoth GPT-3 and GPT-4 responded with (5) "completely correct" (4) "very good" answers for the majority of questions ((5) 30.8% + (4) 48.1% for GPT-3 and (5) 35.3% + (4) 47.4% for GPT-4). GPT-3 and GPT-4 provided (3) "acceptable" responses 15.8% and 15.0% of the time, respectively. GPT-3 provided (2) "mostly incorrect" responses in 5.3% of instances, while GPT-4 had a lower rate of such occurrences, at just 2.3%. No response was identified as potentially harmful. GPT-4 was found to give significantly more accurate responses than GPT-3 (p = 0.043).ConclusionGPT-3 and GPT-4 emerge as relatively safe and accurate tools for patient education in interventional radiology. GPT-4 showed a slightly better performance. The feasibility and accuracy of these models suggest their promising role in revolutionizing patient care. Still, users need to be aware of possible limitations.
引用
收藏
页码:245 / 250
页数:6
相关论文
共 43 条
  • [1] Feasibility of GPT-3 and GPT-4 for in-Depth Patient Education Prior to Interventional Radiological Procedures: A Comparative Analysis
    Michael Scheschenja
    Simon Viniol
    Moritz B. Bastian
    Joel Wessendorf
    Alexander M. König
    Andreas H. Mahnken
    CardioVascular and Interventional Radiology, 2024, 47 : 245 - 250
  • [2] Investigating the Perception of the Future in GPT-3,-3.5 and GPT-4
    Kozachek, Diana
    2023 PROCEEDINGS OF THE 15TH CONFERENCE ON CREATIVITY AND COGNITION, C&C 2023, 2023, : 282 - 287
  • [3] From GPT-3 to GPT-4: On the Evolving Efficacy of LLMs to Answer Multiple-Choice Questions for Programming Classes in Higher Education
    Savelka, Jaromir
    Agarwal, Arav
    Bogart, Christopher
    Sakr, Majd
    COMPUTER SUPPORTED EDUCATION, CSEDU 2023, 2024, 2052 : 160 - 182
  • [4] Assessing GPT-4 multimodal performance in radiological image analysis
    Brin, Dana
    Sorin, Vera
    Barash, Yiftach
    Konen, Eli
    Glicksberg, Benjamin S.
    Nadkarni, Girish N.
    Klang, Eyal
    EUROPEAN RADIOLOGY, 2025, 35 (04) : 1959 - 1965
  • [5] Cognitive Network Science Reveals Bias in GPT-3, GPT-3.5 Turbo, and GPT-4 Mirroring Math Anxiety in High-School Students
    Abramski, Katherine
    Citraro, Salvatore
    Lombardi, Luigi
    Rossetti, Giulio
    Stella, Massimo
    BIG DATA AND COGNITIVE COMPUTING, 2023, 7 (03)
  • [6] Large language models and bariatric surgery patient education: a comparative readability analysis of GPT-3.5, GPT-4, Bard, and online institutional resources
    Srinivasan, Nitin
    Samaan, Jamil S.
    Rajeev, Nithya D.
    Kanu, Mmerobasi U.
    Yeo, Yee Hui
    Samakar, Kamran
    SURGICAL ENDOSCOPY AND OTHER INTERVENTIONAL TECHNIQUES, 2024, 38 (05): : 2522 - 2532
  • [7] Large language models and bariatric surgery patient education: a comparative readability analysis of GPT-3.5, GPT-4, Bard, and online institutional resources
    Nitin Srinivasan
    Jamil S. Samaan
    Nithya D. Rajeev
    Mmerobasi U. Kanu
    Yee Hui Yeo
    Kamran Samakar
    Surgical Endoscopy, 2024, 38 : 2522 - 2532
  • [8] Assessing Generative Pretrained Transformers (GPT) in Clinical Decision-Making: Comparative Analysis of GPT-3.5 and GPT-4
    Lahat, Adi
    Sharif, Kassem
    Zoabi, Narmin
    Patt, Yonatan Shneor
    Sharif, Yousra
    Fisher, Lior
    Shani, Uria
    Arow, Mohamad
    Levin, Roni
    Klang, Eyal
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2024, 26
  • [9] INTERVENTIONAL NEPHROLOGY ASSESSMENT QUESTIONS: A PERFORMANCE EVALUATION AND COMPARATIVE ANALYSIS OF CHATGPT-3.5 AND GPT-4
    Sheikh, Mohammad
    Qureshi, Fawad
    Thongprayoon, Charat
    Suarez, Lourdes Gonzalez
    Craici, Lasmina
    Cheungpasitporn, Visit
    AMERICAN JOURNAL OF KIDNEY DISEASES, 2024, 83 (04) : S100 - S101
  • [10] Comparative analysis of GPT-4, Gemini, and Ernie as gloss sign language translators in special education
    Achraf Othman
    Khansa Chemnad
    Ahmed Tlili
    Ting Da
    Huanhuan Wang
    Ronghuai Huang
    Discover Global Society, 2 (1):