Assessing Ability for ChatGPT to Answer Total Knee Arthroplasty-Related Questions

Cited by: 7
Authors
Magruder, Matthew L. [1 ]
Rodriguez, Ariel N. [1 ]
Wong, Jason C. J. [1 ]
Erez, Orry [1 ]
Piuzzi, Nicolas S. [2 ]
Scuderi, Gil R. [3 ]
Slover, James D. [3 ]
Oh, Jason H. [3 ]
Schwarzkopf, Ran [4 ]
Chen, Antonia F. [5 ]
Iorio, Richard [5 ]
Goodman, Stuart B. [6 ]
Mont, Michael A. [7 ]
Affiliations
[1] Maimonides Hosp, Dept Orthopaed Surg, 927 49th St, Brooklyn, NY 11219 USA
[2] Cleveland Clin, Dept Orthopaed Surg, Cleveland, OH USA
[3] Lenox Hill Hosp, Northwell Orthopaed Inst, Dept Orthopaed Surg, New York, NY USA
[4] NYU Langone Hlth, Dept Orthopaed Surg, NYU Langone Orthoped, New York, NY USA
[5] Brigham & Womens Hosp, Dept Orthopaed Surg, Boston, MA USA
[6] Stanford Univ, Sch Med, Dept Orthopaed Surg, Redwood City, CA USA
[7] Sinai Hosp Baltimore, Rubin Inst Adv Orthoped, Baltimore, MD USA
Source
JOURNAL OF ARTHROPLASTY | 2024 / Vol. 39 / Issue 08
Keywords
ChatGPT; artificial intelligence; large language model; total knee arthroplasty; clinical practice guidelines; ARTIFICIAL-INTELLIGENCE; PERFORMANCE; CALL;
DOI
10.1016/j.arth.2024.02.023
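Chinese Library Classification (CLC) number
R826.8 [Plastic surgery]; R782.2 [Oral and maxillofacial plastic surgery]; R726.2 [Pediatric plastic surgery]; R62 [Plastic surgery (reconstructive surgery)];
Subject classification code
Abstract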
Background: Artificial intelligence in the field of orthopaedics has been a topic of increasing interest and opportunity in recent years. Its applications are widespread both for physicians and patients, including use in clinical decision-making, in the operating room, and in research. In this study, we aimed to assess the quality of ChatGPT answers when asked questions related to total knee arthroplasty.
Methods: ChatGPT prompts were created by turning 15 of the American Academy of Orthopaedic Surgeons Clinical Practice Guidelines into questions. An online survey was created, which included screenshots of each prompt and answers to the 15 questions. Surgeons were asked to grade ChatGPT answers from 1 to 5 based on their characteristics: (1) relevance, (2) accuracy, (3) clarity, (4) completeness, (5) evidence-based, and (6) consistency. There were 11 Adult Joint Reconstruction fellowship-trained surgeons who completed the survey. Questions were subclassified based on the subject of the prompt: (1) risk factors, (2) implant/intraoperative, and (3) pain/functional outcomes. The average and standard deviation for all answers, as well as for each subgroup, were calculated. Inter-rater reliability (IRR) was also calculated.
Results: All answer characteristics were graded as being above average (ie, a score > 3). Relevance demonstrated the highest scores (4.43 ± 0.77) by surgeons surveyed, and consistency demonstrated the lowest scores (3.54 ± 1.10). ChatGPT prompts in the Risk Factors group demonstrated the best responses, while those in the Pain/Functional Outcome group demonstrated the lowest. The overall IRR was found to be 0.33 (poor reliability), with the highest IRR for relevance (0.43) and the lowest for evidence-based (0.28).
Conclusions: ChatGPT can answer questions regarding well-established clinical guidelines in total knee arthroplasty with above-average accuracy but demonstrates variable reliability. This investigation is the first step in understanding large language model artificial intelligence like ChatGPT and how well they perform in the field of arthroplasty.
(c) 2024 Elsevier Inc. All rights reserved.
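The abstract reports a mean and standard deviation for the graded answers and an inter-rater reliability (IRR) value, but it does not name the IRR statistic that was used. The following is a minimal sketch of how such summaries could be computed, assuming the two-way random-effects intraclass correlation ICC(2,1) of Shrout and Fleiss as a stand-in for the IRR and the sample standard deviation for the spread. The function names `summarize` and `icc_2_1` and the ratings matrix are hypothetical and are not data from the study.

```python
from statistics import mean, stdev


def summarize(ratings):
    """Return (mean, sample SD) over every score in a questions x raters matrix."""
    scores = [score for row in ratings for score in row]
    return mean(scores), stdev(scores)


def icc_2_1(ratings):
    """ICC(2,1): two-way random effects, absolute agreement, single rater.

    Assumption: the abstract does not say which IRR statistic was used;
    ICC(2,1) is chosen here only for illustration.
    Rows are rated items (questions), columns are raters (surgeons).
    """
    n = len(ratings)       # number of rated items
    k = len(ratings[0])    # number of raters
    grand = mean(score for row in ratings for score in row)
    row_means = [mean(row) for row in ratings]
    col_means = [mean(row[j] for row in ratings) for j in range(k)]

    # Two-way ANOVA sums of squares
    ss_rows = k * sum((m - grand) ** 2 for m in row_means)
    ss_cols = n * sum((m - grand) ** 2 for m in col_means)
    ss_total = sum((score - grand) ** 2 for row in ratings for score in row)
    ss_error = ss_total - ss_rows - ss_cols

    # Mean squares
    ms_rows = ss_rows / (n - 1)
    ms_cols = ss_cols / (k - 1)
    ms_error = ss_error / ((n - 1) * (k - 1))

    return (ms_rows - ms_error) / (
        ms_rows + (k - 1) * ms_error + k * (ms_cols - ms_error) / n
    )


# Made-up 1-to-5 grades for one characteristic: 5 questions x 4 raters.
example = [
    [5, 4, 5, 4],
    [4, 4, 3, 5],
    [3, 2, 4, 3],
    [5, 5, 4, 4],
    [4, 3, 3, 2],
]
avg, sd = summarize(example)
print(f"mean = {avg:.2f}, SD = {sd:.2f}, ICC(2,1) = {icc_2_1(example):.2f}")
```

In a survey like the one described, one such matrix would be built per answer characteristic (relevance, accuracy, and so on), with the 15 questions as rows and the 11 surgeons as columns.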
Pages: 6
Related papers
50 in total
  • [1] Can ChatGPT Answer Patient Questions Regarding Total Knee Arthroplasty?
    Mika, Aleksander P.
    Mulvey, Hillary E.
    Engstrom, Stephen M.
    Polkowski, Gregory G.
    Martin, J. Ryan
    Wilson, Jacob M.
    JOURNAL OF KNEE SURGERY, 2024, 37 (09) : 664 - 673
  • [2] Language-adaptive artificial intelligence: assessing ChatGPT's answer to frequently asked questions on total hip arthroplasty questions
    Ibrahim, Muhammad Talal
    Khaskheli, Sarah Ashraf
    Shahzad, Hania
    Noordin, Shahryar
    JOURNAL OF THE PAKISTAN MEDICAL ASSOCIATION, 2024, 74 (04) : S161 - S164
  • [3] ChatGPT's ability to comprehend and answer cirrhosis related questions in Arabic
    Samaan, Jamil S.
    Yeo, Yee Hui
    Ng, Wee Han
    Ting, Peng-Sheng
    Trivedi, Hirsh
    Vipani, Aarshi
    Yang, Ju Dong
    Liran, Omer
    Spiegel, Brennan
    Kuo, Alexander
    Ayoub, Walid S.
    ARAB JOURNAL OF GASTROENTEROLOGY, 2023, 24 (03) : 145 - 148
  • [4] ChatGPT's ability to comprehend and answer cirrhosis related questions: Comment
    Daungsupawong, Hinpetch
    Wiwanitkit, Viroj
    ARAB JOURNAL OF GASTROENTEROLOGY, 2024, 25 (01) : 74 - 74
  • [5] Assessing ChatGPT Ability to Answer Frequently Asked Questions About Essential Tremor
    Sorrentino, Cristiano
    Canoro, Vincenzo
    Russo, Maria
    Giordano, Caterina
    Barone, Paolo
    Erro, Roberto
    TREMOR AND OTHER HYPERKINETIC MOVEMENTS, 2024, 14 : 1 - 10
  • [6] Comment on: Assessing ChatGPT's ability to answer questions pertaining to erectile dysfunction
    Hershenhouse, Jacob S.
    Cacciamani, Giovanni E.
    INTERNATIONAL JOURNAL OF IMPOTENCE RESEARCH, 2024, 36 (07) : 796 - 797
  • [7] Evaluating ChatGPT ability to answer urinary tract Infection-Related questions
    Cakir, Hakan
    Caglar, Ufuk
    Sekkeli, Sami
    Zerdali, Esra
    Sarilar, Omer
    Yildiz, Oguzhan
    Ozgor, Faruk
    INFECTIOUS DISEASES NOW, 2024, 54 (04):
  • [8] Response to "ChatGPT's ability to comprehend and answer cirrhosis related questions: Comment"
    Samaan, Jamil S.
    Yeo, Yee Hui
    Ayoub, Walid S.
    ARAB JOURNAL OF GASTROENTEROLOGY, 2024, 25 (02) : 237 - 238
  • [9] Assessing ChatGPT responses to common patient questions regarding total ankle arthroplasty
    Artioli, Elena
    Veronesi, Francesca
    Mazzotti, Antonio
    Brogini, Silvia
    Zielli, Simone Ottavio
    Giavaresi, Gianluca
    Faldini, Cesare
    JOURNAL OF EXPERIMENTAL ORTHOPAEDICS, 2025, 12 (01)
  • [10] Assessing ChatGPT Responses to Common Patient Questions Regarding Total Hip Arthroplasty
    Mika, Aleksander P.
    Martin, J. Ryan
    Engstrom, Stephen M.
    Polkowski, Gregory G.
    Wilson, Jacob M.
    JOURNAL OF BONE AND JOINT SURGERY-AMERICAN VOLUME, 2023, 105 (19) : 1519 - 1526