A pilot evaluation of the diagnostic accuracy of ChatGPT-3.5 for multiple sclerosis from case reports

Authors
Joseph, Anika [1]
Joseph, Kevin [2]
Joseph, Angelyn [3]
Affiliations
[1] Univ Ottawa, Hlth Sci Program, 75 Laurier Ave E, Ottawa, ON K1N 6N5, Canada
[2] Univ Ottawa, Biomed Sci Program, 75 Laurier Ave E, Ottawa, ON K1N 6N5, Canada
[3] Merivale High Sch, 1755 Merivale Rd, Nepean, ON K2G 1E2, Canada
Keywords
artificial intelligence; multiple sclerosis; case reports; legal
DOI
10.1515/tnsci-2022-0361
Chinese Library Classification number
Q189 [Neuroscience]
Discipline classification code
071006
Abstract
The limitations of artificial intelligence (AI) large language models in diagnosing diseases remain underexplored from the perspective of patient safety, and potential challenges, such as diagnostic errors and legal challenges, need to be addressed. To demonstrate these limitations, we used ChatGPT-3.5, developed by OpenAI, as a tool for medical diagnosis using text-based case reports of multiple sclerosis (MS), which was selected as a prototypic disease. We analyzed 98 peer-reviewed case reports, selected based on free full-text availability and publication within the past decade (2014-2024), with any mention of an MS diagnosis excluded to avoid bias. ChatGPT-3.5 was used to interpret the clinical presentations and laboratory data from these reports. The model correctly diagnosed MS in 77 cases, an accuracy of 78.6%. The remaining 21 cases were misdiagnosed, highlighting the model's limitations. Factors contributing to the errors include variability in data presentation and the inherent complexity of MS diagnosis, which requires imaging modalities in addition to clinical presentations and laboratory data. While these findings suggest that AI can support disease diagnosis and assist healthcare providers in decision-making, insufficient training on large datasets may lead to significant inaccuracies. Integrating AI into clinical practice necessitates rigorous validation and robust regulatory frameworks to ensure responsible use.
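The reported accuracy follows directly from the counts in the abstract (77 correct out of 98 cases). The short Python sketch below reproduces that arithmetic. The function names are illustrative, and the 95% Wilson score interval is an addition for context only: it is an assumption about how one might express the uncertainty of a 98-case sample, not a figure reported by the authors.

```python
import math


def accuracy(correct: int, total: int) -> float:
    """Fraction of cases diagnosed correctly."""
    return correct / total


def wilson_interval(correct: int, total: int, z: float = 1.96) -> tuple[float, float]:
    """Wilson score interval for a binomial proportion.

    Not part of the source abstract; z = 1.96 assumes a 95% level.
    """
    p = correct / total
    denom = 1 + z**2 / total
    center = (p + z**2 / (2 * total)) / denom
    half_width = z * math.sqrt(p * (1 - p) / total + z**2 / (4 * total**2)) / denom
    return center - half_width, center + half_width


# Counts taken from the abstract.
TOTAL_CASES = 98
CORRECT_CASES = 77

misdiagnosed = TOTAL_CASES - CORRECT_CASES
acc = accuracy(CORRECT_CASES, TOTAL_CASES)
low, high = wilson_interval(CORRECT_CASES, TOTAL_CASES)

print(f"Misdiagnosed cases: {misdiagnosed}")
print(f"Accuracy: {acc:.1%}")
print(f"95% Wilson interval (illustrative): {low:.1%} to {high:.1%}")
```

With a sample of this size the interval is fairly wide, which is consistent with the abstract's framing of the work as a pilot evaluation.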
Pages: 7