Can ChatGPT-3.5 Pass a Medical Exam? A Systematic Review of ChatGPT's Performance in Academic Testing

被引:16
|
作者
Sumbal, Anusha [1 ]
Sumbal, Ramish [1 ]
Amir, Alina [1 ]
机构
[1] Dow Univ Hlth Sci, Baba E Urdu Rd, Karachi 74200, Pakistan
关键词
ChatGPT; academic performance; medical education; artificial intelligence; digital health; medicine;
D O I
10.1177/23821205241238641
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
OBJECTIVE We, therefore, aim to conduct a systematic review to assess the academic potential of ChatGPT-3.5, along with its strengths and limitations when giving medical exams.METHOD Following PRISMA guidelines, a systemic search of the literature was performed using electronic databases PUBMED/MEDLINE, Google Scholar, and Cochrane. Articles from their inception till April 4, 2023, were queried. A formal narrative analysis was conducted by systematically arranging similarities and differences between individual findings together.RESULTS After rigorous screening, 12 articles underwent this review. All the selected papers assessed the academic performance of ChatGPT-3.5. One study compared the performance of ChatGPT-3.5 with the performance of ChatGPT-4 when giving a medical exam. Overall, ChatGPT performed well in 4 tests, averaged in 4 tests, and performed badly in 4 tests. ChatGPT's performance was directly proportional to the level of the questions' difficulty but was unremarkable on whether the questions were binary, descriptive, or MCQ-based. ChatGPT's explanation, reasoning, memory, and accuracy were remarkably good, whereas it failed to understand image-based questions, and lacked insight and critical thinking.CONCLUSION ChatGPT-3.5 performed satisfactorily in the exams it took as an examinee. However, there is a need for future related studies to fully explore the potential of ChatGPT in medical education.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Below average ChatGPT performance in medical microbiology exam compared to university students
    Sallam, Malik
    Al-Salahat, Khaled
    FRONTIERS IN EDUCATION, 2023, 8
  • [42] ChatGPT's performance on JS']JSA-certified anesthesiologist exam
    Kinoshita, Michiko
    Komasaka, Mizuki
    Tanaka, Katsuya
    JOURNAL OF ANESTHESIA, 2024, 38 (02) : 282 - 283
  • [43] Evaluating ChatGPT-3.5 and Claude-2 in Answering and Explaining Conceptual Medical Physiology Multiple-Choice Questions
    Agarwal, Mayank
    Goswami, Ayan
    Sharma, Priyanka
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (09)
  • [44] Reply to 'Comment on: Benchmarking the performance of large language models in uveitis: a comparative analysis of ChatGPT-3.5, ChatGPT-4.0, Google Gemini, and Anthropic Claude3'
    Zhao, Fang-Fang
    He, Han-Jie
    Liang, Jia-Jian
    Cen, Ling-Ping
    EYE, 2025,
  • [45] ChatGPT in radiology: A systematic review of performance, pitfalls, and future perspectives
    Keshavarz, Pedram
    Bagherieh, Sara
    Nabipoorashra, Seyed Ali
    Chalian, Hamid
    Rahsepar, Amir Ali
    Kim, Grace Hyun J.
    Hassani, Cameron
    Raman, Steven S.
    Bedayat, Arash
    DIAGNOSTIC AND INTERVENTIONAL IMAGING, 2024, 105 (7-8) : 251 - 265
  • [46] Does small talk with a medical provider affect ChatGPT's medical counsel? Performance of ChatGPT on USMLE with and without distractions
    Safrai, Myriam
    Azaria, Amos
    PLOS ONE, 2024, 19 (04):
  • [47] More human than human? Differences in lexis and collocation within academic essays produced by ChatGPT-3.5 and human L2 writers
    Zhang, Mengxuan
    Crosthwaite, Peter
    IRAL-INTERNATIONAL REVIEW OF APPLIED LINGUISTICS IN LANGUAGE TEACHING, 2025,
  • [48] Accuracy and consistency of ChatGPT-3.5 and-4 in providing differential diagnoses in oral and maxillofacial diseases: a comparative diagnostic performance analysis
    Tomo, Saygo
    Lechien, Jerome R.
    Bueno, Hugo Sobrinho
    Cantieri-Debortoli, Daniela Filie
    Simonato, Luciana Estevam
    CLINICAL ORAL INVESTIGATIONS, 2024, 28 (10)
  • [49] The Performance of ChatGPT-4V in Interpreting Images and Tables in the Japanese Medical Licensing Exam
    Takagi, Soshi
    Koda, Masahide
    Watari, Takashi
    JMIR MEDICAL EDUCATION, 2024, 10
  • [50] Potential of ChatGPT to Pass the Japanese Medical and Healthcare Professional National Licenses: A Literature Review
    Ishida, Kai
    Hanada, Eisuke
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2024, 16 (08)