Can ChatGPT-3.5 Pass a Medical Exam? A Systematic Review of ChatGPT's Performance in Academic Testing

被引：16

作者：

Sumbal, Anusha ^{[1
]}

Sumbal, Ramish ^{[1
]}

Amir, Alina ^{[1
]}

机构：

[1] Dow Univ Hlth Sci, Baba E Urdu Rd, Karachi 74200, Pakistan

来源：

JOURNAL OF MEDICAL EDUCATION AND CURRICULAR DEVELOPMENT | 2024年 / 11卷

关键词：

ChatGPT; academic performance; medical education; artificial intelligence; digital health; medicine;

D O I：

10.1177/23821205241238641

中图分类号：

G40 [教育学];

学科分类号：

040101 ; 120403 ;

摘要：

OBJECTIVE We, therefore, aim to conduct a systematic review to assess the academic potential of ChatGPT-3.5, along with its strengths and limitations when giving medical exams.METHOD Following PRISMA guidelines, a systemic search of the literature was performed using electronic databases PUBMED/MEDLINE, Google Scholar, and Cochrane. Articles from their inception till April 4, 2023, were queried. A formal narrative analysis was conducted by systematically arranging similarities and differences between individual findings together.RESULTS After rigorous screening, 12 articles underwent this review. All the selected papers assessed the academic performance of ChatGPT-3.5. One study compared the performance of ChatGPT-3.5 with the performance of ChatGPT-4 when giving a medical exam. Overall, ChatGPT performed well in 4 tests, averaged in 4 tests, and performed badly in 4 tests. ChatGPT's performance was directly proportional to the level of the questions' difficulty but was unremarkable on whether the questions were binary, descriptive, or MCQ-based. ChatGPT's explanation, reasoning, memory, and accuracy were remarkably good, whereas it failed to understand image-based questions, and lacked insight and critical thinking.CONCLUSION ChatGPT-3.5 performed satisfactorily in the exams it took as an examinee. However, there is a need for future related studies to fully explore the potential of ChatGPT in medical education.

引用

页数：12

共 50 条

[41] Below average ChatGPT performance in medical microbiology exam compared to university students
Sallam, Malik
Al-Salahat, Khaled
FRONTIERS IN EDUCATION, 2023, 8
[42] ChatGPT's performance on JS']JSA-certified anesthesiologist exam
Kinoshita, Michiko
Komasaka, Mizuki
Tanaka, Katsuya
JOURNAL OF ANESTHESIA, 2024, 38 (02) : 282 - 283
[43] Evaluating ChatGPT-3.5 and Claude-2 in Answering and Explaining Conceptual Medical Physiology Multiple-Choice Questions
Agarwal, Mayank
Goswami, Ayan
Sharma, Priyanka
CUREUS JOURNAL OF MEDICAL SCIENCE, 2023, 15 (09)
[44] Reply to 'Comment on: Benchmarking the performance of large language models in uveitis: a comparative analysis of ChatGPT-3.5, ChatGPT-4.0, Google Gemini, and Anthropic Claude3'
Zhao, Fang-Fang
He, Han-Jie
Liang, Jia-Jian
Cen, Ling-Ping
EYE, 2025,
[45] ChatGPT in radiology: A systematic review of performance, pitfalls, and future perspectives
Keshavarz, Pedram
Bagherieh, Sara
Nabipoorashra, Seyed Ali
Chalian, Hamid
Rahsepar, Amir Ali
Kim, Grace Hyun J.
Hassani, Cameron
Raman, Steven S.
Bedayat, Arash
DIAGNOSTIC AND INTERVENTIONAL IMAGING, 2024, 105 (7-8) : 251 - 265
[46] Does small talk with a medical provider affect ChatGPT's medical counsel? Performance of ChatGPT on USMLE with and without distractions
Safrai, Myriam
Azaria, Amos
PLOS ONE, 2024, 19 (04):
[47] More human than human? Differences in lexis and collocation within academic essays produced by ChatGPT-3.5 and human L2 writers
Zhang, Mengxuan
Crosthwaite, Peter
IRAL-INTERNATIONAL REVIEW OF APPLIED LINGUISTICS IN LANGUAGE TEACHING, 2025,
[48] Accuracy and consistency of ChatGPT-3.5 and-4 in providing differential diagnoses in oral and maxillofacial diseases: a comparative diagnostic performance analysis
Tomo, Saygo
Lechien, Jerome R.
Bueno, Hugo Sobrinho
Cantieri-Debortoli, Daniela Filie
Simonato, Luciana Estevam
CLINICAL ORAL INVESTIGATIONS, 2024, 28 (10)
[49] The Performance of ChatGPT-4V in Interpreting Images and Tables in the Japanese Medical Licensing Exam
Takagi, Soshi
Koda, Masahide
Watari, Takashi
JMIR MEDICAL EDUCATION, 2024, 10
[50] Potential of ChatGPT to Pass the Japanese Medical and Healthcare Professional National Licenses: A Literature Review
Ishida, Kai
Hanada, Eisuke
CUREUS JOURNAL OF MEDICAL SCIENCE, 2024, 16 (08)

← 1 2 3 4 5 →