Analysis of Responses of GPT-4 V to the Japanese National Clinical Engineer Licensing Examination

被引：1

作者：

Ishida, Kai ^{[1
]}

Arisaka, Naoya ^{[2
]}

Fujii, Kiyotaka ^{[3
]}

机构：

[1] Shonan Inst Technol, Fac Engn, Dept Mat & Human Environm Sci, Fujisawa, Japan

[2] Kitasato Univ, Sch Allied Hlth Sci, Dept Med Informat, Sagamihara, Kanagawa, Japan

[3] Kitasato Univ, Sch Allied Hlth Sci, Dept Clin Engn, Sagamihara, Kanagawa, Japan

来源：

JOURNAL OF MEDICAL SYSTEMS | 2024年 / 48卷 / 01期

关键词：

ChatGPT; Multimodal large language models; Artificial intelligence; Clinical engineer; Licensing examination; Medical education; CHATGPT;

D O I：

10.1007/s10916-024-02103-w

中图分类号：

R19 [保健组织与事业（卫生事业管理）];

学科分类号：

摘要：

Chat Generative Pretrained Transformer (ChatGPT; OpenAI) is a state-of-the-art large language model that can simulate human-like conversations based on user input. We evaluated the performance of GPT-4 V in the Japanese National Clinical Engineer Licensing Examination using 2,155 questions from 2012 to 2023. The average correct answer rate for all questions was 86.0%. In particular, clinical medicine, basic medicine, medical materials, biological properties, and mechanical engineering achieved a correct response rate of >= 90%. Conversely, medical device safety management, electrical and electronic engineering, and extracorporeal circulation obtained low correct answer rates ranging from 64.8% to 76.5%. The correct answer rates for questions that included figures/tables, required numerical calculation, figure/table boolean AND calculation, and knowledge of Japanese Industrial Standards were 55.2%, 85.8%, 64.2% and 31.0%, respectively. The reason for the low correct answer rates is that ChatGPT lacked recognition of the images and knowledge of standards and laws. This study concludes that careful attention is required when using ChatGPT because several of its explanations lack the correct description.

引用

页数：9

共 50 条

[21] ChatGPT, GPT-4, and Bard and official board examination: comment
Hinpetch Daungsupawong
Viroj Wiwanitkit
Japanese Journal of Radiology, 2024, 42 : 212 - 213
[22] Applying GPT-4 to the plastic surgery inservice training examination
Zhao, Jiuli
Du, Hong
JOURNAL OF PLASTIC RECONSTRUCTIVE AND AESTHETIC SURGERY, 2024, 91 : 225 - 226
[23] Applying GPT-4 to the Plastic Surgery Inservice Training Examination
Gupta, Rohun
Park, John B.
Herzog, Isabel
Yosufi, Nahid
Mangan, Amelia
Firouzbakht, Peter K.
Mailey, Brian A.
JOURNAL OF PLASTIC RECONSTRUCTIVE AND AESTHETIC SURGERY, 2023, 87 : 78 - 82
[24] Automated Financial Analysis Using GPT-4
Noels, Sander
Merlevede, Adriaan
Fecheyr, Andrew
Vanhalst, Maarten
Meerlaen, Nick
Viaene, Sebastien
De Bie, Tijl
MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES: APPLIED DATA SCIENCE AND DEMO TRACK, ECML PKDD 2023, PT VII, 2023, 14175 : 345 - 349
[25] Evaluating the image recognition capabilities of GPT-4V and Gemini Pro in the Japanese national dental examination
Fukuda, Hikaru
Morishita, Masaki
Muraoka, Kosuke
Yamaguchi, Shino
Nakamura, Taiji
Yoshioka, Izumi
Awano, Shuji
Ono, Kentaro
JOURNAL OF DENTAL SCIENCES, 2025, 20 (01) : 368 - 372
[26] GPT-4 Turbo with Vision fails to outperform text-only GPT-4 Turbo in the Japan Diagnostic Radiology Board Examination
Hirano, Yuichiro
Hanaoka, Shouhei
Nakao, Takahiro
Miki, Soichiro
Kikuchi, Tomohiro
Nakamura, Yuta
Nomura, Yukihiro
Yoshikawa, Takeharu
Abe, Osamu
JAPANESE JOURNAL OF RADIOLOGY, 2024, 42 (08) : 918 - 926
[27] ChatGPT (GPT-4) passed the Japanese National License Examination for Pharmacists in 2022, answering all items including those with diagrams: a descriptive study
Sato, Hiroyasu
Ogasawara, Katsuhiko
JOURNAL OF EDUCATIONAL EVALUATION FOR HEALTH PROFESSIONS, 2024, 21
[28] Performance of ChatGPT-3.5 and GPT-4 in national licensing examinations for medicine, pharmacy, dentistry, and nursing: a systematic review and meta-analysis
Jin, Hye Kyung
Lee, Ha Eun
Kim, Eunyoung
BMC MEDICAL EDUCATION, 2024, 24 (01)
[29] GPT-4 turbo with vision fails to outperform text-only GPT-4 turbo in the Japan diagnostic radiology board examination: correspondence
Kleebayoon, Amnuay
Wiwanitkit, Viroj
JAPANESE JOURNAL OF RADIOLOGY, 2024, 42 (10) : 1213 - 1213
[30] Assessing Generative Pretrained Transformers (GPT) in Clinical Decision-Making: Comparative Analysis of GPT-3.5 and GPT-4
Lahat, Adi
Sharif, Kassem
Zoabi, Narmin
Patt, Yonatan Shneor
Sharif, Yousra
Fisher, Lior
Shani, Uria
Arow, Mohamad
Levin, Roni
Klang, Eyal
JOURNAL OF MEDICAL INTERNET RESEARCH, 2024, 26

← 1 2 3 4 5 →