Analysis of Responses of GPT-4 V to the Japanese National Clinical Engineer Licensing Examination

Authors
Ishida, Kai [1]
Arisaka, Naoya [2]
Fujii, Kiyotaka [3]
Affiliations
[1] Shonan Inst Technol, Fac Engn, Dept Mat & Human Environm Sci, Fujisawa, Japan
[2] Kitasato Univ, Sch Allied Hlth Sci, Dept Med Informat, Sagamihara, Kanagawa, Japan
[3] Kitasato Univ, Sch Allied Hlth Sci, Dept Clin Engn, Sagamihara, Kanagawa, Japan
Keywords
ChatGPT; Multimodal large language models; Artificial intelligence; Clinical engineer; Licensing examination; Medical education
DOI
10.1007/s10916-024-02103-w
Chinese Library Classification
R19 [Health organizations and services (health services management)]
Abstract
Chat Generative Pretrained Transformer (ChatGPT; OpenAI) is a state-of-the-art large language model that can simulate human-like conversations based on user input. We evaluated the performance of GPT-4 V on the Japanese National Clinical Engineer Licensing Examination using 2,155 questions from 2012 to 2023. The average correct answer rate across all questions was 86.0%. In particular, clinical medicine, basic medicine, medical materials, biological properties, and mechanical engineering achieved correct answer rates of ≥ 90%. Conversely, medical device safety management, electrical and electronic engineering, and extracorporeal circulation had low correct answer rates, ranging from 64.8% to 76.5%. The correct answer rates for questions that included figures/tables, required numerical calculation, involved both figures/tables and calculation, and required knowledge of Japanese Industrial Standards were 55.2%, 85.8%, 64.2%, and 31.0%, respectively. The low correct answer rates are attributed to ChatGPT's limited recognition of images and lack of knowledge of standards and laws. This study concludes that careful attention is required when using ChatGPT because several of its explanations lack correct descriptions.
Pages: 9