Artificial intelligence in global health equity: an evaluation and discussion on the application of ChatGPT, in the Chinese National Medical Licensing Examination

被引:6
|
作者
Tong, Wenting [1 ]
Guan, Yongfu [2 ]
Chen, Jinping [2 ]
Huang, Xixuan [3 ]
Zhong, Yuting [4 ]
Zhang, Changrong [5 ]
Zhang, Hui [2 ,6 ]
机构
[1] Gannan Healthcare Vocat Coll, Dept Pharm, Ganzhou, Peoples R China
[2] Gannan Hlth Vocat Coll, Dept Rehabil & Elderly Care, Ganzhou, Jiangxi, Peoples R China
[3] Xiamen Univ, Dept Math, Xiamen, Fujian, Peoples R China
[4] Gannan Med Univ, Dept Anesthesiol, Ganzhou, Jiangxi, Peoples R China
[5] Qinghai Univ, Dept Chinese Med, Affiliated Hosp, Xining, Qinghai, Peoples R China
[6] Univ Roma Tor Vergata, Dept Syst Med, Chair Endocrinol & Med Sexol ENDOSEX, Rome, Italy
关键词
global healthcare; equity; artificial intelligence; ChatGPT; language bias;
D O I
10.3389/fmed.2023.1237432
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
BackgroundThe demand for healthcare is increasing globally, with notable disparities in access to resources, especially in Asia, Africa, and Latin America. The rapid development of Artificial Intelligence (AI) technologies, such as OpenAI's ChatGPT, has shown promise in revolutionizing healthcare. However, potential challenges, including the need for specialized medical training, privacy concerns, and language bias, require attention.MethodsTo assess the applicability and limitations of ChatGPT in Chinese and English settings, we designed an experiment evaluating its performance in the 2022 National Medical Licensing Examination (NMLE) in China. For a standardized evaluation, we used the comprehensive written part of the NMLE, translated into English by a bilingual expert. All questions were input into ChatGPT, which provided answers and reasons for choosing them. Responses were evaluated for "information quality" using the Likert scale.ResultsChatGPT demonstrated a correct response rate of 81.25% for Chinese and 86.25% for English questions. Logistic regression analysis showed that neither the difficulty nor the subject matter of the questions was a significant factor in AI errors. The Brier Scores, indicating predictive accuracy, were 0.19 for Chinese and 0.14 for English, indicating good predictive performance. The average quality score for English responses was excellent (4.43 point), slightly higher than for Chinese (4.34 point).ConclusionWhile AI language models like ChatGPT show promise for global healthcare, language bias is a key challenge. Ensuring that such technologies are robustly trained and sensitive to multiple languages and cultures is vital. Further research into AI's role in healthcare, particularly in areas with limited resources, is warranted.
引用
收藏
页数:7
相关论文
共 44 条
  • [21] Capability of GPT-4V(ision) in the Japanese National Medical Licensing Examination: Evaluation Study
    Nakao, Takahiro
    Miki, Soichiro
    Nakamura, Yuta
    Kikuchi, Tomohiro
    Nomura, Yukihiro
    Hanaoka, Shouhei
    Yoshikawa, Takeharu
    Abe, Osamu
    JMIR MEDICAL EDUCATION, 2024, 10
  • [22] Performance of ChatGPT in Solving Questions From the Progress Test (Brazilian National Medical Exam): A Potential Artificial Intelligence Tool in Medical Practice
    Alessi, Mateus Rodrigues
    Gomes, Heitor A.
    de Castro, Matheus Lopes
    Okamoto, Cristina Terumy
    CUREUS JOURNAL OF MEDICAL SCIENCE, 2024, 16 (07)
  • [23] Evaluation Model for the Application of Artificial Intelligence Medical Assistant System to the Development of Medical Ecology in China
    Shieh, Chich-Jen
    Wan, Guang-Sheng
    Wang, Wei
    Luo, Yuzhou
    EKOLOJI, 2019, 28 (107): : 311 - 316
  • [24] Artificial intelligence and global health equity Regulation and monitoring are needed to prevent harmful bias in AI tools
    Dychiao, Robyn Gayle
    Nazer, Lama
    Mlombwa, Donald
    Celi, Leo Anthony
    BMJ-BRITISH MEDICAL JOURNAL, 2024, 387
  • [25] Application of the "Plan-Do-Check-Action" plan in improving the pass rate of the "National Medical Licensing Examination"
    Zhou, Shu
    Zhang, Xian
    Zhang, Hao
    Zhang, Donglei
    Wei, Renxiong
    Yang, Miao
    BMC MEDICAL EDUCATION, 2024, 24 (01)
  • [26] Embedding Ethics and Equity in Artificial Intelligence and Machine Learning Infrastructure: National Alliance against Disparities in Patient Health
    Simmons, Malaika
    Hendricks-Sturrup, Rachele
    Waters, Gabriella
    Novak, Laurie
    Were, Martin
    Hussain, Sajid
    BIG DATA, 2023, 11
  • [27] North Korean defectors seeking health certification to take the national medical licensing examination in the Republic of Korea: figures and procedures
    Kim, Yoon Hee
    JOURNAL OF EDUCATIONAL EVALUATION FOR HEALTH PROFESSIONS, 2012, 9
  • [28] Application of artificial intelligence in lung cancer screening: A real-world study in a Chinese physical examination population
    Wu, Jiaxuan
    Li, Ruicen
    Gan, Jiadi
    Zheng, Qian
    Wang, Guoqing
    Tao, Wenjuan
    Yang, Ming
    Li, Wenyu
    Ji, Guiyi
    Li, Weimin
    THORACIC CANCER, 2024, 15 (28) : 2061 - 2072
  • [29] Application of multi-sensor network and artificial intelligence in health monitoring of medical geriatric care
    Meng, Hua
    Han, Yi
    Zan, Zhi
    SOFT COMPUTING, 2023,
  • [30] Combining collective and artificial intelligence for global health diseases diagnosis using crowdsourced annotated medical images
    Lin, Lin
    Bermejo-Pelaez, David
    Capellan-Martin, Daniel
    Cuadrado, Daniel
    Rodriguez, Cristina
    Garcia, Lydia
    Diez, Nuria
    Tome, Rocio
    Postigo, Maria
    Jesus Ledesma-Carbayo, Maria
    Luengo-Oroz, Miguel
    2021 43RD ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE & BIOLOGY SOCIETY (EMBC), 2021, : 3344 - 3348