Is ChatGPT accurate and reliable in answering questions regarding head and neck cancer?

被引：38

作者：

Kuscu, Oguz ^{[1
]}

Pamuk, A. Erim ^{[1
]}

Suslu, Nilda Sutay ^{[2
]}

Hosal, Sefik ^{[2
]}

机构：

[1] Hacettepe Univ, Sch Med, Dept Otorhinolaryngol, TR-06100 Ankara, Turkiye

[2] Atılım Univ, Sch Med, Dept Otorhinolaryngol, Ankara, Turkiye

来源：

FRONTIERS IN ONCOLOGY | 2023年 / 13卷

关键词：

ChatGPT; 4; head and neck (H&N) cancer; head and neck; artificial intelligence; chatbot; information literacy; natural language processing; machine learning; MODELS;

D O I：

10.3389/fonc.2023.1256459

中图分类号：

R73 [肿瘤学];

学科分类号：

100214 ;

摘要：

Background and objective Chat Generative Pre-trained Transformer (ChatGPT) is an artificial intelligence (AI)-based language processing model using deep learning to create human-like text dialogue. It has been a popular source of information covering vast number of topics including medicine. Patient education in head and neck cancer (HNC) is crucial to enhance the understanding of patients about their medical condition, diagnosis, and treatment options. Therefore, this study aims to examine the accuracy and reliability of ChatGPT in answering questions regarding HNC.Methods 154 head and neck cancer-related questions were compiled from sources including professional societies, institutions, patient support groups, and social media. These questions were categorized into topics like basic knowledge, diagnosis, treatment, recovery, operative risks, complications, follow-up, and cancer prevention. ChatGPT was queried with each question, and two experienced head and neck surgeons assessed each response independently for accuracy and reproducibility. Responses were rated on a scale: (1) comprehensive/correct, (2) incomplete/partially correct, (3) a mix of accurate and inaccurate/misleading, and (4) completely inaccurate/irrelevant. Discrepancies in grading were resolved by a third reviewer. Reproducibility was evaluated by repeating questions and analyzing grading consistency.Results ChatGPT yielded "comprehensive/correct" responses to 133/154 (86.4%) of the questions whereas, rates of "incomplete/partially correct" and "mixed with accurate and inaccurate data/misleading" responses were 11% and 2.6%, respectively. There were no "completely inaccurate/irrelevant" responses. According to category, the model provided "comprehensive/correct" answers to 80.6% of questions regarding "basic knowledge", 92.6% related to "diagnosis", 88.9% related to "treatment", 80% related to "recovery - operative risks - complications - follow-up", 100% related to "cancer prevention" and 92.9% related to "other". There was not any significant difference between the categories regarding the grades of ChatGPT responses (p=0.88). The rate of reproducibility was 94.1% (145 of 154 questions).Conclusion ChatGPT generated substantially accurate and reproducible information to diverse medical queries related to HNC. Despite its limitations, it can be a useful source of information for both patients and medical professionals. With further developments in the model, ChatGPT can also play a crucial role in clinical decision support to provide the clinicians with up-to-date information.

引用

页数：7

共 50 条

[21] Assessing the applicability and appropriateness of ChatGPT in answering clinical pharmacy questions
Fournier, A.
Fallet, C.
Sadeghipour, F.
Perrottet, N.
ANNALES PHARMACEUTIQUES FRANCAISES, 2024, 82 (03): : 507 - 513
[22] EVALUATING PERFORMANCE OF CHATGPT IN ANSWERING PHARMACIST RELICENSING EXAM QUESTIONS
Rugova, Mimoza
Kastrati, Natyra
Hoti, Kreshnik
RESEARCH IN SOCIAL & ADMINISTRATIVE PHARMACY, 2024, 20 (12):
[23] Performance of ChatGPT in Answering Clinical Questions on the Practical Guideline of Blepharoptosis
Shiraishi, Makoto
Tomioka, Yoko
Miyakuni, Ami
Ishii, Saaya
Hori, Asei
Park, Hwayoung
Ohba, Jun
Okazaki, Mutsumi
AESTHETIC PLASTIC SURGERY, 2024, : 2389 - 2398
[24] How good is ChatGPT at answering patients' questions related to early detection of oral (mouth) cancer?
Hassona, Yazan
Alqaisi, Dua'a
AL-Haddad, Alaa
Georgakopoulou, Eleni A.
Malamos, Dimitris
Alrashdan, Mohammad S.
Sawair, Faleh
ORAL SURGERY ORAL MEDICINE ORAL PATHOLOGY ORAL RADIOLOGY, 2024, 138 (02): : 269 - 278
[25] Comments on "Performance of ChatGPT in Answering Clinical Questions on the Practical Guideline of Blepharoptosis"
Hashemi, Saleh
Karbalaei, Mohsen
Keikha, Masoud
AESTHETIC PLASTIC SURGERY, 2024,
[26] Comparison of the performances between ChatGPT and Gemini in answering questions on viral hepatitis
Sahin Ozdemir, Meryem
Ozdemir, Yusuf Emre
SCIENTIFIC REPORTS, 2025, 15 (01):
[27] Evaluation of ChatGPT and Gemini in Answering Patient Questions After Gynecologic Surgery
Voigt, P.
Sharma, R.
Milad, M. P.
Chaudhari, A.
Tsai, S.
Yang, L.
OBSTETRICS AND GYNECOLOGY, 2025, 145 (5S): : 39S - 39S
[28] Clarification Regarding the Association of Cannabis Use and Head and Neck Cancer
Katz, Joseph
JAMA OTOLARYNGOLOGY-HEAD & NECK SURGERY, 2025, 151 (02)
[29] Answering questions relating to the reliable extrapolation of creep rupture data
Stewart, Calvin M.
Cano, Jaime A.
Haque, Mohammad Shafinul
INTERNATIONAL JOURNAL OF PRESSURE VESSELS AND PIPING, 2023, 201
[30] Questions Regarding Patient-Reported Symptom Burden as a Predictor of Emergency Department Use and Unplanned Hospitalization in Head and Neck Cancer
Yokoyama, Kazuki
Ishiki, Hiroto
JOURNAL OF CLINICAL ONCOLOGY, 2021, 39 (21) : 2415 - +

← 1 2 3 4 5 →