Evaluating ChatGPT-4's correctness in patient-focused informing and awareness for atrial fibrillation

Cited by: 0

Authors:

Zeljkovic, Ivan [1,2]

Novak, Matea [2,3,5]

Jordan, Ana [1]

Lisicic, Ante [1]

Nemeth-Blazic, Tatjana [4]

Pavlovic, Nikola [1]

Manola, Sime [1]

Affiliations:

[1] Dubrava Univ Hosp, Dept Cardiovasc Dis, Ave Gojka Suska, Zagreb, Croatia

[2] Catholic Univ Croatia, Zagreb, Croatia

[3] RIT Croatia, Rochester Inst Technol, Zagreb, Croatia

[4] Croatian Inst Publ Hlth, Zagreb, Croatia

[5] Catholic Univ Croatia, Sch Med, Zagreb, Croatia

Source:

HEART RHYTHM O2 | 2025 / Volume 6 / Issue 01

Keywords:

GPT-4; Patient; Informing; Atrial fibrillation; Large language models; AI in health care;

DOI:

10.1016/j.hroo.2024.10.005

Chinese Library Classification number:

R5 [Internal Medicine];

Discipline classification code:

1002 ; 100201 ;

Abstract:

BACKGROUND As artificial intelligence and large language models continue to evolve, their application in health care is expanding. OpenAI's Chat Generative Pre-trained Transformer 4 (ChatGPT-4) represents the latest advancement in this technology, capable of engaging in complex dialogues and providing information.

OBJECTIVE This study explores the correctness of ChatGPT-4 in informing patients about atrial fibrillation.

METHODS This cross-sectional observational study involved ChatGPT-4 in responding to a structured set of 108 questions across 10 categories related to atrial fibrillation. These categories included basic information, treatment options, lifestyle adjustments, and more, reflecting common patient inquiries. The model's responses were evaluated by a panel of 3 cardiologists on the basis of accuracy, comprehensiveness, clarity, relevance to clinical practice, and patient safety. The total correctness of ChatGPT-4 was quantitatively assessed through scores assigned in each category, and statistical analysis was performed to identify significant differences in performance across categories.

RESULTS ChatGPT-4 provided correct and relevant answers with considerable variability across categories. It excelled in "Lifestyle Adjustments" and "Daily Life and Management," with perfect and near-perfect scores, but struggled with "Miscellaneous Concerns," scoring lower. Statistical analysis confirmed significant differences in total scores across categories (P = .020).

CONCLUSION Our results suggest that while ChatGPT-4 is reliable in categories with structured and direct queries, it shows limitations when handling complex medical queries that require in-depth explanations or clinical judgment. ChatGPT-4 demonstrates promising potential as a tool for patient-focused informing in atrial fibrillation, particularly in straightforward informing content.

Pages: 58-63

Page count: 6

12 items in total

[1] Evaluating ChatGPT-4's Diagnostic Accuracy: Impact of Visual Data Integration
Hirosawa, Takanobu
Harada, Yukinori
Tokumasu, Kazuki
Ito, Takahiro
Suzuki, Tomoharu
Shimizu, Taro
JMIR MEDICAL INFORMATICS, 2024, 12
[2] Evaluating ChatGPT-4's performance as a digital health advisor for otosclerosis surgery
Sahin, Samil
Erkmen, Burak
Duymaz, Yasar Kemal
Bayram, Furkan
Tekin, Ahmet Mahmut
Topsakal, Vedat
FRONTIERS IN SURGERY, 2024, 11
[3] Revolutionizing Diagnostics: Evaluating ChatGPT-4's Performance in Ulcerative Colitis Endoscopic Assessment
Levartovsky, A.
Albshesh, A.
Grinman, A.
Shachar, E.
Lahat, A.
Eliakim, R.
Kopylov, U.
JOURNAL OF CROHNS & COLITIS, 2025, 19 : I748 - I748
[4] Evaluating ChatGPT-4's historical accuracy: a case study on the origins of SWOT analysis
Puyt, Richard W.
Madsen, Dag Øivind
FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
[5] Evaluating ChatGPT-4 Vision on Brazil's National Undergraduate Computer Science Exam
Mendonca, Nabor C.
ACM TRANSACTIONS ON COMPUTING EDUCATION, 2024, 24 (03):
[6] Evaluating ChatGPT-4's performance on oral and maxillofacial queries: Chain of Thought and standard method
Ji, Kaiyuan
Wu, Zhihan
Han, Jing
Zhai, Guangtao
Liu, Jiannan
FRONTIERS IN ORAL HEALTH, 2025, 6
[7] Evaluating ChatGPT-4's Performance in Identifying Radiological Anatomy in FRCR Part 1 Examination Questions
Sarangi, Pradosh Kumar
Datta, Suvrankar
Panda, Braja Behari
Panda, Swaha
Mondal, Himel
INDIAN JOURNAL OF RADIOLOGY AND IMAGING, 2024,
[8] Evaluating Artificial Intelligence Efficacy: A Comparative Study between ChatGPT-4's Treatment Recommendations and Orthopaedic Clinical Practice Guidelines
Dagher, Tanios
Dwyer, Emma
Baker, Hayden P.
Kalidoss, Senthooran
Strelzow, Jason
JOURNAL OF THE AMERICAN COLLEGE OF SURGEONS, 2024, 239 (05) : S325 - S326
[9] Evaluating AI Capabilities in Bariatric Surgery: A Study on ChatGPT-4 and DALL·E 3's Recognition and Illustration Accuracy
Mahjoubi, Mohammad
Shahabi, Shahab
Sheikhbahaei, Saba
Jazi, Amir Hossein Davarpanah
OBESITY SURGERY, 2025, 35 (02) : 638 - 641
[10] AI-driven patient support: Evaluating the effectiveness of ChatGPT-4 in addressing queries about ovarian cancer compared with healthcare professionals in gynecologic oncology
Chou, Hung-Hsueh
Chen, Yi Hua
Lin, Chiu-Tzu
Chang, Hsien-Tsung
Wu, An-Chieh
Tsai, Jia-Ling
Chen, Hsiao-Wei
Hsu, Ching-Chun
Liu, Shu-Ya
Lee, Jian Tao
SUPPORTIVE CARE IN CANCER, 2025, 33 (04)
