Evaluating ChatGPT-4's correctness in patient-focused informing and awareness for atrial fibrillation

Cited by: 0
Authors
Zeljkovic, Ivan [1, 2]
Novak, Matea [2, 3, 5]
Jordan, Ana [1 ]
Lisicic, Ante [1 ]
Nemeth-Blazic, Tatjana [4 ]
Pavlovic, Nikola [1 ]
Manola, Sime [1 ]
Affiliations
[1] Dubrava Univ Hosp, Dept Cardiovasc Dis, Ave Gojka Suska, Zagreb, Croatia
[2] Catholic Univ Croatia, Zagreb, Croatia
[3] RIT Croatia, Rochester Inst Technol, Zagreb, Croatia
[4] Croatian Inst Publ Hlth, Zagreb, Croatia
[5] Catholic Univ Croatia, Sch Med, Zagreb, Croatia
Source
HEART RHYTHM O2 | 2025 / Vol. 6 / Issue 01
Keywords
GPT-4; Patient; Informing; Atrial fibrillation; Large language models; AI in health care
DOI
10.1016/j.hroo.2024.10.005
Chinese Library Classification number
R5 [Internal Medicine];
Discipline classification code
1002 ; 100201 ;
Abstract
BACKGROUND As artificial intelligence and large language models continue to evolve, their application in health care is expanding. OpenAI's Chat Generative Pre-trained Transformer 4 (ChatGPT-4) represents the latest advancement in this technology, capable of engaging in complex dialogues and providing information.
OBJECTIVE This study explores the correctness of ChatGPT-4 in informing patients about atrial fibrillation.
METHODS This cross-sectional observational study involved ChatGPT-4 in responding to a structured set of 108 questions across 10 categories related to atrial fibrillation. These categories included basic information, treatment options, lifestyle adjustments, and more, reflecting common patient inquiries. The model's responses were evaluated by a panel of 3 cardiologists on the basis of accuracy, comprehensiveness, clarity, relevance to clinical practice, and patient safety. The total correctness of ChatGPT-4 was quantitatively assessed through scores assigned in each category, and statistical analysis was performed to identify significant differences in performance across categories.
RESULTS ChatGPT-4 provided correct and relevant answers with considerable variability across categories. It excelled in "Lifestyle Adjustments" and "Daily Life and Management" with perfect and near-perfect scores but struggled with "Miscellaneous Concerns," scoring lower. Statistical analysis confirmed significant differences in total scores across categories (P = .020).
CONCLUSION Our results suggest that while ChatGPT-4 is reliable in categories with structured and direct queries, it shows limitations when handling complex medical queries that require in-depth explanations or clinical judgment. ChatGPT-4 demonstrates promising potential as a tool for patient-focused informing in atrial fibrillation, particularly in straightforward informing content.
Pages: 58-63
Number of pages: 6
Related papers
12 in total
  • [1] Evaluating ChatGPT-4's Diagnostic Accuracy: Impact of Visual Data Integration
    Hirosawa, Takanobu
    Harada, Yukinori
    Tokumasu, Kazuki
    Ito, Takahiro
    Suzuki, Tomoharu
    Shimizu, Taro
    JMIR MEDICAL INFORMATICS, 2024, 12
  • [2] Evaluating ChatGPT-4's performance as a digital health advisor for otosclerosis surgery
    Sahin, Samil
    Erkmen, Burak
    Duymaz, Yasar Kemal
    Bayram, Furkan
    Tekin, Ahmet Mahmut
    Topsakal, Vedat
    FRONTIERS IN SURGERY, 2024, 11
  • [3] Revolutionizing Diagnostics: Evaluating ChatGPT-4's Performance in Ulcerative Colitis Endoscopic Assessment
    Levartovsky, A.
    Albshesh, A.
    Grinman, A.
    Shachar, E.
    Lahat, A.
    Eliakim, R.
    Kopylov, U.
    JOURNAL OF CROHNS & COLITIS, 2025, 19 : I748 - I748
  • [4] Evaluating ChatGPT-4's historical accuracy: a case study on the origins of SWOT analysis
    Puyt, Richard W.
    Madsen, Dag Øivind
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2024, 7
  • [5] Evaluating ChatGPT-4 Vision on Brazil's National Undergraduate Computer Science Exam
    Mendonca, Nabor C.
    ACM TRANSACTIONS ON COMPUTING EDUCATION, 2024, 24 (03):
  • [6] Evaluating ChatGPT-4's performance on oral and maxillofacial queries: Chain of Thought and standard method
    Ji, Kaiyuan
    Wu, Zhihan
    Han, Jing
    Zhai, Guangtao
    Liu, Jiannan
    FRONTIERS IN ORAL HEALTH, 2025, 6
  • [7] Evaluating ChatGPT-4's Performance in Identifying Radiological Anatomy in FRCR Part 1 Examination Questions
    Sarangi, Pradosh Kumar
    Datta, Suvrankar
    Panda, Braja Behari
    Panda, Swaha
    Mondal, Himel
    INDIAN JOURNAL OF RADIOLOGY AND IMAGING, 2024,
  • [8] Evaluating Artificial Intelligence Efficacy: A Comparative Study between ChatGPT-4's Treatment Recommendations and Orthopaedic Clinical Practice Guidelines
    Dagher, Tanios
    Dwyer, Emma
    Baker, Hayden P.
    Kalidoss, Senthooran
    Strelzow, Jason
    JOURNAL OF THE AMERICAN COLLEGE OF SURGEONS, 2024, 239 (05) : S325 - S326
  • [9] Evaluating AI Capabilities in Bariatric Surgery: A Study on ChatGPT-4 and DALL·E 3's Recognition and Illustration Accuracy
    Mahjoubi, Mohammad
    Shahabi, Shahab
    Sheikhbahaei, Saba
    Jazi, Amir Hossein Davarpanah
    OBESITY SURGERY, 2025, 35 (02) : 638 - 641
  • [10] AI-driven patient support: Evaluating the effectiveness of ChatGPT-4 in addressing queries about ovarian cancer compared with healthcare professionals in gynecologic oncology
    Chou, Hung-Hsueh
    Chen, Yi Hua
    Lin, Chiu-Tzu
    Chang, Hsien-Tsung
    Wu, An-Chieh
    Tsai, Jia-Ling
    Chen, Hsiao-Wei
    Hsu, Ching-Chun
    Liu, Shu-Ya
    Lee, Jian Tao
    SUPPORTIVE CARE IN CANCER, 2025, 33 (04)