Evaluating base and retrieval augmented LLMs with document or online support for evidence based neurology

被引:0
|
作者
Masanneck, Lars [1 ]
Meuth, Sven G.
Pawlitzki, Marc
机构
[1] Heinrich Heine Univ Dusseldorf, Med Fac, Dept Neurol, Dusseldorf, Germany
来源
NPJ DIGITAL MEDICINE | 2025年 / 8卷 / 01期
关键词
D O I
10.1038/s41746-025-01536-y
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Effectively managing evidence-based information is increasingly challenging. This study tested large language models (LLMs), including document- and online-enabled retrieval-augmented generation (RAG) systems, using 13 recent neurology guidelines across 130 questions. Results showed substantial variability. RAG improved accuracy compared to base models but still produced potentially harmful answers. RAG-based systems performed worse on case-based than knowledge-based questions. Further refinement and improved regulation is needed for safe clinical integration of RAG-enhanced LLMs.
引用
收藏
页数:5
相关论文
共 50 条
  • [31] Content-based medical retrieval systems with evidence-based diagnosis for enhanced clinical decision support
    Karthik, K.
    Kamath, S. Sowmya
    Supreetha, R.
    Katlam, Ashish
    EXPERT SYSTEMS WITH APPLICATIONS, 2025, 272
  • [32] An Online Community of Practice to Support Evidence-Based Physiotherapy Practice in Manual Therapy
    Evans, Cathy
    Yeung, Euson
    Markoulakis, Roula
    Guilcher, Sara
    JOURNAL OF CONTINUING EDUCATION IN THE HEALTH PROFESSIONS, 2014, 34 (04) : 215 - 223
  • [33] Evaluating the effectiveness of an evidence-based online training program for health professionals in eating disorders
    Sarah Maguire
    Ang Li
    Michelle Cunich
    Danielle Maloney
    Journal of Eating Disorders, 7
  • [34] Evaluating the effectiveness of an evidence-based online training program for health professionals in eating disorders
    Maguire, Sarah
    Li, Ang
    Cunich, Michelle
    Maloney, Danielle
    JOURNAL OF EATING DISORDERS, 2019, 7 (1)
  • [35] A Dynamic-Selection-Based, Retrieval-Augmented Generation Framework: Enhancing Multi-Document Question-Answering for Commercial Applications
    Kwon, Mincheol
    Bang, Jimin
    Hwang, Seyoung
    Jang, Junghoon
    Lee, Woosin
    ELECTRONICS, 2025, 14 (04):
  • [36] An Approach to Evaluating Multisector Partnerships to Support Evidence-Based Quality Improvement in Primary Care
    McHugh, Megan
    Philbin, Sarah
    Carroll, Allison J.
    Vu, My H.
    Ciolino, Jody D.
    Maki, Bruce
    Day, Anya
    Smith, Justin D.
    Walunas, Theresa
    JOINT COMMISSION JOURNAL ON QUALITY AND PATIENT SAFETY, 2023, 49 (04): : 199 - 206
  • [37] Evidence-based development of online peer support for caregivers: Co-creating success
    Teed, Moira
    Arthur, Gavin
    Takacs, Judit
    Jacques, Stephanie
    Tardif, Andreane
    Young, Colleen
    Hamilton-Page, Michelle
    INTERNATIONAL JOURNAL OF STROKE, 2019, 14 (3_SUPPL) : 20 - 20
  • [38] Evaluating Evidence-Based Nutrition Support Practice Among Healthcare Professionals With and Without the Certified Nutrition Support Clinician Credential
    Brody, Rebecca
    Hise, Mary
    Marcus, Andrea Fleisch
    Harvey-Banchik, Lillian
    Matarese, Laura E.
    JOURNAL OF PARENTERAL AND ENTERAL NUTRITION, 2016, 40 (01) : 107 - 114
  • [39] Instruments evaluating child outcomes used in evidence-based family support programs: A scoping review
    Uka, Ana
    Stefanek, Elisabeth
    Skuciene, Daiva
    Schneckenreiter, Carmen
    Spiel, Georg
    CHILDREN AND YOUTH SERVICES REVIEW, 2024, 166
  • [40] Providing Evidence-Based, Intelligent Support for Flood Resilient Planning and Policy: The PEARL Knowledge Base
    Karavokiros, George
    Lykou, Archontia
    Koutiva, Ifigenia
    Batica, Jelena
    Kostaridis, Antonis
    Alves, Alida
    Makropoulos, Christos
    WATER, 2016, 8 (09):