Identifying signs and symptoms of urinary tract infection from emergency department clinical notes using large language models

Cited by: 4
Authors
Iscoe, Mark [1, 2]
Socrates, Vimig [2, 3]
Gilson, Aidan [4]
Chi, Ling [5]
Li, Huan [3]
Huang, Thomas [4]
Kearns, Thomas [1]
Perkins, Rachelle [1]
Khandjian, Laura [1]
Taylor, R. Andrew [1, 2]
Affiliations
[1] Yale Sch Med, Dept Emergency Med, New Haven, CT 06519 USA
[2] Yale Univ, Sch Med, Sect Biomed Informat & Data Sci, New Haven, CT USA
[3] Yale Univ, Program Computat Biol & Bioinformat, New Haven, CT USA
[4] Yale Sch Med, New Haven, CT 06519 USA
[5] Yale Sch Publ Hlth, Dept Biostat, New Haven, CT USA
Keywords
emergency medicine; infectious diseases; informatics; large language models; named entity recognition; natural language processing; urinary tract infection; INFORMATION; AGREEMENT; CARE; EXTRACTION; MANAGEMENT; ACCURACY; CRITERIA
DOI
10.1111/acem.14883
Chinese Library Classification number
R4 [Clinical Medicine]
Discipline classification code
1002; 100602
Abstract
Background: Natural language processing (NLP) tools including recently developed large language models (LLMs) have myriad potential applications in medical care and research, including the efficient labeling and classification of unstructured text such as electronic health record (EHR) notes. This opens the door to large-scale projects that rely on variables that are not typically recorded in a structured form, such as patient signs and symptoms.
Objectives: This study is designed to acquaint the emergency medicine research community with the foundational elements of NLP, highlighting essential terminology, annotation methodologies, and the intricacies involved in training and evaluating NLP models. Symptom characterization is critical to urinary tract infection (UTI) diagnosis, but identification of symptoms from the EHR has historically been challenging, limiting large-scale research, public health surveillance, and EHR-based clinical decision support. We therefore developed and compared two NLP models to identify UTI symptoms from unstructured emergency department (ED) notes.
Methods: The study population consisted of patients aged >= 18 who presented to an ED in a northeastern U.S. health system between June 2013 and August 2021 and had a urinalysis performed. We annotated a random subset of 1250 ED clinician notes from these visits for a list of 17 UTI symptoms. We then developed two task-specific LLMs to perform the task of named entity recognition: a convolutional neural network-based model (SpaCy) and a transformer-based model designed to process longer documents (Clinical Longformer). Models were trained on 1000 notes and tested on a holdout set of 250 notes. We compared model performance (precision, recall, F1 measure) at identifying the presence or absence of UTI symptoms at the note level.
Results: A total of 8135 entities were identified in 1250 notes; 83.6% of notes included at least one entity. Overall F1 measure for note-level symptom identification weighted by entity frequency was 0.84 for the SpaCy model and 0.88 for the Longformer model. F1 measure for identifying presence or absence of any UTI symptom in a clinical note was 0.96 (232/250 correctly classified) for the SpaCy model and 0.98 (240/250 correctly classified) for the Longformer model.
Conclusions: The study demonstrated the utility of LLMs and transformer-based models in particular for extracting UTI symptoms from unstructured ED clinical notes; models were highly accurate for detecting the presence or absence of any UTI symptom on the note level, with variable performance for individual symptoms.
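The note-level evaluation described in the abstract (precision, recall, and F1 per symptom, then an overall F1 weighted by frequency) can be illustrated with a minimal sketch. This is not the authors' code: the function names `note_level_metrics` and `weighted_f1`, the symptom labels, and the toy notes are hypothetical, and weighting by the number of gold-standard notes containing each symptom is an assumption about what "weighted by entity frequency" means.

```python
def note_level_metrics(gold, pred):
    """Per-symptom precision, recall, and F1 at the note level.

    gold, pred: lists of sets, one set of symptom labels per note
    (hypothetical input format, assumed for this sketch).
    """
    labels = sorted(set().union(*gold, *pred))
    metrics = {}
    for label in labels:
        # A note counts once per symptom, however many mentions it contains.
        tp = sum(1 for g, p in zip(gold, pred) if label in g and label in p)
        fp = sum(1 for g, p in zip(gold, pred) if label not in g and label in p)
        fn = sum(1 for g, p in zip(gold, pred) if label in g and label not in p)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        metrics[label] = {
            "precision": precision,
            "recall": recall,
            "f1": f1,
            "support": tp + fn,  # gold notes containing this symptom
        }
    return metrics


def weighted_f1(metrics):
    """Average of per-symptom F1, weighted by support (an assumed weighting)."""
    total = sum(m["support"] for m in metrics.values())
    if total == 0:
        return 0.0
    return sum(m["f1"] * m["support"] for m in metrics.values()) / total


if __name__ == "__main__":
    # Toy data, invented purely for illustration.
    gold = [{"dysuria", "fever"}, {"dysuria"}, set(), {"fever"}]
    pred = [{"dysuria"}, {"dysuria", "fever"}, set(), {"fever"}]
    per_symptom = note_level_metrics(gold, pred)
    for name, m in per_symptom.items():
        print(name, m)
    print("weighted F1:", weighted_f1(per_symptom))
```

Collapsing each note to a set of symptom labels reflects the abstract's note-level framing, in which repeated mentions of the same symptom within one note do not change the score.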
Pages: 599-610
Number of pages: 12