Development and Validation of a Natural Language Processing Algorithm to Extract Descriptors of Microbial Keratitis From the Electronic Health Record

被引:3
|
作者
Woodward, Maria A. [1 ,2 ]
Maganti, Nenita [1 ,3 ]
Niziol, Leslie M. [1 ]
Amin, Sejal [4 ]
Hou, Andrew [4 ]
Singh, Karandeep [2 ,5 ,6 ]
机构
[1] Univ Michigan, WK Kellogg Eye Ctr, Dept Ophthalmol & Visual Sci, 1000 Wall St, Ann Arbor, MI 48105 USA
[2] Univ Michigan, Inst Healthcare Policy & Innovat, Ann Arbor, MI 48105 USA
[3] Northwestern Univ, Feinberg Sch Med, Chicago, IL 60611 USA
[4] Henry Ford Hlth Syst, Dept Ophthalmol, Detroit, MI USA
[5] Univ Michigan, Dept Learning Hlth Syst, Ann Arbor, MI 48109 USA
[6] Univ Michigan, Dept Internal Med, Ann Arbor, MI 48109 USA
基金
美国国家卫生研究院;
关键词
cornea; microbial keratitis; measurement; Natural Language Processing; electronic health record; BACTERIAL KERATITIS; CORNEAL ULCERS; SYSTEMS; DISEASE;
D O I
10.1097/ICO.0000000000002755
中图分类号
R77 [眼科学];
学科分类号
100212 ;
摘要
Purpose: The purpose of this article was to develop and validate a natural language processing (NLP) algorithm to extract qualitative descriptors of microbial keratitis (MK) from electronic health records. Methods: In this retrospective cohort study, patients with MK diagnoses from 2 academic centers were identified using electronic health records. An NLP algorithm was created to extract MK centrality, depth, and thinning. A random sample of patient with MK encounters were used to train the algorithm (400 encounters of 100 patients) and compared with expert chart review. The algorithm was evaluated in internal (n = 100) and external validation data sets (n = 59) in comparison with masked chart review. Outcomes were sensitivity and specificity of the NLP algorithm to extract qualitative MK features as compared with masked chart review performed by an ophthalmologist. Results: Across data sets, gold-standard chart review found centrality was documented in 64.0% to 79.3% of charts, depth in 15.0% to 20.3%, and thinning in 25.4% to 31.3%. Compared with chart review, the NLP algorithm had a sensitivity of 80.3%, 50.0%, and 66.7% for identifying central MK, 85.4%, 66.7%, and 100% for deep MK, and 100.0%, 95.2%, and 100% for thin MK, in the training, internal, and external validation samples, respectively. Specificity was 41.1%, 38.6%, and 46.2% for centrality, 100%, 83.3%, and 71.4% for depth, and 93.3%, 100%, and was not applicable (n = 0) to the external data for thinning, in the samples, respectively. Conclusions: MK features are not documented consistently showing a lack of standardization in recording MK examination elements. NLP shows promise but will be limited if the available clinical data are missing from the chart.
引用
收藏
页码:1548 / 1553
页数:6
相关论文
共 50 条
  • [31] A Multi-Institutional Natural Language Processing Pipeline to Extract Performance Status From Electronic Health Records
    Maghsoudi, Arash
    Sada, Yvonne H.
    Nowakowski, Sara
    Guffey, Danielle
    Zhu, Huili
    Yarlagadda, Sudha R.
    Li, Ang
    Razjouyan, Javad
    CANCER CONTROL, 2024, 31
  • [32] Development and Validation of an Algorithm to Identify Prostate Cancer Related Mortality in Electronic Medical Records Using Natural Language Processing
    DiBello, Julia R.
    Wallner, Lauren P.
    Zheng, Chengyi
    Yu, Wei
    Li, Bonnie H.
    VanDenEeden, Stephen K.
    Weinmann, Sheila
    Ritzwoller, Debra
    Richert-Boe, Kathryn
    Jacobsen, Stephen J.
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2015, 24 : 418 - 419
  • [33] DEVELOPMENT AND VALIDATION OF A NATURAL LANGUAGE PROCESSING TOOL TO EXTRACT STEATOHEPATITIS DIAGNOSES FROM LIVER BIOPSY PATHOLOGY REPORTS
    Lynch, Julie
    Mukherjee, Samiran
    Chang, Kyong-Mi
    Pridgen, Kathryn
    Alba, Patrick
    HEPATOLOGY, 2024, 80 : S466 - S467
  • [34] Using natural language processing to identify opioid use disorder in electronic health record data
    Singleton, Jade
    Li, Chengxi
    Akpunonu, Peter D.
    Abner, Erin L.
    Kucharska-Newton, Anna M.
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2023, 170
  • [35] Natural Language Processing and Inference Rules as Strategies for Updating Problem List in an Electronic Health Record
    Plazzotta, Fernando
    Otero, Carlos
    Luna, Daniel
    Bernaldo de Quiros, Fernan Gonzalez
    MEDINFO 2013: PROCEEDINGS OF THE 14TH WORLD CONGRESS ON MEDICAL AND HEALTH INFORMATICS, PTS 1 AND 2, 2013, 192 : 1163 - 1163
  • [36] Logic-Based Natural Language Processing Identifies VT Recurrence in the Electronic Health Record
    Brennan, Kelly A.
    Feng, Ruibin
    Goyal, Jatin
    Azizi, Zahra
    Baykaner, Tina
    Rogers, Albert J.
    Narayan, Sanjiv M.
    CIRCULATION, 2023, 148
  • [37] Using Natural Language Processing and Machine Learning to Identify Opioids in Electronic Health Record Data
    McDermott, Sean P.
    Wasan, Ajay D.
    JOURNAL OF PAIN RESEARCH, 2023, 16 : 2133 - 2140
  • [38] Natural Language Processing Accurately Differentiates Cancer Symptom Information in Electronic Health Record Narratives
    Albashayreh, Alaa
    Bandyopadhyay, Anindita
    Zeinali, Nahid
    Zhang, Min
    Fan, Weiguo
    White, Stephanie Gilbertson
    JCO CLINICAL CANCER INFORMATICS, 2024, 8
  • [39] Exploring the full potential of the electronic health record: the application of natural language processing for clinical practice
    Van Bulck, Liesbet
    Turchioe, Meghan Reading
    Topaz, Maxim
    Song, Jiyoun
    EUROPEAN JOURNAL OF CARDIOVASCULAR NURSING, 2024,
  • [40] Electronic Medical Record Data Mining and Processing Based on Natural Language Processing
    Zhang, Shichen
    PROCEEDINGS OF 2024 INTERNATIONAL CONFERENCE ON MACHINE INTELLIGENCE AND DIGITAL APPLICATIONS, MIDA2024, 2024, : 212 - 217