Development and Validation of a Natural Language Processing Algorithm to Extract Descriptors of Microbial Keratitis From the Electronic Health Record

被引:3
|
作者
Woodward, Maria A. [1 ,2 ]
Maganti, Nenita [1 ,3 ]
Niziol, Leslie M. [1 ]
Amin, Sejal [4 ]
Hou, Andrew [4 ]
Singh, Karandeep [2 ,5 ,6 ]
机构
[1] Univ Michigan, WK Kellogg Eye Ctr, Dept Ophthalmol & Visual Sci, 1000 Wall St, Ann Arbor, MI 48105 USA
[2] Univ Michigan, Inst Healthcare Policy & Innovat, Ann Arbor, MI 48105 USA
[3] Northwestern Univ, Feinberg Sch Med, Chicago, IL 60611 USA
[4] Henry Ford Hlth Syst, Dept Ophthalmol, Detroit, MI USA
[5] Univ Michigan, Dept Learning Hlth Syst, Ann Arbor, MI 48109 USA
[6] Univ Michigan, Dept Internal Med, Ann Arbor, MI 48109 USA
基金
美国国家卫生研究院;
关键词
cornea; microbial keratitis; measurement; Natural Language Processing; electronic health record; BACTERIAL KERATITIS; CORNEAL ULCERS; SYSTEMS; DISEASE;
D O I
10.1097/ICO.0000000000002755
中图分类号
R77 [眼科学];
学科分类号
100212 ;
摘要
Purpose: The purpose of this article was to develop and validate a natural language processing (NLP) algorithm to extract qualitative descriptors of microbial keratitis (MK) from electronic health records. Methods: In this retrospective cohort study, patients with MK diagnoses from 2 academic centers were identified using electronic health records. An NLP algorithm was created to extract MK centrality, depth, and thinning. A random sample of patient with MK encounters were used to train the algorithm (400 encounters of 100 patients) and compared with expert chart review. The algorithm was evaluated in internal (n = 100) and external validation data sets (n = 59) in comparison with masked chart review. Outcomes were sensitivity and specificity of the NLP algorithm to extract qualitative MK features as compared with masked chart review performed by an ophthalmologist. Results: Across data sets, gold-standard chart review found centrality was documented in 64.0% to 79.3% of charts, depth in 15.0% to 20.3%, and thinning in 25.4% to 31.3%. Compared with chart review, the NLP algorithm had a sensitivity of 80.3%, 50.0%, and 66.7% for identifying central MK, 85.4%, 66.7%, and 100% for deep MK, and 100.0%, 95.2%, and 100% for thin MK, in the training, internal, and external validation samples, respectively. Specificity was 41.1%, 38.6%, and 46.2% for centrality, 100%, 83.3%, and 71.4% for depth, and 93.3%, 100%, and was not applicable (n = 0) to the external data for thinning, in the samples, respectively. Conclusions: MK features are not documented consistently showing a lack of standardization in recording MK examination elements. NLP shows promise but will be limited if the available clinical data are missing from the chart.
引用
收藏
页码:1548 / 1553
页数:6
相关论文
共 50 条
  • [1] Development of a natural language processing algorithm to extract seizure types and frequencies from the electronic health record
    Decker, Barbara M.
    Turco, Alexandra
    Xu, Jian
    Terman, Samuel W.
    Kosaraju, Nikitha
    Jamil, Alisha
    Davis, Kathryn A.
    Litt, Brian
    Ellis, Colin A.
    Khankhanian, Pouya
    Hill, Chloe E.
    SEIZURE-EUROPEAN JOURNAL OF EPILEPSY, 2022, 101 : 48 - 51
  • [2] Performance of a Natural Language Processing Method to Extract Stone Composition From the Electronic Health Record
    Bejan, Cosmin A.
    Lee, Daniel J.
    Xu, Yaomin
    Hsi, Ryan S.
    UROLOGY, 2019, 132 : 56 - 62
  • [3] Developing Natural Language Processing to Extract Complementary and Integrative Health Information from Electronic Health Record Data
    Zhou, Huixue
    2022 IEEE 10TH INTERNATIONAL CONFERENCE ON HEALTHCARE INFORMATICS (ICHI 2022), 2022, : 474 - 475
  • [4] DEVELOPMENT OF A NATURAL LANGUAGE PROCESSING ALGORITHM TO IDENTIFY AND EVALUATE TRANSGENDER PATIENTS IN ELECTRONIC HEALTH RECORD SYSTEMS
    Ehrenfelt, Jesse M.
    Gottlieb, Keanan Gabriel
    Beach, Lauren Brittany
    Monahan, Shelby E.
    Fabbri, Daniel
    ETHNICITY & DISEASE, 2019, 29 : 441 - 450
  • [5] Natural Language Processing and Machine Learning for Identifying Incident Stroke From Electronic Health Records: Algorithm Development and Validation
    Zhao, Yiqing
    Fu, Sunyang
    Bielinski, Suzette J.
    Decker, Paul A.
    Chamberlain, Alanna M.
    Roger, Veronique L.
    Liu, Hongfang
    Larson, Nicholas B.
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2021, 23 (03)
  • [6] Natural language processing and machine learning to identify alcohol misuse from the electronic health record in trauma patients: development and internal validation
    Afshar, Majid
    Phillips, Andrew
    Karnik, Niranjan
    Mueller, Jeanne
    To, Daniel
    Gonzalez, Richard
    Price, Ron
    Cooper, Richard
    Joyce, Cara
    Dligach, Dmitriy
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2019, 26 (03) : 254 - 261
  • [7] Natural Language Processing to Quantify Microbial Keratitis Measurements
    Maganti, Nenita
    Tan, Huan
    Niziol, Leslie M.
    Amin, Sejal
    Hou, Andrew
    Singh, Karandeep
    Ballouz, Dena
    Woodward, Maria A.
    OPHTHALMOLOGY, 2019, 126 (12) : 1722 - 1724
  • [8] Classifying Individuals With Rheumatic Conditions as Financially Insecure Using Electronic Health Record Data and Natural Language Processing: Algorithm Derivation and Validation
    Chandler, Mia T.
    Cai, Tianrun
    Santacroce, Leah
    Ulysse, Sciaska
    Liao, Katherine P.
    Feldman, Candace H.
    ACR OPEN RHEUMATOLOGY, 2024, 6 (08) : 481 - 488
  • [9] Classifying Individuals with Rheumatic Conditions as Financially Insecure Using Electronic Health Record Data and Natural Language Processing: Algorithm Derivation and Validation
    Chandler, Mia
    Cai, Tianrun
    Santacroce, Leah
    Ulysse, Sciaska
    Liao, Katherine
    Feldman, Candace
    ARTHRITIS & RHEUMATOLOGY, 2023, 75 : 351 - 353
  • [10] Natural language processing to identify substance misuse in the electronic health record
    Riddick, Tyne A.
    Choo, Esther K.
    LANCET DIGITAL HEALTH, 2022, 4 (06): : E401 - E402