A Methodological Approach to Validate Pneumonia Encounters from Radiology Reports Using Natural Language Processing

被引:4
|
作者
Panny, AlokSagar [1 ]
Hegde, Harshad [1 ]
Glurich, Ingrid [1 ]
Scannapieco, Frank A. [2 ]
Vedre, Jayanth G. [3 ]
VanWormer, Jeffrey J. [4 ]
Miecznikowski, Jeffrey [5 ]
Acharya, Amit [1 ,6 ]
机构
[1] Marshfield Clin Res Inst, Ctr Oral Syst Hlth, Marshfield, WI USA
[2] SUNY Buffalo, Sch Dent Med, Dept Oral Biol, Buffalo, NY USA
[3] Marshfield Clin Hlth Syst, Dept Crit Care Med, Marshfield, WI USA
[4] Marshfield Clin Res Inst, Ctr Clin Epidemiol & Populat Hlth, Marshfield, WI USA
[5] SUNY Buffalo, Sch Publ Hlth & Hlth Profess, Dept Biostat, Buffalo, NY USA
[6] Advocate Aurora Hlth, Advocate Aurora Res Inst, Downers Grove, IL 60515 USA
基金
美国国家卫生研究院;
关键词
pneumonia; natural language processing; knowledge bases;
D O I
10.1055/a-1817-7008
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Introduction Pneumonia is caused by microbes that establish an infectious process in the lungs. The gold standard for pneumonia diagnosis is radiologist-documented pneumonia-related features in radiology notes that are captured in electronic health records in an unstructured format. Objective The study objective was to develop a methodological approach for assessing validity of a pneumonia diagnosis based on identifying presence or absence of key radiographic features in radiology reports with subsequent rendering of diagnostic decisions into a structured format. Methods A pneumonia-specific natural language processing (NLP) pipeline was strategically developed applying Clinical Text Analysis and Knowledge Extraction System (cTAKES) to validate pneumonia diagnoses following development of a pneumonia feature-specific lexicon. Radiographic reports of study-eligible subjects identified by International Classification of Diseases (ICD) codes were parsed through the NLP pipeline. Classification rules were developed to assign each pneumonia episode into one of three categories: "positive," "negative," or "not classified: requires manual review" based on tagged concepts that support or refute diagnostic codes. Results A total of 91,998 pneumonia episodes diagnosed in 65,904 patients were retrieved retrospectively. Approximately 89% (81,707/91,998) of the total pneumonia episodes were documented by 225,893 chest X-ray reports. NLP classified and validated 33% (26,800/81,707) of pneumonia episodes classified as "Pneumonia-positive," 19% as (15401/81,707) as "Pneumonia-negative," and 48% (39,209/81,707) as "episode classification pending further manual review." NLP pipeline performance metrics included accuracy (76.3%), sensitivity (88%), and specificity (75%). Conclusion The pneumonia-specific NLP pipeline exhibited good performance comparable to other pneumonia-specific NLP systems developed to date.
引用
收藏
页码:38 / 45
页数:8
相关论文
共 50 条
  • [31] Natural Language Processing to Identify Pulmonary Nodules and Extract Nodule Characteristics From Radiology Reports
    Zheng, Chengyi
    Huang, Brian Z.
    Agazaryan, Andranik A.
    Creekmur, Beth
    Osuj, Thearis A.
    Gould, Michael K.
    CHEST, 2021, 160 (05) : 1902 - 1914
  • [32] Identifying Erosive Disease from Radiology Reports of Veterans with Inflammatory Arthritis with Natural Language Processing
    Penmetsa, Gopi
    Walsh, Jessica
    Pei, Shaobo
    Sauer, Brian
    Douglas, Kevin
    Walker, Jodi
    Clewell, Jerry
    Feng, Bingjian
    ARTHRITIS & RHEUMATOLOGY, 2021, 73 : 322 - 324
  • [33] Natural Language Processing of Radiology Reports in Patients With Hepatocellular Carcinoma to Predict Radiology Resource Utilization
    Brown, A. D.
    Kachura, J. R.
    JOURNAL OF THE AMERICAN COLLEGE OF RADIOLOGY, 2019, 16 (06) : 840 - 844
  • [34] Use of Natural Language Processing to Infer Sites of Metastatic Disease From Radiology Reports at Scale
    Tay, See Boon
    Low, Guat Hwa
    Wong, Gillian Jing En
    Tey, Han Jieh
    Leong, Fun Loon
    Li, Constance
    Chua, Melvin Lee Kiang
    Tan, Daniel Shao Weng
    Thng, Choon Hua
    Tan, Iain Bee Huat
    Tan, Ryan Shea Ying Cong
    JCO CLINICAL CANCER INFORMATICS, 2024, 8
  • [35] Natural language processing and foreign language learning: Methodological approach from the performance of a linguistic task
    Canas, Alejandro Ramirez
    CUADERNO ACTIVA, 2022, (14): : 43 - 63
  • [36] Quality Management of Pulmonary Nodule Radiology Reports Based on Natural Language Processing
    Fei, Xiaolu
    Chen, Pengyu
    Wei, Lan
    Huang, Yue
    Xin, Yi
    Li, Jia
    BIOENGINEERING-BASEL, 2022, 9 (06):
  • [37] Machine learning based natural language processing of radiology reports in orthopaedic trauma
    Olthof, A. W.
    Shouche, P.
    Fennema, E. M.
    IJpma, F. F. A.
    Koolstra, R. H. C.
    Stirler, V. M. A.
    van Ooijen, P. M. A.
    Cornelissen, L. J.
    COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2021, 208
  • [38] Natural language processing to predict isocitrate dehydrogenase genotype in diffuse glioma using MR radiology reports
    Kim, Minjae
    Ong, Kai Tzu-iunn
    Choi, Seonah
    Yeo, Jinyoung
    Kim, Sooyon
    Han, Kyunghwa
    Park, Ji Eun
    Kim, Ho Sung
    Choi, Yoon Seong
    Ahn, Sung Soo
    Kim, Jinna
    Lee, Seung-Koo
    Sohn, Beomseok
    EUROPEAN RADIOLOGY, 2023, 33 (11) : 8017 - 8025
  • [39] Natural language processing to predict isocitrate dehydrogenase genotype in diffuse glioma using MR radiology reports
    Minjae Kim
    Kai Tzu-iunn Ong
    Seonah Choi
    Jinyoung Yeo
    Sooyon Kim
    Kyunghwa Han
    Ji Eun Park
    Ho Sung Kim
    Yoon Seong Choi
    Sung Soo Ahn
    Jinna Kim
    Seung-Koo Lee
    Beomseok Sohn
    European Radiology, 2023, 33 : 8017 - 8025
  • [40] Review of Natural Language Processing in Radiology
    Luo, Jack W.
    Chong, Jaron J. R.
    NEUROIMAGING CLINICS OF NORTH AMERICA, 2020, 30 (04) : 447 - +