Entity and relation extraction from clinical case reports of COVID-19: a natural language processing approach

被引:7
|
作者
Raza, Shaina [1 ,2 ]
Schwartz, Brian [1 ,2 ]
机构
[1] Publ Hlth Ontario PHO, Toronto, ON, Canada
[2] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON, Canada
关键词
Natural language processing; Data cohort; COVID-19; Named entity; Relation extraction; Transfer learning; Artificial intelligence; RECOGNITION;
D O I
10.1186/s12911-023-02117-3
中图分类号
R-058 [];
学科分类号
摘要
BackgroundExtracting relevant information about infectious diseases is an essential task. However, a significant obstacle in supporting public health research is the lack of methods for effectively mining large amounts of health data.ObjectiveThis study aims to use natural language processing (NLP) to extract the key information (clinical factors, social determinants of health) from published cases in the literature.MethodsThe proposed framework integrates a data layer for preparing a data cohort from clinical case reports; an NLP layer to find the clinical and demographic-named entities and relations in the texts; and an evaluation layer for benchmarking performance and analysis. The focus of this study is to extract valuable information from COVID-19 case reports.ResultsThe named entity recognition implementation in the NLP layer achieves a performance gain of about 1-3% compared to benchmark methods. Furthermore, even without extensive data labeling, the relation extraction method outperforms benchmark methods in terms of accuracy (by 1-8% better). A thorough examination reveals the disease's presence and symptoms prevalence in patients.ConclusionsA similar approach can be generalized to other infectious diseases. It is worthwhile to use prior knowledge acquired through transfer learning when researching other infectious diseases.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] NATURAL LANGUAGE PROCESSING OF ESOPHAGOGASTRODUODENOSCOPY REPORTS FOR INFORMATION EXTRACTION OF GASTRIC DISEASES
    Bae, Jung Ho
    Han, Hyun Wook
    Song, Gyuseon
    GASTROINTESTINAL ENDOSCOPY, 2022, 95 (06) : AB247 - AB248
  • [42] Predicting Recovery Status of COVID-19 Vaccinated Patients with Natural Language Processing Approaches
    Jiang, Xiangxiang
    Lv, Gang
    Li, Sam
    Lu, Kevin
    PHARMACOEPIDEMIOLOGY AND DRUG SAFETY, 2024, 33 : 502 - 502
  • [43] Leveraging Natural Language Processing to Mine Issues on Twitter During the COVID-19 Pandemic
    Agarwal, Ankita
    Salehundam, Preetham
    Padhee, Swati
    Romine, William L.
    Banerjee, Tanvi
    2020 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2020, : 886 - 891
  • [44] The impact of the learning shift during COVID-19 on students using natural language processing
    Shaiba, Hadil
    John, Maya
    INTERNATIONAL JOURNAL OF TECHNOLOGY ENHANCED LEARNING, 2023, 15 (02) : 195 - 214
  • [45] COVID-Twitter-BERT: A natural language processing model to analyse COVID-19 content on Twitter
    Müller, Martin
    Salathe, Marcel
    Kummervold, Per E.
    FRONTIERS IN ARTIFICIAL INTELLIGENCE, 2023, 6
  • [46] Case Reports: Rhabdomyolysis Associated with COVID-19
    Singh, Balraj
    Kaur, Parminder
    Reid, Ro-Jay Romor
    AMERICAN FAMILY PHYSICIAN, 2020, 102 (11) : 645 - 648
  • [49] Using Local Grammar for Entity Extraction from Clinical Reports
    Ghoulam, Aicha
    Barigou, Fatiha
    Belalem, Ghalem
    Meziane, Farid
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2015, 3 (03): : 16 - 24
  • [50] Identifying COVID-19 cases and extracting patient reported symptoms from Reddit using natural language processing
    Guo, Muzhe
    Ma, Yong
    Eworuke, Efe
    Khashei, Melissa
    Song, Jaejoon
    Zhao, Yueqin
    Jin, Fang
    SCIENTIFIC REPORTS, 2023, 13 (01)