Entity and relation extraction from clinical case reports of COVID-19: a natural language processing approach

被引:7
|
作者
Raza, Shaina [1 ,2 ]
Schwartz, Brian [1 ,2 ]
机构
[1] Publ Hlth Ontario PHO, Toronto, ON, Canada
[2] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON, Canada
关键词
Natural language processing; Data cohort; COVID-19; Named entity; Relation extraction; Transfer learning; Artificial intelligence; RECOGNITION;
D O I
10.1186/s12911-023-02117-3
中图分类号
R-058 [];
学科分类号
摘要
BackgroundExtracting relevant information about infectious diseases is an essential task. However, a significant obstacle in supporting public health research is the lack of methods for effectively mining large amounts of health data.ObjectiveThis study aims to use natural language processing (NLP) to extract the key information (clinical factors, social determinants of health) from published cases in the literature.MethodsThe proposed framework integrates a data layer for preparing a data cohort from clinical case reports; an NLP layer to find the clinical and demographic-named entities and relations in the texts; and an evaluation layer for benchmarking performance and analysis. The focus of this study is to extract valuable information from COVID-19 case reports.ResultsThe named entity recognition implementation in the NLP layer achieves a performance gain of about 1-3% compared to benchmark methods. Furthermore, even without extensive data labeling, the relation extraction method outperforms benchmark methods in terms of accuracy (by 1-8% better). A thorough examination reveals the disease's presence and symptoms prevalence in patients.ConclusionsA similar approach can be generalized to other infectious diseases. It is worthwhile to use prior knowledge acquired through transfer learning when researching other infectious diseases.
引用
收藏
页数:17
相关论文
共 50 条
  • [21] Hemophagocytic lymphohistiocytosis in COVID-19 Case reports of a stepwise approach
    Schnaubelt, Sebastian
    Tihanyi, Daniel
    Strassl, Robert
    Schmidt, Ralf
    Anders, Sonja
    Laggner, Anton N.
    Agis, Hermine
    Domanovits, Hans
    MEDICINE, 2021, 100 (12) : E25170
  • [22] Sex and Gender Bias in Covid-19 Clinical Case Reports
    Salter-Volz, Aysha E. E.
    Oyasu, Abigail
    Yeh, Chen
    Muhammad, Lutfiyya N. N.
    Woitowich, Nicole C. C.
    FRONTIERS IN GLOBAL WOMENS HEALTH, 2021, 2
  • [23] Natural Language Processing Methods and Techniques for Knowledge Extraction from School Reports
    Venturi, Giulia
    Dell'Orletta, Felice
    Montemagni, Simonetta
    Morini, Elettra
    Sagri, Maria Teresa
    CADMO, 2020, (02): : 49 - +
  • [24] A NATURAL LANGUAGE PROCESSING SYSTEM TO EXTRACT COVID-19 SYMPTOMS FROM ELECTRONIC HEALTH RECORDS
    Chen, Jinying
    Fukunaga, Mayuko Ito
    Jones, Evan
    Balakrishnan, Kavitha
    Cutrona, Sarah
    JOURNAL OF GENERAL INTERNAL MEDICINE, 2021, 36 (SUPPL 1) : S2 - S2
  • [25] COVID-19: review of case reports
    Oda, Yutaka
    JOURNAL OF ANESTHESIA, 2021, 35 (03) : 337 - 340
  • [26] COVID-19: review of case reports
    Yutaka Oda
    Journal of Anesthesia, 2021, 35 : 337 - 340
  • [27] Artificial Intelligence in Action: Addressing the COVID-19 Pandemic with Natural Language Processing
    Chen, Qingyu
    Leaman, Robert
    Allot, Alexis
    Luo, Ling
    Wei, Chih-Hsuan
    Yan, Shankai
    Lu, Zhiyong
    ANNUAL REVIEW OF BIOMEDICAL DATA SCIENCE, VOL 4, 2021, 4 : 313 - 339
  • [28] Concept Relation Extraction from Construction Documents Using Natural Language Processing
    Al Qady, Mohammed
    Kandil, Amr
    JOURNAL OF CONSTRUCTION ENGINEERING AND MANAGEMENT, 2010, 136 (03) : 294 - 302
  • [29] Combat COVID-19 infodemic using explainable natural language processing models
    Ayoub, Jackie
    Yang, X. Jessie
    Zhou, Feng
    INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (04)
  • [30] Unsupervised natural language processing in the identification of patients with suspected COVID-19 infection
    da Silva, Rildo Pinto
    Pollettini, Juliana Tarossi
    Pazin Filho, Antonio
    CADERNOS DE SAUDE PUBLICA, 2023, 39 (11):