Entity and relation extraction from clinical case reports of COVID-19: a natural language processing approach

被引:7
|
作者
Raza, Shaina [1 ,2 ]
Schwartz, Brian [1 ,2 ]
机构
[1] Publ Hlth Ontario PHO, Toronto, ON, Canada
[2] Univ Toronto, Dalla Lana Sch Publ Hlth, Toronto, ON, Canada
关键词
Natural language processing; Data cohort; COVID-19; Named entity; Relation extraction; Transfer learning; Artificial intelligence; RECOGNITION;
D O I
10.1186/s12911-023-02117-3
中图分类号
R-058 [];
学科分类号
摘要
BackgroundExtracting relevant information about infectious diseases is an essential task. However, a significant obstacle in supporting public health research is the lack of methods for effectively mining large amounts of health data.ObjectiveThis study aims to use natural language processing (NLP) to extract the key information (clinical factors, social determinants of health) from published cases in the literature.MethodsThe proposed framework integrates a data layer for preparing a data cohort from clinical case reports; an NLP layer to find the clinical and demographic-named entities and relations in the texts; and an evaluation layer for benchmarking performance and analysis. The focus of this study is to extract valuable information from COVID-19 case reports.ResultsThe named entity recognition implementation in the NLP layer achieves a performance gain of about 1-3% compared to benchmark methods. Furthermore, even without extensive data labeling, the relation extraction method outperforms benchmark methods in terms of accuracy (by 1-8% better). A thorough examination reveals the disease's presence and symptoms prevalence in patients.ConclusionsA similar approach can be generalized to other infectious diseases. It is worthwhile to use prior knowledge acquired through transfer learning when researching other infectious diseases.
引用
收藏
页数:17
相关论文
共 50 条
  • [1] Entity and relation extraction from clinical case reports of COVID-19: a natural language processing approach
    Shaina Raza
    Brian Schwartz
    BMC Medical Informatics and Decision Making, 23
  • [2] Clinical Application of Detecting COVID-19 Risks: A Natural Language Processing Approach
    Bashir, Syed Raza
    Raza, Shaina
    Kocaman, Veysel
    Qamar, Urooj
    VIRUSES-BASEL, 2022, 14 (12):
  • [3] Novel approach by natural language processing for COVID-19 knowledge discovery
    Wang, Li
    Jiang, Lei
    Pan, Dongyan
    Wang, Qinghua
    Yin, Zeyu
    Kang, Zijian
    Tian, Haoran
    Geng, Xuqiang
    Shao, Jinsong
    Pan, Wenjie
    Yin, Jian
    Fang, Li
    Wang, Yue
    Zhang, Weide
    Li, Zhixiu
    Zheng, Jun
    Hu, Wenxin
    Pan, Yunbao
    Yu, Dong
    Guo, Shicheng
    Lu, Wei
    Li, Qiang
    Zhou, Yunyun
    Xu, Huji
    BIOMEDICAL JOURNAL, 2022, 45 (03) : 472 - 481
  • [4] Natural language processing to convert unstructured COVID-19 chest-CT reports into structured reports
    Fanni, Salvatore Claudio
    Romei, Chiara
    Ferrando, Giovanni
    Volpi, Federica
    D'Amore, Caterina Aida
    Bedini, Claudio
    Ubbiali, Sandro
    Valentino, Salvatore
    Neri, Emanuele
    EUROPEAN JOURNAL OF RADIOLOGY OPEN, 2023, 11
  • [5] Obtaining Knowledge in Pathology Reports Through a Natural Language Processing Approach With Classification, Named-Entity Recognition, and Relation-Extraction Heuristics
    Oliwa, Tomasz
    Maron, Steven B.
    Chase, Leah M.
    Lomnicki, Samantha
    Catenacci, Daniel V. T.
    Furner, Brian
    Volchenboum, Samuel L.
    JCO CLINICAL CANCER INFORMATICS, 2019, 3 : 1 - 8
  • [6] Perceived Impact of COVID-19 in an Underserved Community: A Natural Language Processing Approach
    Holmes, Ashleigh
    Sachar, Amanjot Singh
    Chang, Yu-Ping
    JOURNAL OF ADVANCED NURSING, 2024,
  • [7] Research on COVID-19 Text Entity Relation Extraction and Dataset Construction Methods
    Yang, Chongluo
    Sheng, Long
    Wei, Zhongcheng
    Wang, Wei
    Computer Engineering and Applications, 2023, 59 (08) : 97 - 104
  • [8] Clinical named entity recognition and relation extraction using natural language processing of medical free text: A systematic review
    Navarro, David Fraile
    Ijaz, Kiran
    Rezazadegan, Dana
    Rahimi-Ardabili, Hania
    Dras, Mark
    Coiera, Enrico
    Berkovsky, Shlomo
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2023, 177
  • [9] Analysis of Stroke Detection during the COVID-19 Pandemic Using Natural Language Processing of Radiology Reports
    Li, M. D.
    Lang, M.
    Deng, F.
    Chang, K.
    Buch, K.
    Rincon, S.
    Mehan, W. A.
    Leslie-Mazwi, T. M.
    Kalpathy-Cramer, J.
    AMERICAN JOURNAL OF NEURORADIOLOGY, 2021, 42 (03) : 429 - 434
  • [10] Analysis of COVID-19 clinical trials: A data-driven, ontology-based, and natural language processing approach
    Alag, Shray
    PLOS ONE, 2020, 15 (09):