Extracting cancer concepts from clinical notes using natural language processing: a systematic review

被引:13
|
作者
Gholipour, Maryam [1 ]
Khajouei, Reza [2 ]
Amiri, Parastoo [1 ]
Gohari, Sadrieh Hajesmaeel [3 ]
Ahmadian, Leila [2 ]
机构
[1] Kerman Univ Med Sci, Student Res Comm, Kerman, Iran
[2] Kerman Univ Med Sci, Fac Management & Med Informat Sci, Dept Hlth Informat Sci, Kerman, Iran
[3] Kerman Univ Med Sci, Inst Futures Studies Hlth, Med Informat Res Ctr, Kerman, Iran
关键词
Neoplasms; Natural language processing; NLP; Machine learning; Terminology; Information system; Systematic review; RADIOLOGY REPORTS; CLASSIFICATION; RETRIEVAL; RECORDS;
D O I
10.1186/s12859-023-05480-0
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundExtracting information from free texts using natural language processing (NLP) can save time and reduce the hassle of manually extracting large quantities of data from incredibly complex clinical notes of cancer patients. This study aimed to systematically review studies that used NLP methods to identify cancer concepts from clinical notes automatically.MethodsPubMed, Scopus, Web of Science, and Embase were searched for English language papers using a combination of the terms concerning "Cancer", "NLP", "Coding", and "Registries" until June 29, 2021. Two reviewers independently assessed the eligibility of papers for inclusion in the review.ResultsMost of the software programs used for concept extraction reported were developed by the researchers (n = 7). Rule-based algorithms were the most frequently used algorithms for developing these programs. In most articles, the criteria of accuracy (n = 14) and sensitivity (n = 12) were used to evaluate the algorithms. In addition, Systematized Nomenclature of Medicine-Clinical Terms (SNOMED-CT) and Unified Medical Language System (UMLS) were the most commonly used terminologies to identify concepts. Most studies focused on breast cancer (n = 4, 19%) and lung cancer (n = 4, 19%).ConclusionThe use of NLP for extracting the concepts and symptoms of cancer has increased in recent years. The rule-based algorithms are well-liked algorithms by developers. Due to these algorithms' high accuracy and sensitivity in identifying and extracting cancer concepts, we suggested that future studies use these algorithms to extract the concepts of other diseases as well.
引用
收藏
页数:16
相关论文
共 50 条
  • [1] Extracting cancer concepts from clinical notes using natural language processing: a systematic review
    Maryam Gholipour
    Reza Khajouei
    Parastoo Amiri
    Sadrieh Hajesmaeel Gohari
    Leila Ahmadian
    BMC Bioinformatics, 24
  • [2] Natural language processing for clinical notes in dentistry: A systematic review
    Pethani, Farhana
    Dunn, Adam G.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2023, 138
  • [3] Natural Language Processing of Clinical Notes on Chronic Diseases: Systematic Review
    Sheikhalishahi, Seyedmostafa
    Miotto, Riccardo
    Dudley, Joel T.
    Lavelli, Alberto
    Rinaldi, Fabio
    Osmani, Venet
    JMIR MEDICAL INFORMATICS, 2019, 7 (02) : 15 - 32
  • [4] Extracting concepts from the software requirements specification using natural language processing
    Kuchta, Jaroslaw
    Padhiyar, Priti
    2018 11TH INTERNATIONAL CONFERENCE ON HUMAN SYSTEM INTERACTION (HSI), 2018, : 443 - 448
  • [5] Identification of pancreatic cancer risk factors from clinical notes using natural language processing
    Sarwal, Dhruv
    Wang, Liwei
    Gandhi, Sonal
    Pour, Elham Sagheb Hossein
    Janssens, Laurens P.
    Delgado, Adriana M.
    Doering, Karen A.
    Mishra, Anup Kumar
    Greenwood, Jason D.
    Liu, Hongfang
    Majumder, Shounak
    PANCREATOLOGY, 2024, 24 (04) : 572 - 578
  • [6] FEASIBILITY OF USING NATURAL LANGUAGE PROCESSING TO EXTRACT CANCER PAIN SCORE FROM CLINICAL NOTES
    Naseri, Hossien
    RADIOTHERAPY AND ONCOLOGY, 2019, 139 : S65 - S65
  • [7] Using natural language processing to automatically extract cancer outcomes data from clinical notes
    Liptrot, Tom
    Karystianis, George
    Nenadic, Goran
    Keane, John
    Livsey, Jacqueline
    Barker-Hewitt, Matthew
    O'Hara, Catherine
    EUROPEAN JOURNAL OF CANCER CARE, 2015, 24 : 11 - 11
  • [8] Extracting social determinants of health from electronic health records using natural language processing: a systematic review
    Patra, Braja G.
    Sharma, Mohit M.
    Vekaria, Veer
    Adekkanattu, Prakash
    Patterson, Olga, V
    Glicksberg, Benjamin
    Lepow, Lauren A.
    Ryu, Euijung
    Biernacka, Joanna M.
    Furmanchuk, Al'ona
    George, Thomas J.
    Hogan, William
    Wu, Yonghui
    Yang, Xi
    Bian, Jiang
    Weissman, Myrna
    Wickramaratne, Priya
    Mann, J. John
    Olfson, Mark
    Campion, Thomas R., Jr.
    Weiner, Mark
    Pathak, Jyotishman
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2021, 28 (12) : 2716 - 2727
  • [9] Extracting Sexual Trauma Mentions from Electronic Medical Notes Using Natural Language Processing
    Divita, Guy
    Brignone, Emily
    Carter, Marjorie E.
    Suo, Ying
    Blais, Rebecca K.
    Samore, Matthew H.
    Fargo, Jamison D.
    Gundlapalli, Adi V.
    MEDINFO 2017: PRECISION HEALTHCARE THROUGH INFORMATICS, 2017, 245 : 351 - 355
  • [10] DeepPhe: A Natural Language Processing System for Extracting Cancer Phenotypes from Clinical Records
    Savova, Guergana K.
    Tseytlin, Eugene
    Finan, Sean
    Castine, Melissa
    Miller, Timothy
    Medvedeva, Olga
    Harris, David
    Hochheiser, Harry
    Lin, Chen
    Chavan, Girish
    Jacobson, Rebecca S.
    CANCER RESEARCH, 2017, 77 (21) : E115 - E118