Extracting cancer concepts from clinical notes using natural language processing: a systematic review

被引:13
|
作者
Gholipour, Maryam [1 ]
Khajouei, Reza [2 ]
Amiri, Parastoo [1 ]
Gohari, Sadrieh Hajesmaeel [3 ]
Ahmadian, Leila [2 ]
机构
[1] Kerman Univ Med Sci, Student Res Comm, Kerman, Iran
[2] Kerman Univ Med Sci, Fac Management & Med Informat Sci, Dept Hlth Informat Sci, Kerman, Iran
[3] Kerman Univ Med Sci, Inst Futures Studies Hlth, Med Informat Res Ctr, Kerman, Iran
关键词
Neoplasms; Natural language processing; NLP; Machine learning; Terminology; Information system; Systematic review; RADIOLOGY REPORTS; CLASSIFICATION; RETRIEVAL; RECORDS;
D O I
10.1186/s12859-023-05480-0
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundExtracting information from free texts using natural language processing (NLP) can save time and reduce the hassle of manually extracting large quantities of data from incredibly complex clinical notes of cancer patients. This study aimed to systematically review studies that used NLP methods to identify cancer concepts from clinical notes automatically.MethodsPubMed, Scopus, Web of Science, and Embase were searched for English language papers using a combination of the terms concerning "Cancer", "NLP", "Coding", and "Registries" until June 29, 2021. Two reviewers independently assessed the eligibility of papers for inclusion in the review.ResultsMost of the software programs used for concept extraction reported were developed by the researchers (n = 7). Rule-based algorithms were the most frequently used algorithms for developing these programs. In most articles, the criteria of accuracy (n = 14) and sensitivity (n = 12) were used to evaluate the algorithms. In addition, Systematized Nomenclature of Medicine-Clinical Terms (SNOMED-CT) and Unified Medical Language System (UMLS) were the most commonly used terminologies to identify concepts. Most studies focused on breast cancer (n = 4, 19%) and lung cancer (n = 4, 19%).ConclusionThe use of NLP for extracting the concepts and symptoms of cancer has increased in recent years. The rule-based algorithms are well-liked algorithms by developers. Due to these algorithms' high accuracy and sensitivity in identifying and extracting cancer concepts, we suggested that future studies use these algorithms to extract the concepts of other diseases as well.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Tobacco use status from clinical notes using Natural Language Processing and rule based algorithm
    Hegde, Harshad
    Shimpi, Neel
    Glurich, Ingrid
    Acharya, Amit
    TECHNOLOGY AND HEALTH CARE, 2018, 26 (03) : 445 - 456
  • [42] Mining peripheral arterial disease cases from narrative clinical notes using natural language processing
    Afzal, Naveed
    Sohn, Sunghwan
    Abram, Sara
    Scott, Christopher G.
    Chaudhry, Rajeev
    Liu, Hongfang
    Kullo, Iftikhar J.
    Arruda-Olson, Adelaide M.
    JOURNAL OF VASCULAR SURGERY, 2017, 65 (06) : 1753 - 1761
  • [43] Using Natural Language Processing and Machine Learning to Identify Gout Flares From Electronic Clinical Notes
    Zheng, Chengyi
    Rashid, Nazia
    Wu, Yi-Lin
    Koblick, River
    Lin, Antony T.
    Levy, Gerald D.
    Cheetham, T. Craig
    ARTHRITIS CARE & RESEARCH, 2014, 66 (11) : 1740 - 1748
  • [44] Characterisation of digital therapeutic clinical trials: a systematic review with natural language processing
    Miao, Brenda Y.
    Sushil, Madhumita
    Xu, Ava
    Wang, Michelle
    Arneson, Douglas
    Berkley, Ellen
    Subash, Meera
    Vashisht, Rohit
    Rudrapatna, Vivek
    Butte, Atul J.
    LANCET DIGITAL HEALTH, 2024, 6 (03): : e222 - e229
  • [45] Clinical Decision Support and Natural Language Processing inMedicine:Systematic Literature Review
    Eguia, Hans
    Sanchez-Bocanegra, Carlos Luis
    Vinciarelli, Franco
    Alvarez-Lopez, Fernando
    Saigi-Rubio, Francesc
    JOURNAL OF MEDICAL INTERNET RESEARCH, 2024, 26
  • [46] Characterisation of digital therapeutic clinical trials: a systematic review with natural language processing
    Miao B.Y.
    Sushil M.
    Xu A.
    Wang M.
    Arneson D.
    Berkley E.
    Subash M.
    Vashisht R.
    Rudrapatna V.
    Butte A.J.
    The Lancet Digital Health, 2024, 6 (03): : e222 - e229
  • [47] A systematic review on natural language processing systems for eligibility prescreening in clinical research
    Idnay, Betina
    Dreisbach, Caitlin
    Weng, Chunhua
    Schnall, Rebecca
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2021, 29 (01) : 197 - 206
  • [48] Natural Language Processing to Ascertain Cancer Outcomes From Medical Oncologist Notes
    Kehl, Kenneth L.
    Xu, Wenxin
    Lepisto, Eva
    Elmarakeby, Haitham
    Hassett, Michael J.
    Van Allen, Eliezer M.
    Johnson, Bruce E.
    Schrag, Deborah
    JCO CLINICAL CANCER INFORMATICS, 2020, 4 : 680 - 690
  • [49] Development and Validation of a Natural Language Processing Algorithm for Extracting Clinical and Pathological Features of Breast Cancer From Pathology Reports
    Munzone, Elisabetta
    Marra, Antonio
    Comotto, Federico
    Guercio, Lorenzo
    Sangalli, Claudia Anna
    Lo Cascio, Martina
    Pagan, Eleonora
    Sangalli, Davide
    Bigoni, Ilaria
    Porta, Francesca Maria
    D'Ercole, Marianna
    Ritorti, Fabiana
    Bagnardi, Vincenzo
    Fusco, Nicola
    Curigliano, Giuseppe
    JCO CLINICAL CANCER INFORMATICS, 2024, 8
  • [50] Natural language processing algorithms for mapping clinical text fragments onto ontology concepts: a systematic review and recommendations for future studies
    Kersloot, Martijn G.
    van Putten, Florentien J. P.
    Abu-Hanna, Ameen
    Cornet, Ronald
    Arts, Derk L.
    JOURNAL OF BIOMEDICAL SEMANTICS, 2020, 11 (01)