Extracting cancer concepts from clinical notes using natural language processing: a systematic review

被引:13
|
作者
Gholipour, Maryam [1 ]
Khajouei, Reza [2 ]
Amiri, Parastoo [1 ]
Gohari, Sadrieh Hajesmaeel [3 ]
Ahmadian, Leila [2 ]
机构
[1] Kerman Univ Med Sci, Student Res Comm, Kerman, Iran
[2] Kerman Univ Med Sci, Fac Management & Med Informat Sci, Dept Hlth Informat Sci, Kerman, Iran
[3] Kerman Univ Med Sci, Inst Futures Studies Hlth, Med Informat Res Ctr, Kerman, Iran
关键词
Neoplasms; Natural language processing; NLP; Machine learning; Terminology; Information system; Systematic review; RADIOLOGY REPORTS; CLASSIFICATION; RETRIEVAL; RECORDS;
D O I
10.1186/s12859-023-05480-0
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
BackgroundExtracting information from free texts using natural language processing (NLP) can save time and reduce the hassle of manually extracting large quantities of data from incredibly complex clinical notes of cancer patients. This study aimed to systematically review studies that used NLP methods to identify cancer concepts from clinical notes automatically.MethodsPubMed, Scopus, Web of Science, and Embase were searched for English language papers using a combination of the terms concerning "Cancer", "NLP", "Coding", and "Registries" until June 29, 2021. Two reviewers independently assessed the eligibility of papers for inclusion in the review.ResultsMost of the software programs used for concept extraction reported were developed by the researchers (n = 7). Rule-based algorithms were the most frequently used algorithms for developing these programs. In most articles, the criteria of accuracy (n = 14) and sensitivity (n = 12) were used to evaluate the algorithms. In addition, Systematized Nomenclature of Medicine-Clinical Terms (SNOMED-CT) and Unified Medical Language System (UMLS) were the most commonly used terminologies to identify concepts. Most studies focused on breast cancer (n = 4, 19%) and lung cancer (n = 4, 19%).ConclusionThe use of NLP for extracting the concepts and symptoms of cancer has increased in recent years. The rule-based algorithms are well-liked algorithms by developers. Due to these algorithms' high accuracy and sensitivity in identifying and extracting cancer concepts, we suggested that future studies use these algorithms to extract the concepts of other diseases as well.
引用
收藏
页数:16
相关论文
共 50 条
  • [31] Applying Natural Language Processing to Textual Data From Clinical Data Warehouses: Systematic Review
    Bazoge, Adrien
    Morin, Emmanuel
    Daille, Beatrice
    Gourraud, Pierre -Antoine
    JMIR MEDICAL INFORMATICS, 2023, 11
  • [32] From admission to discharge: a systematic review of clinical natural language processing along the patient journey
    Klug, Katrin
    Beckh, Katharina
    Antweiler, Dario
    Chakraborty, Nilesh
    Baldini, Giulia
    Laue, Katharina
    Hosch, Rene
    Nensa, Felix
    Schuler, Martin
    Giesselbach, Sven
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2024, 24 (01)
  • [33] Natural language processing pipeline to extract prostate cancer-related information from clinical notes
    Nakai, Hirotsugu
    Suman, Garima
    Adamo, Daniel A.
    Navin, Patrick J.
    Bookwalter, Candice A.
    LeGout, Jordan D.
    Chen, Frank K.
    Wellnitz, Clinton V.
    Silva, Alvin C.
    Thomas, John V.
    Kawashima, Akira
    Fan, Jungwei W.
    Froemming, Adam T.
    Lomas, Derek J.
    Humphreys, Mitchell R.
    Dora, Chandler
    Korfiatis, Panagiotis
    Takahashi, Naoki
    EUROPEAN RADIOLOGY, 2024, 34 (12) : 7878 - 7891
  • [34] Identifying stigmatizing and positive/preferred language in obstetric clinical notes using natural language processing
    Scroggins, Jihye Kim
    Hulchafo, Ismael I.
    Harkins, Sarah
    Scharp, Danielle
    Moen, Hans
    Davoudi, Anahita
    Cato, Kenrick
    Tadiello, Michele
    Topaz, Maxim
    Barcelona, Veronica
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2024,
  • [35] A Systematic Review of Natural Language Processing in Healthcare
    Panchbhai, Bhanudas Suresh
    Pathak, Varsha Makarand
    JOURNAL OF ALGEBRAIC STATISTICS, 2022, 13 (01) : 682 - 707
  • [36] Natural Language Processing in Radiology: A Systematic Review
    Pons, Ewoud
    Braun, Loes M. M.
    Hunink, M. G. Myriam
    Kors, Jan A.
    RADIOLOGY, 2016, 279 (02) : 329 - 343
  • [37] Extracting seizure control metrics from clinic notes of patients with epilepsy: A natural language processing approach
    Fernandes, Marta
    Cardall, Aidan
    Moura, Lidia M. V. R.
    Mcgraw, Christopher
    Zafar, Sahar F.
    Westover, M. Brandon
    EPILEPSY RESEARCH, 2024, 207
  • [38] Extracting seizure frequency from epilepsy clinic notes: a machine reading approach to natural language processing
    Xie, Kevin
    Gallagher, Ryan S.
    Conrad, Erin C.
    Garrick, Chadric O.
    Baldassano, Steven N.
    Bernabei, John M.
    Galer, Peter D.
    Ghosn, Nina J.
    Greenblatt, Adam S.
    Jennings, Tara
    Kornspun, Alana
    Kulick-Soper, Catherine, V
    Panchal, Jal M.
    Pattnaik, Akash R.
    Scheid, Brittany H.
    Wei, Danmeng
    Weitzman, Micah
    Muthukrishnan, Ramya
    Kim, Joongwon
    Litt, Brian
    Ellis, Colin A.
    Roth, Dan
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2022, 29 (05) : 873 - 881
  • [39] Extracting adverse drug events from clinical Notes: A systematic review of approaches used
    Modi, Salisu
    Kasmiran, Khairul Azhar
    Sharef, Nurfadhlina Mohd
    Sharum, Mohd Yunus
    JOURNAL OF BIOMEDICAL INFORMATICS, 2024, 151
  • [40] Using Natural Language Processing and Machine Learning To Identify Gout Flares From Electronic Clinical Notes
    Zheng, Chengyi
    Rashid, Nazia
    Cheetham, T. Craig
    Wu, Yi-Lin
    Levy, Gerald D.
    ARTHRITIS AND RHEUMATISM, 2013, 65 : S856 - S857