Natural language processing for automated quantification of bone metastases reported in free-text bone scintigraphy reports

被引:17
|
作者
Groot, Olivier Q. [1 ,2 ]
Bongers, Michiel E. R. [1 ]
Karhade, Aditya V. [1 ]
Kapoor, Neal D. [1 ]
Fenn, Brian P. [1 ]
Kim, Jason [1 ]
Verlaan, J. J. [2 ]
Schwab, Joseph H. [1 ]
机构
[1] Harvard Med Sch, Massachusetts Gen Hosp, Orthopaed Oncol Serv, Dept Orthopaed Surg, 55 Fruit St, Boston, MA 02114 USA
[2] Univ Utrecht, Univ Med Ctr Utrecht, Dept Orthopaed Surg, Utrecht, Netherlands
基金
美国国家卫生研究院;
关键词
D O I
10.1080/0284186X.2020.1819563
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Background The widespread use of electronic patient-generated health data has led to unprecedented opportunities for automated extraction of clinical features from free-text medical notes. However, processing this rich resource of data for clinical and research purposes, depends on labor-intensive and potentially error-prone manual review. The aim of this study was to develop a natural language processing (NLP) algorithm for binary classification (single metastasis versus two or more metastases) in bone scintigraphy reports of patients undergoing surgery for bone metastases. Material and methods Bone scintigraphy reports of patients undergoing surgery for bone metastases were labeled each by three independent reviewers using a binary classification (single metastasis versus two or more metastases) to establish a ground truth. A stratified 80:20 split was used to develop and test an extreme-gradient boosting supervised machine learning NLP algorithm. Results A total of 704 free-text bone scintigraphy reports from 704 patients were included in this study and 617 (88%) had multiple bone metastases. In the independent test set (n = 141) not used for model development, the NLP algorithm achieved an 0.97 AUC-ROC (95% confidence interval [CI], 0.92-0.99) for classification of multiple bone metastases and an 0.99 AUC-PRC (95% CI, 0.99-0.99). At a threshold of 0.90, NLP algorithm correctly identified multiple bone metastases in 117 of the 124 who had multiple bone metastases in the testing cohort (sensitivity 0.94) and yielded 3 false positives (specificity 0.82). At the same threshold, the NLP algorithm had a positive predictive value of 0.97 and F1-score of 0.96. Conclusions NLP has the potential to automate clinical data extraction from free text radiology notes in orthopedics, thereby optimizing the speed, accuracy, and consistency of clinical chart review. Pending external validation, the NLP algorithm developed in this study may be implemented as a means to aid researchers in tackling large amounts of data.
引用
收藏
页码:1455 / 1460
页数:6
相关论文
共 50 条
  • [21] COGNITIVE ASPECTS IN NATURAL-LANGUAGE AND FREE-TEXT SEARCHING
    WORMELL, I
    SOCIAL SCIENCE INFORMATION STUDIES, 1984, 4 (2-3): : 131 - 141
  • [22] FREE-TEXT DOCUMENTATION OF SLEEP DISTURBANCE IN ACUTE MYELOID LEUKEMIA: A NATURAL LANGUAGE PROCESSING STUDY
    Jeon, Bomin
    Chae, Sena
    SLEEP, 2024, 47 : A389 - A389
  • [23] Developing a Word Lexicon from Electronic Health Records for Natural Language Processing Analysis of Free-Text Reports for Patients with Venous Thromboembolism
    Jagasia, S.
    Krauze, A. V.
    INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2023, 117 (02): : E469 - E469
  • [24] Deep-Learning-Based Natural Language Processing of Serial Free-Text Radiological Reports for Predicting Rectal Cancer Patient Survival
    Kim, Sunkyu
    Lee, Choong-kun
    Choi, Yonghwa
    Baek, Eun Sil
    Choi, Jeong Eun
    Lim, Joon Seok
    Kang, Jaewoo
    Shin, Sang Joon
    FRONTIERS IN ONCOLOGY, 2021, 11
  • [25] Automated Classification of Free-text Pathology Reports for Registration of Incident Cases of Cancer
    Jouhet, V.
    Defossez, G.
    Burgun, A.
    le Beux, P.
    Levillain, P.
    Ingrand, P.
    Claveau, V.
    METHODS OF INFORMATION IN MEDICINE, 2012, 51 (03) : 242 - 251
  • [26] Natural language processing of symptoms documented in free-text narratives of electronic health records: a systematic review
    Koleck, Theresa A.
    Dreisbach, Caitlin
    Bourne, Philip E.
    Bakken, Suzanne
    JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2019, 26 (04) : 364 - 379
  • [27] Analyzing free-text clinical narratives for veterans with lymphoid malignancies using natural language processing (NLP)
    He, Lu
    Moldenhauer, Matthew
    Zheng, Kai
    Ma, Helen
    JOURNAL OF CLINICAL ONCOLOGY, 2023, 41 (16)
  • [28] Prognosis of P16 and HPV Discordant Oropharyngeal Cancers: Natural Language Processing to Extract Data from Free-Text Pathology Reports
    Shin, E.
    Cartano, O.
    Lee, N. Y.
    Kang, J. J.
    INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2022, 114 (03): : E322 - E322
  • [29] Natural Language Processing for Automatic Identification of Major Depressive Disorders in Free-Text Electronic Health Records
    Nunez, Nicolas
    Biernacka, Joanna M.
    Gardea-Resendez, Manuel
    Kshatriya, Bhavani Singh Agnikula
    Ryu, Euijung
    Fu, Sunyang
    Singh, Balwinder
    Coombes, Brandon
    Frye, Mark
    Wang, Yanshan
    BIOLOGICAL PSYCHIATRY, 2021, 89 (09) : S155 - S155
  • [30] Natural Language Processing of Symptoms Preceding Diagnosis and Palliative Radiotherapy for Bone Metastases
    Chen, J. J.
    Friesner, I.
    Chang, C.
    Ni, L.
    Braunstein, S. E.
    Boreta, L.
    Hong, J. C.
    INTERNATIONAL JOURNAL OF RADIATION ONCOLOGY BIOLOGY PHYSICS, 2022, 114 (03): : S18 - S18