Deep Learning for Cancer Symptoms Monitoring on the Basis of Electronic Health Record Unstructured Clinical Notes

被引:21
|
作者
Lindvall, Charlotta [1 ,2 ,3 ]
Deng, Chih-Ying [1 ]
Agaronnik, Nicole D. [1 ,2 ]
Kwok, Anne [1 ]
Samineni, Soujanya [1 ]
Umeton, Renato [1 ]
Mackie-Jenkins, Warren [1 ,3 ]
Kehl, Kenneth L. [1 ,2 ,3 ]
Tulsky, James A. [1 ,2 ,3 ]
Enzinger, Andrea C. [1 ,2 ,3 ]
机构
[1] Dana Farber Canc Inst, 450 Brookline Ave,LW-670, Boston, MA 02215 USA
[2] Harvard Med Sch, Boston, MA 02115 USA
[3] Brigham & Womens Hosp, 75 Francis St, Boston, MA 02115 USA
来源
关键词
PATIENT-REPORTED OUTCOMES; CARE; IDENTIFICATION; DOCUMENTATION; RELIABILITY; ONCOLOGY;
D O I
10.1200/CCI.21.00136
中图分类号
R73 [肿瘤学];
学科分类号
100214 ;
摘要
Deep learning models can identify a wide range of EHR-documented symptoms relevant to cancer care and electronic PROs. PURPOSE Symptoms are vital outcomes for cancer clinical trials, observational research, and population-level surveillance. Patient-reported outcomes (PROs) are valuable for monitoring symptoms, yet there are many challenges to collecting PROs at scale. We sought to develop, test, and externally validate a deep learning model to extract symptoms from unstructured clinical notes in the electronic health record. METHODS We randomly selected 1,225 outpatient progress notes from among patients treated at the Dana-Farber Cancer Institute between January 2016 and December 2019 and used 1,125 notes as our training/validation data set and 100 notes as our test data set. We evaluated the performance of 10 deep learning models for detecting 80 symptoms included in the National Cancer Institute's Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE) framework. Model performance as compared with manual chart abstraction was assessed using standard metrics, and the highest performer was externally validated on a sample of 100 physician notes from a different clinical context. RESULTS In our training and test data sets, 75 of the 80 candidate symptoms were identified. The ELECTRA-small model had the highest performance for symptom identification at the token level (ie, at the individual symptom level), with an F1 of 0.87 and a processing time of 3.95 seconds per note. For the 10 most common symptoms in the test data set, the F1 score ranged from 0.98 for anxious to 0.86 for fatigue. For external validation of the same symptoms, the note-level performance ranged from F1 = 0.97 for diarrhea and dizziness to F1 = 0.73 for swelling. CONCLUSION Training a deep learning model to identify a wide range of electronic health record-documented symptoms relevant to cancer care is feasible. This approach could be used at the health system scale to complement to electronic PROs.
引用
收藏
页数:10
相关论文
共 50 条
  • [1] Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach
    Obeid, Jihad S.
    Dahne, Jennifer
    Christensen, Sean
    Howard, Samuel
    Crawford, Tami
    Frey, Lewis J.
    Stecker, Tracy
    Bunnell, Brian E.
    JMIR MEDICAL INFORMATICS, 2020, 8 (07)
  • [2] Generating Clinical Notes for Electronic Health Record Systems
    Rosenbloom, S. T.
    Stead, W. W.
    Denny, J. C.
    Giuse, D.
    Lorenzi, N. M.
    Brown, S. H.
    Johnson, K. B.
    APPLIED CLINICAL INFORMATICS, 2010, 1 (03): : 232 - 243
  • [3] Deep Learning Techniques for Electronic Health Record Analysis
    Caroprese, Luciano
    Veltri, Pierangelo
    Vocaturo, Eugenio
    Zumpano, Ester
    2018 9TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS (IISA), 2018, : 734 - 737
  • [4] Monitoring clinical decision support in the electronic health record
    Lam, Jason H.
    Ng, Olivia
    AMERICAN JOURNAL OF HEALTH-SYSTEM PHARMACY, 2017, 74 (15) : 1130 - 1133
  • [5] Survey: Deep Learning Concepts and Techniques for Electronic Health Record
    al-Aiad, Ahmad
    Duwairi, Rehab
    Fraihat, Manar
    2018 IEEE/ACS 15TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2018,
  • [7] Conceptualizing Machine Learning for Dynamic Information Retrieval of Electronic Health Record Notes
    Jiang, Sharon
    Shen, Shannon
    Agrawal, Monica
    Lam, Barbara
    Kurtzman, Nicholas
    Horng, Steven
    Karger, David R.
    Sontag, David
    MACHINE LEARNING FOR HEALTHCARE CONFERENCE, VOL 219, 2023, 219
  • [8] The Learning Electronic Health Record
    Clermont, Gilles
    CRITICAL CARE CLINICS, 2023, 39 (04) : 689 - 700
  • [9] Identifying Patient-Level Social Determinants of Health in Unstructured Clinical Notes From Electronic Health Records
    Nikbakht, M.
    Kumar, V
    Rasouliyan, L.
    VALUE IN HEALTH, 2022, 25 (12) : S460 - S461
  • [10] Electronic Health Record Data in Cancer Learning Health Systems: Challenges and Opportunities
    Post, Andrew R.
    Burningham, Zachary
    Halwani, Ahmad S.
    JCO CLINICAL CANCER INFORMATICS, 2022, 6