Deep Learning for Cancer Symptoms Monitoring on the Basis of Electronic Health Record Unstructured Clinical Notes

被引：21

作者：

Lindvall, Charlotta ^{[1
,2
,3
]}

Deng, Chih-Ying ^{[1
]}

Agaronnik, Nicole D. ^{[1
,2
]}

Kwok, Anne ^{[1
]}

Samineni, Soujanya ^{[1
]}

Umeton, Renato ^{[1
]}

Mackie-Jenkins, Warren ^{[1
,3
]}

Kehl, Kenneth L. ^{[1
,2
,3
]}

Tulsky, James A. ^{[1
,2
,3
]}

Enzinger, Andrea C. ^{[1
,2
,3
]}

机构：

[1] Dana Farber Canc Inst, 450 Brookline Ave,LW-670, Boston, MA 02215 USA

[2] Harvard Med Sch, Boston, MA 02115 USA

[3] Brigham & Womens Hosp, 75 Francis St, Boston, MA 02115 USA

来源：

JCO CLINICAL CANCER INFORMATICS | 2022年 / 6卷

关键词：

PATIENT-REPORTED OUTCOMES; CARE; IDENTIFICATION; DOCUMENTATION; RELIABILITY; ONCOLOGY;

D O I：

10.1200/CCI.21.00136

中图分类号：

R73 [肿瘤学];

学科分类号：

100214 ;

摘要：

Deep learning models can identify a wide range of EHR-documented symptoms relevant to cancer care and electronic PROs. PURPOSE Symptoms are vital outcomes for cancer clinical trials, observational research, and population-level surveillance. Patient-reported outcomes (PROs) are valuable for monitoring symptoms, yet there are many challenges to collecting PROs at scale. We sought to develop, test, and externally validate a deep learning model to extract symptoms from unstructured clinical notes in the electronic health record. METHODS We randomly selected 1,225 outpatient progress notes from among patients treated at the Dana-Farber Cancer Institute between January 2016 and December 2019 and used 1,125 notes as our training/validation data set and 100 notes as our test data set. We evaluated the performance of 10 deep learning models for detecting 80 symptoms included in the National Cancer Institute's Patient-Reported Outcomes version of the Common Terminology Criteria for Adverse Events (PRO-CTCAE) framework. Model performance as compared with manual chart abstraction was assessed using standard metrics, and the highest performer was externally validated on a sample of 100 physician notes from a different clinical context. RESULTS In our training and test data sets, 75 of the 80 candidate symptoms were identified. The ELECTRA-small model had the highest performance for symptom identification at the token level (ie, at the individual symptom level), with an F1 of 0.87 and a processing time of 3.95 seconds per note. For the 10 most common symptoms in the test data set, the F1 score ranged from 0.98 for anxious to 0.86 for fatigue. For external validation of the same symptoms, the note-level performance ranged from F1 = 0.97 for diarrhea and dizziness to F1 = 0.73 for swelling. CONCLUSION Training a deep learning model to identify a wide range of electronic health record-documented symptoms relevant to cancer care is feasible. This approach could be used at the health system scale to complement to electronic PROs.

引用

页数：10

共 50 条

[1] Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach
Obeid, Jihad S.
Dahne, Jennifer
Christensen, Sean
Howard, Samuel
Crawford, Tami
Frey, Lewis J.
Stecker, Tracy
Bunnell, Brian E.
JMIR MEDICAL INFORMATICS, 2020, 8 (07)
[2] Generating Clinical Notes for Electronic Health Record Systems
Rosenbloom, S. T.
Stead, W. W.
Denny, J. C.
Giuse, D.
Lorenzi, N. M.
Brown, S. H.
Johnson, K. B.
APPLIED CLINICAL INFORMATICS, 2010, 1 (03): : 232 - 243
[3] Deep Learning Techniques for Electronic Health Record Analysis
Caroprese, Luciano
Veltri, Pierangelo
Vocaturo, Eugenio
Zumpano, Ester
2018 9TH INTERNATIONAL CONFERENCE ON INFORMATION, INTELLIGENCE, SYSTEMS AND APPLICATIONS (IISA), 2018, : 734 - 737
[4] Monitoring clinical decision support in the electronic health record
Lam, Jason H.
Ng, Olivia
AMERICAN JOURNAL OF HEALTH-SYSTEM PHARMACY, 2017, 74 (15) : 1130 - 1133
[5] Survey: Deep Learning Concepts and Techniques for Electronic Health Record
al-Aiad, Ahmad
Duwairi, Rehab
Fraihat, Manar
2018 IEEE/ACS 15TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2018,
[6] Deep learning based approach to unstructured record linkage
Jurek-Loughrey, Anna
INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2021, 17 (06) : 607 - 621
[7] Conceptualizing Machine Learning for Dynamic Information Retrieval of Electronic Health Record Notes
Jiang, Sharon
Shen, Shannon
Agrawal, Monica
Lam, Barbara
Kurtzman, Nicholas
Horng, Steven
Karger, David R.
Sontag, David
MACHINE LEARNING FOR HEALTHCARE CONFERENCE, VOL 219, 2023, 219
[8] The Learning Electronic Health Record
Clermont, Gilles
CRITICAL CARE CLINICS, 2023, 39 (04) : 689 - 700
[9] Identifying Patient-Level Social Determinants of Health in Unstructured Clinical Notes From Electronic Health Records
Nikbakht, M.
Kumar, V
Rasouliyan, L.
VALUE IN HEALTH, 2022, 25 (12) : S460 - S461
[10] Electronic Health Record Data in Cancer Learning Health Systems: Challenges and Opportunities
Post, Andrew R.
Burningham, Zachary
Halwani, Ahmad S.
JCO CLINICAL CANCER INFORMATICS, 2022, 6

← 1 2 3 4 5 →