Identifying and Predicting Intentional Self-Harm in Electronic Health Record Clinical Notes: Deep Learning Approach

被引:22
|
作者
Obeid, Jihad S. [1 ]
Dahne, Jennifer [1 ]
Christensen, Sean [1 ]
Howard, Samuel [1 ]
Crawford, Tami [1 ]
Frey, Lewis J. [1 ]
Stecker, Tracy [1 ]
Bunnell, Brian E. [2 ]
机构
[1] Med Univ South Carolina, 135 Cannon St Suite 405 MSC200, Charleston, SC 29425 USA
[2] Univ S Florida, Tampa, FL 33620 USA
基金
美国国家卫生研究院;
关键词
machine learning; deep learning; suicide; attempted; electronic health records; natural language processing; SUICIDE ATTEMPTS; RISK-FACTORS; METAANALYSIS; BEHAVIORS; THOUGHTS; MODEL;
D O I
10.2196/17784
中图分类号
R-058 [];
学科分类号
摘要
Background: Suicide is an important public health concern in the United States and around the world. There has been significant work examining machine learning approaches to identify and predict intentional self-harm and suicide using existing data sets. With recent advances in computing, deep learning applications in health care are gaining momentum. Objective: This study aimed to leverage the information in clinical notes using deep neural networks (DNNs) to (1) improve the identification of patients treated for intentional self-harm and (2) predict future self-harm events. Methods: We extracted clinical text notes from electronic health records (EHRs) of 835 patients with International Classification of Diseases (ICD) codes for intentional self-harm and 1670 matched controls who never had any intentional self-harm ICD codes. The data were divided into training and holdout test sets. We tested a number of algorithms on clinical notes associated with the intentional self-harm codes using the training set, including several traditional bag-of-words-based models and 2 DNN models: a convolutional neural network (CNN) and a long short-term memory model. We also evaluated the predictive performance of the DNNs on a subset of patients who had clinical notes 1 to 6 months before the first intentional self-harm event. Finally, we evaluated the impact of a pretrained model using Word2vec (W2V) on performance. Results: The area under the receiver operating characteristic curve (AUC) for the CNN on the phenotyping task, that is, the detection of intentional self-harm in clinical notes concurrent with the events was 0.999, with an Fl score of 0.985. In the predictive task, the CNN achieved the highest performance with an AUC of 0.882 and an Fl score of 0.769. Although pretraining with W2V shortened the DNN training time, it did not improve performance. Conclusions: The strong performance on the first task, namely, phenotyping based on clinical notes, suggests that such models could be used effectively for surveillance of intentional self-harm in clinical text in an EHR. The modest performance on the predictive task notwithstanding, the results using DNN models on clinical text alone are competitive with other reports in the literature using risk factors from structured EHR data.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Survey: Deep Learning Concepts and Techniques for Electronic Health Record
    al-Aiad, Ahmad
    Duwairi, Rehab
    Fraihat, Manar
    2018 IEEE/ACS 15TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2018,
  • [22] A practical approach to identifying autistic adults within the electronic health record
    Malow, Beth A.
    Veatch, Olivia J.
    Niu, Xinnan
    Fitzpatrick, Kasey A.
    Hucks, Donald
    Maxwell-Horn, Angie
    Davis, Lea K.
    AUTISM RESEARCH, 2023, 16 (01) : 52 - 65
  • [23] Harm minimisation for the management of self-harm: a mixed-methods analysis of electronic health records in secondary mental healthcare
    Cliffe, Charlotte
    Pitman, Alexandra
    Sedgwick, Rosemary
    Pritchard, Megan
    Dutta, Rina
    Rowe, Sarah
    BJPSYCH OPEN, 2021, 7 (04):
  • [24] DETECTION OF EMERGENCY DEPARTMENT VISITS FOR YOUTH SUICIDE AND SELF-HARM USING ELECTRONIC HEALTH RECORDS
    Edgcomb, Juliet B.
    Wilkerson, Catherine A.
    Zima, Bonnie T.
    JOURNAL OF THE AMERICAN ACADEMY OF CHILD AND ADOLESCENT PSYCHIATRY, 2022, 61 (10): : S220 - S220
  • [25] IDENTIFYING PATIENTS WITH ACUTE GASTROINTESTINAL BLEEDING USING NOTE TEXT IN THE ELECTRONIC HEALTH RECORD: A HYBRID NATURAL LANGUAGE PROCESSING AND DEEP LEARNING APPROACH
    Dupont, Johannes
    Zheng, Neil S.
    Laine, Loren
    Hsiao, Allen
    Thomas, Prem
    Partridge, Caitlin M.
    Shung, Dennis
    GASTROENTEROLOGY, 2023, 164 (06) : S838 - S838
  • [26] A Deep Learning Based Approach to Automate Clinical Coding of Electronic Health Records
    Kumar, Ashutosh
    Rathore, Santosh Singh
    BIG DATA ANALYTICS, BDA 2022, 2022, 13773 : 104 - 116
  • [27] Suicidal ideation and intentional self-harm in pregnancy as a neglected agenda in maternal health; an experience from rural Sri Lanka
    Nimna Sachini Malawara Arachchi
    Ranjan Ganegama
    Abdul Wahib Fathima Husna
    Delo Lashan Chandima
    Nandana Hettigama
    Jagath Premadasa
    Jagath Herath
    Harindra Ranaweera
    Thilini Chanchala Agampodi
    Suneth Buddhika Agampodi
    Reproductive Health, 16
  • [28] Suicidal ideation and intentional self-harm in pregnancy as a neglected agenda in maternal health; an experience from rural Sri Lanka
    Arachchi, Nimna Sachini Malawara
    Ganegama, Ranjan
    Husna, Abdul Wahib Fathima
    Chandima, Delo Lashan
    Hettigama, Nandana
    Premadasa, Jagath
    Herath, Jagath
    Ranaweera, Harindra
    Agampodi, Thilini Chanchala
    Agampodi, Suneth Buddhika
    REPRODUCTIVE HEALTH, 2019, 16 (01)
  • [29] Identifying plausible causal risk factors for self-harm: a polygenic risk scoring and Mendelian randomization approach
    Lim, Kai
    Rijsdijk, Fruhling
    Hagenaars, Saskia
    Socrates, Adam
    Choi, Shing Wan
    Coleman, Jonathan R. I.
    Glanville, Kylie P.
    Lewis, Cathryn M.
    Pingault, Jean-Baptiste
    BEHAVIOR GENETICS, 2019, 49 (06) : 522 - 523
  • [30] Predicting Clinical Outcomes Across Changing Electronic Health Record Systems
    Gong, Jen J.
    Naumann, Tristan
    Szolovits, Peter
    Guttag, John V.
    KDD'17: PROCEEDINGS OF THE 23RD ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING, 2017, : 1497 - 1505