Contrastive Learning of Temporal Distinctiveness for Survival Analysis in Electronic Health Records

被引:3
|
作者
Kerdabadi, Mohsen Nayebi [1 ]
Moghaddam, Arya Hadizadeh [1 ]
Liu, Bin [2 ]
Liu, Mei [3 ]
Yao, Zijun [1 ]
机构
[1] Univ Kansas, Lawrence, KS 66045 USA
[2] West Virginia Univ, Morgantown, WV USA
[3] Univ Florida, Gainesville, FL 32611 USA
基金
美国国家科学基金会;
关键词
Survival Analysis; Contrastive Learning; Electronic Health Records; ACUTE KIDNEY INJURY;
D O I
10.1145/3583780.3614824
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Survival analysis plays a crucial role in many healthcare decisions, where the risk prediction for the events of interest can support an informative outlook for a patient's medical journey. Given the existence of data censoring, an effective way of survival analysis is to enforce the pairwise temporal concordance between censored and observed data, aiming to utilize the time interval before censoring as partially observed time-to-event labels for supervised learning. Although existing studies mostly employed ranking methods to pursue an ordering objective, contrastive methods which learn a discriminative embedding by having data contrast against each other, have not been explored thoroughly for survival analysis. Therefore, in this paper, we propose a novel Ontology-aware Temporality-based Contrastive Survival (OTCSurv) analysis framework that utilizes survival durations from both censored and observed data to define temporal distinctiveness and construct negative sample pairs with varying hardness for contrastive learning. Specifically, we first use an ontological encoder and a sequential self-attention encoder to represent the longitudinal EHR data with rich contexts. Second, we design a temporal contrastive loss to capture varying survival durations in a supervised setting through a hardness-aware negative sampling mechanism. Last, we incorporate the contrastive task into the time-to-event predictive task with multiple loss components. We conduct extensive experiments using a large EHR dataset to forecast the risk of hospitalized patients who are in danger of developing acute kidney injury (AKI), a critical and urgent medical condition. The effectiveness and explainability of the proposed model are validated through comprehensive quantitative and qualitative studies.
引用
收藏
页码:1897 / 1906
页数:10
相关论文
共 50 条
  • [21] A machine learning approach to leveraging electronic health records for enhanced omics analysis
    Mataraso, Samson J.
    Espinosa, Camilo A.
    Seong, David
    Reincke, S. Momsen
    Berson, Eloise
    Reiss, Jonathan D.
    Kim, Yeasul
    Ghanem, Marc
    Shu, Chi-Hung
    James, Tomin
    Tan, Yuqi
    Shome, Sayane
    Stelzer, Ina A.
    Feyaerts, Dorien
    Wong, Ronald J.
    Shaw, Gary M.
    Angst, Martin S.
    Gaudilliere, Brice
    Stevenson, David K.
    Aghaeepour, Nima
    NATURE MACHINE INTELLIGENCE, 2025, 7 (02) : 293 - 306
  • [22] MedGraph: Structural and Temporal Representation Learning of Electronic Medical Records
    Hettige, Bhagya
    Wang, Weiqing
    Li, Yuan-Fang
    Le, Suong
    Buntine, Wray
    ECAI 2020: 24TH EUROPEAN CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2020, 325 : 1810 - 1817
  • [23] Temporal Weighting of Clinical Events In Electronic Health Records for Pharmacovigilance
    Zhao, Jing
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 375 - 381
  • [24] Neural Temporal Point Processes Modelling Electronic Health Records
    Enguehard, Joseph
    Busbridge, Dan
    Bozson, Adam
    Woodcock, Claire
    Hammerla, Nils
    MACHINE LEARNING FOR HEALTH, VOL 136, 2020, 136 : 85 - 113
  • [25] Batch Integrated Gradients: Explanations for Temporal Electronic Health Records
    Duell, Jamie
    Fan, Xiuyi
    Fu, Hsuan
    Seisenberger, Monika
    ARTIFICIAL INTELLIGENCE IN MEDICINE, AIME 2023, 2023, 13897 : 120 - 124
  • [26] Predictive Modeling with Temporal Graphical Representation on Electronic Health Records
    Chen, Jiayuan
    Yin, Changchang
    Wang, Yuanlong
    Zhang, Ping
    PROCEEDINGS OF THE THIRTY-THIRD INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, IJCAI 2024, 2024, : 5763 - 5771
  • [27] Identifying prognostic factors for survival in intensive care unit patients with SIRS or sepsis by machine learning analysis on electronic health records
    Mollura, Maximiliano
    Chicco, Davide
    Paglialonga, Alessia
    Barbieri, Riccardo
    PLOS DIGITAL HEALTH, 2024, 3 (03):
  • [28] Learning from heterogeneous temporal data in electronic health records (vol 65, pg 105, 2017)
    Zhao, Jing
    Papapetrou, Panagiotis
    Asker, Lars
    Bostrom, Henrik
    JOURNAL OF BIOMEDICAL INFORMATICS, 2020, 101
  • [29] Semi-supervised Double Deep Learning Temporal Risk Prediction (SeDDLeR) with Electronic Health Records
    Nogues, Isabelle-Emmanuella
    Wen, Jun
    Zhao, Yihan
    Bonzel, Clara-Lea
    Castro, Victor M.
    Lin, Yucong
    Xu, Shike
    Hou, Jue
    Cai, Tianxi
    JOURNAL OF BIOMEDICAL INFORMATICS, 2024, 157
  • [30] Scalable and accurate deep learning with electronic health records
    Alvin Rajkomar
    Eyal Oren
    Kai Chen
    Andrew M. Dai
    Nissan Hajaj
    Michaela Hardt
    Peter J. Liu
    Xiaobing Liu
    Jake Marcus
    Mimi Sun
    Patrik Sundberg
    Hector Yee
    Kun Zhang
    Yi Zhang
    Gerardo Flores
    Gavin E. Duggan
    Jamie Irvine
    Quoc Le
    Kurt Litsch
    Alexander Mossin
    Justin Tansuwan
    De Wang
    James Wexler
    Jimbo Wilson
    Dana Ludwig
    Samuel L. Volchenboum
    Katherine Chou
    Michael Pearson
    Srinivasan Madabushi
    Nigam H. Shah
    Atul J. Butte
    Michael D. Howell
    Claire Cui
    Greg S. Corrado
    Jeffrey Dean
    npj Digital Medicine, 1