Learning to Train and to Explain a Deep Survival Model with Large-Scale Ovarian Cancer Transcriptomic Data

被引:0
|
作者
Menand, Elena Spirina [1 ,2 ]
De Vries-Brilland, Manon [2 ,3 ]
Tessier, Leslie [2 ]
Dauve, Jonathan [2 ]
Campone, Mario [4 ,5 ]
Verriele, Veronique [6 ]
Jrad, Nisrine [1 ]
Marion, Jean-Marie [1 ]
Chauvet, Pierre [1 ]
Passot, Christophe [2 ]
Morel, Alain [2 ,5 ]
机构
[1] Univ Angers, Lab Angevin Rech Ingn Syst EA7315, F-49035 Angers, France
[2] Inst Cancerol Ouest Nantes Angers, Unite Genom Fonct, F-49055 Angers, France
[3] Inst Cancerol Ouest Nantes Angers, F-49000 Angers, France
[4] Inst Cancerol Ouest Nantes Angers, F-49000 Angers, France
[5] Nantes Univ, Univ Angers, CNRS, Inserm,CRCI2NA,SFR ICAT, F-49000 Angers, France
[6] Inst Cancerol Ouest Nantes Angers, Dept Anat & Cytol Pathol, F-49055 Angers, France
关键词
TCGA; ovarian cancer; RNA-seq; survival analysis; deep learning; molecular pathways; SIGNATURES;
D O I
10.3390/biomedicines12122881
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Background/Objectives: Ovarian cancer is a complex disease with poor outcomes that affects women worldwide. The lack of successful therapeutic options for this malignancy has led to the need to identify novel biomarkers for patient stratification. Here, we aim to develop the outcome predictors based on the gene expression data as they may serve to identify categories of patients who are more likely to respond to certain therapies. Methods: We used The Cancer Genome Atlas (TCGA) ovarian cancer transcriptomic data from 372 patients and approximately 16,600 genes to train and evaluate the deep learning survival models. In addition, we collected an in-house validation dataset of 12 patients to assess the performance of the trained survival models for their direct use in clinical practice. Despite deceptive generalization capabilities, we demonstrated how our model can be interpreted to uncover biological processes associated with survival. We calculated the contributions of the input genes to the output of the best trained model and derived the corresponding molecular pathways. Results: These pathways allowed us to stratify the TCGA patients into high-risk and low-risk groups (p-value 0.025). We validated the stratification ability of the identified pathways on the in-house dataset consisting of 12 patients (p-value 0.229) and on the external clinical and molecular dataset consisting of 274 patients (p-value 0.006). Conclusions: The deep learning-based models for survival prediction with RNA-seq data could be used to detect and interpret the gene-sets associated with survival in ovarian cancer patients and open a new avenue for future research.
引用
收藏
页数:16
相关论文
共 50 条
  • [41] Heading Direction Estimation Using Deep Learning with Automatic Large-scale Data Acquisition
    Berriel, Rodrigo E.
    Tones, Lucas Tabelini
    Cardoso, Vinicius B.
    Guidolini, Ranik
    Badue, Claudine
    De Souza, Alberto F.
    Oliveira-Santos, Thiago
    2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2018,
  • [42] LARGE-SCALE VEGETATION HEIGHT MAPPING FROM SENTINEL DATA USING DEEP LEARNING
    Waldeland, Anders U.
    Salberg, Arnt-Borre
    Trier, Oivind D.
    Vollrath, Andreas
    IGARSS 2020 - 2020 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM, 2020, : 1877 - 1880
  • [43] Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection
    Levine, Sergey
    Pastor, Peter
    Krizhevsky, Alex
    Ibarz, Julian
    Quillen, Deirdre
    INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 2018, 37 (4-5): : 421 - 436
  • [44] PREDICTING ONCOLOGIC SURVIVAL OUTCOMES USING MACHINE LEARNING AND LARGE-SCALE REGISTRY DATA THE BC CANCER REGISTRY EXPERIENCE
    Zhao, Rachel
    Kim, Melodie
    Marwaha, Arshdeep
    Stubbs, Terry
    Sandhu, Prableen
    Narinesingh, Dylan
    Badragan, Iulian
    Proulx, Ryan
    Krauze, Andra
    RADIOTHERAPY AND ONCOLOGY, 2021, 163 : S56 - S56
  • [45] Automated curation of large-scale cancer histopathology image datasets using deep learning
    Hilgers, Lars
    Laleh, Narmin Ghaffari
    West, Nicholas P.
    Westwood, Alice
    Hewitt, Katherine J.
    Quirke, Philip
    Grabsch, Heike, I
    Carrero, Zunamys, I
    Matthaei, Emylou
    Loeffler, Chiara M. L.
    Brinker, Titus J.
    Yuan, Tanwei
    Brenner, Hermann
    Brobeil, Alexander
    Hoffmeister, Michael
    Kather, Jakob Nikolas
    HISTOPATHOLOGY, 2024, 84 (07) : 1139 - 1153
  • [46] Data model for large-scale structural experiments
    Lee, Chang-Ho
    Chin, Chung H.
    Marullo, Thomas
    Bryan, Peter
    Sause, Richard
    Ricles, James M.
    JOURNAL OF EARTHQUAKE ENGINEERING, 2008, 12 (01) : 115 - 135
  • [47] Large-Scale Semi-Supervised Training in Deep Learning Acoustic Model for ASR
    Long, Yanhua
    Li, Yijie
    Wei, Shuang
    Zhang, Qiaozheng
    Yang, Chunxia
    IEEE ACCESS, 2019, 7 : 133615 - 133627
  • [48] A Hybrid Data Model for Large-Scale Analytics
    Feo, John
    2018 ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS, 2018, : 269 - 269
  • [49] Rec-PF: Data-Driven Large-Scale Deep Learning Recommendation Model Training Optimization Based on Tensor-Train Embedding Table With Photovoltaic Forecast
    Li, Yunfeng
    Wang, Zheng
    Ren, Chenhao
    Hou, Xiaoming
    Zhang, Shengli
    IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2025, 55 (01): : 573 - 586
  • [50] Nonparametric Data Reduction Approach for Large-Scale Survival Data Analysis
    Sadeghzadeh, Keivan
    Fard, Nasser
    2015 61ST ANNUAL RELIABILITY AND MAINTAINABILITY SYMPOSIUM (RAMS 2015), 2015,