Learning to Train and to Explain a Deep Survival Model with Large-Scale Ovarian Cancer Transcriptomic Data

被引:0
|
作者
Menand, Elena Spirina [1 ,2 ]
De Vries-Brilland, Manon [2 ,3 ]
Tessier, Leslie [2 ]
Dauve, Jonathan [2 ]
Campone, Mario [4 ,5 ]
Verriele, Veronique [6 ]
Jrad, Nisrine [1 ]
Marion, Jean-Marie [1 ]
Chauvet, Pierre [1 ]
Passot, Christophe [2 ]
Morel, Alain [2 ,5 ]
机构
[1] Univ Angers, Lab Angevin Rech Ingn Syst EA7315, F-49035 Angers, France
[2] Inst Cancerol Ouest Nantes Angers, Unite Genom Fonct, F-49055 Angers, France
[3] Inst Cancerol Ouest Nantes Angers, F-49000 Angers, France
[4] Inst Cancerol Ouest Nantes Angers, F-49000 Angers, France
[5] Nantes Univ, Univ Angers, CNRS, Inserm,CRCI2NA,SFR ICAT, F-49000 Angers, France
[6] Inst Cancerol Ouest Nantes Angers, Dept Anat & Cytol Pathol, F-49055 Angers, France
关键词
TCGA; ovarian cancer; RNA-seq; survival analysis; deep learning; molecular pathways; SIGNATURES;
D O I
10.3390/biomedicines12122881
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Background/Objectives: Ovarian cancer is a complex disease with poor outcomes that affects women worldwide. The lack of successful therapeutic options for this malignancy has led to the need to identify novel biomarkers for patient stratification. Here, we aim to develop the outcome predictors based on the gene expression data as they may serve to identify categories of patients who are more likely to respond to certain therapies. Methods: We used The Cancer Genome Atlas (TCGA) ovarian cancer transcriptomic data from 372 patients and approximately 16,600 genes to train and evaluate the deep learning survival models. In addition, we collected an in-house validation dataset of 12 patients to assess the performance of the trained survival models for their direct use in clinical practice. Despite deceptive generalization capabilities, we demonstrated how our model can be interpreted to uncover biological processes associated with survival. We calculated the contributions of the input genes to the output of the best trained model and derived the corresponding molecular pathways. Results: These pathways allowed us to stratify the TCGA patients into high-risk and low-risk groups (p-value 0.025). We validated the stratification ability of the identified pathways on the in-house dataset consisting of 12 patients (p-value 0.229) and on the external clinical and molecular dataset consisting of 274 patients (p-value 0.006). Conclusions: The deep learning-based models for survival prediction with RNA-seq data could be used to detect and interpret the gene-sets associated with survival in ovarian cancer patients and open a new avenue for future research.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Software Abstractions for Large-Scale Deep Learning Models in Big Data Analytics
    Khan, Ayaz H.
    Qamar, Ali Mustafa
    Yusuf, Aneeq
    Khan, Rehanullah
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (04) : 557 - 566
  • [22] Learning Compact Model for Large-Scale Multi-Label Data
    Wei, Tong
    Li, Yu-Feng
    THIRTY-THIRD AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FIRST INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE / NINTH AAAI SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2019, : 5385 - 5392
  • [23] A Supervised Learning Model for High-Dimensional and Large-Scale Data
    Peng, Chong
    Cheng, Jie
    Cheng, Qiang
    ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2017, 8 (02)
  • [24] Large-scale data classification method based on machine learning model
    Department of Electrical Engineering, Dalian Institute of Science and Technology, Dalian, China
    Int. J. Database Theory Appl., 2 (71-80):
  • [25] Hybrid Deep Learning Ensemble Model for Improved Large-Scale Car Recognition
    Verma, Abhishek
    Liu, Yu
    2017 IEEE SMARTWORLD, UBIQUITOUS INTELLIGENCE & COMPUTING, ADVANCED & TRUSTED COMPUTED, SCALABLE COMPUTING & COMMUNICATIONS, CLOUD & BIG DATA COMPUTING, INTERNET OF PEOPLE AND SMART CITY INNOVATION (SMARTWORLD/SCALCOM/UIC/ATC/CBDCOM/IOP/SCI), 2017,
  • [26] NaijaFaceVoice: A Large-Scale Deep Learning Model and Database of Nigerian Faces and Voices
    Akinrinmade, Adekunle Anthony
    Adetiba, Emmanuel
    Badejo, Joke A.
    Oshin, Oluwadamilola
    IEEE ACCESS, 2023, 11 : 58228 - 58243
  • [27] Efficient Use of GPU Memory for Large-Scale Deep Learning Model Training
    Choi, Hyeonseong
    Lee, Jaehwan
    APPLIED SCIENCES-BASEL, 2021, 11 (21):
  • [28] Deep Learning loss model for large-scale low voltage smart grids
    Velasco, Jose Angel
    Amaris, Hortensia
    Alonso, Monica
    INTERNATIONAL JOURNAL OF ELECTRICAL POWER & ENERGY SYSTEMS, 2020, 121
  • [29] Building large-scale density model via a deep-learning-based data-driven method
    Gao, Zhaoqi
    Li, Chuang
    Zhang, Bing
    Jiang, Xiudi
    Pan, Zhibin
    Gao, Jinghuai
    Xu, Zongben
    GEOPHYSICS, 2021, 86 (01) : M1 - M15
  • [30] Variational Autoencoder Model Combining Deep Learning and Probability Statistics and Its Application in Large-scale Data Analysis
    Zou, Lingguo
    Zhang, Meihua
    Informatica (Slovenia), 2024, 48 (22): : 31 - 46