Genomic Data Integration: A case study on next generation sequencing of cancer

被引:0
|
作者
Weitschek, Emanuel [1 ,2 ]
Cumbo, Fabio [2 ,3 ]
Cappelli, Eleonora [3 ]
Felici, Giovanni [2 ]
机构
[1] Uninettuno Int Univ, Dept Engn, Corso Vittorio Emanuele 2 39, I-00186 Rome, Italy
[2] CNR, Inst Syst Anal & Comp Sci, Via Taurini 19, I-00185 Rome, Italy
[3] Roma Tre Univ, Dept Engn, Via Vasca Navale 79, I-00146 Rome, Italy
关键词
CLASSIFICATION; SYSTEM;
D O I
10.1109/DEXA.2016.14
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Due to the great advances of Next Generation Sequencing (NGS) techniques, bioinformaticians are faced with large amounts of genomic and clinical data, which are growing exponentially. A striking example is The Cancer Genome Atlas (TCGA), whose aim is to provide a comprehensive archive of biomedical data about tumors. Indeed, TCGA contains more than 15 TB of genomic and clinical data, whose analysis and interpretation are posing great challenges to the bioinformatics community. In this work, we focus on integration and analysis of NGS data extracted from TCGA. In particular, we integrate RNA-seq and DNA-methylation experiments and perform a supervised classification analysis. Thanks to this integration, we are able to distinguish successfully the tumoral samples from the normal ones and to extract reliable rule-based classification models that contain salient features (i.e., genes and methylated sites). These features, which are related to the investigated tumor, can be studied by domain experts in order to obtain new knowledge about cancer. Finally, our proposed integration and analysis method can be adopted with success for further studies on different data sources and NGS experiments.
引用
收藏
页码:49 / 53
页数:5
相关论文
共 50 条
  • [41] Genomic data integration tutorial, a plant case study
    Mardoc, Emile
    Sow, Mamadou Dia
    Dejean, Sebastien
    Salse, Jerome
    BMC GENOMICS, 2024, 25 (01)
  • [42] Genomic data integration tutorial, a plant case study
    Emile Mardoc
    Mamadou Dia Sow
    Sébastien Déjean
    Jérôme Salse
    BMC Genomics, 25
  • [43] Identification of Viral Integration Sites in Cancer Genomes Using Unmapped Reads in Targeted Next-generation Sequencing Data
    Bowman, A. S.
    Middha, S.
    Vanderbilt, C.
    Ladanyi, M.
    Berger, M.
    Zehir, A.
    JOURNAL OF MOLECULAR DIAGNOSTICS, 2018, 20 (06): : 963 - 964
  • [44] Characterization of cancer genomic heterogeneity by next-generation sequencing advances precision medicine in cancer treatment
    Zhang, Jialing
    Spath, Stephan Stanislaw
    Marjani, Sadie L.
    Zhang, Wengeng
    Pan, Xinghua
    PRECISION CLINICAL MEDICINE, 2018, 1 (01) : 29 - 48
  • [45] COMPLETE GENOMIC SEQUENCING OF NOVEL HLA ALLELES USING NEXT GENERATION SEQUENCING
    Albrecht, Viviane
    Lang, Kathrin
    Heder, Carolin
    Schoene, Bianca
    Lange, Vinzenz
    Boehme, Irina
    Schmidt, Alexander H.
    TISSUE ANTIGENS, 2014, 84 (01): : 108 - 108
  • [46] Estimating the Length Distributions of Genomic Micro-satellites from Next Generation Sequencing Data
    Feng, Xuan
    Hu, Huan
    Zhao, Zhongmeng
    Zhang, Xuanping
    Wang, Jiayin
    BIOINFORMATICS AND BIOMEDICAL ENGINEERING, IWBBIO 2018, PT I, 2018, 10813 : 461 - 472
  • [47] Variant Callers for Next-Generation Sequencing Data: A Comparison Study
    Liu, Xiangtao
    Han, Shizhong
    Wang, Zuoheng
    Gelernter, Joel
    Yang, Bao-Zhu
    PLOS ONE, 2013, 8 (09):
  • [48] Genomic Profiling of Micropapillary Adenocarcinoma of the Lung by Next Generation Sequencing
    Sande, Christopher
    Guseva, Natalya
    Stence, Aaron
    Zhang, Jun
    Ma, Deqin
    LABORATORY INVESTIGATION, 2018, 98 : 749 - 750
  • [49] Genomic Next Generation Sequencing and quality assurance: challenges and opportunities
    Eggermann, Thomas
    MOLECULAR CYTOGENETICS, 2019, 12
  • [50] NGSNGS: next-generation simulator for next-generation sequencing data
    Henriksen, Rasmus Amund
    Zhao, Lei
    Korneliussen, Thorfinn Sand
    BIOINFORMATICS, 2023, 39 (01)