TCGAbiolinks: an R/Bioconductor package for integrative analysis of TCGA data

Cited by: 2420
Authors
Colaprico, Antonio [1, 2]
Silva, Tiago C. [3, 4]
Olsen, Catharina [1, 2]
Garofano, Luciano [5, 6]
Cava, Claudia [7]
Garolini, Davide [8]
Sabedot, Thais S. [3, 4]
Malta, Tathiane M. [3, 4]
Pagnotta, Stefano M. [5, 9]
Castiglioni, Isabella
Ceccarelli, Michele [10]
Bontempi, Gianluca [1, 2]
Noushmehr, Houtan [3, 4]
Affiliations
[1] Interuniv Inst Bioinformat Brussels, Brussels, Belgium
[2] Univ Libre Bruxelles, Dept Informat, Machine Learning Grp, Brussels, Belgium
[3] Univ Sao Paulo, Ribeirao Preto Med Sch, Dept Genet, Sao Paulo, Brazil
[4] NAP USP, Ctr Integrat Syst Biol CISBi, Sao Paulo, Brazil
[5] Univ Sannio, Dept Sci & Technol, Benevento, Italy
[6] Unltd Software Srl, Naples, Italy
[7] Natl Res Council IBFM CNR, Inst Mol Bioimaging & Physiol, Milan, Italy
[8] Univ Turin, Dept Phys, Phys Complex Syst, I-10124 Turin, Italy
[9] BIOGEM, Bioinformat Lab, Avellino, Italy
[10] HBKU, Qatar Comp Res Inst, Doha, Qatar
Funding
São Paulo Research Foundation, Brazil;
Keywords
SOMATIC GENOMIC LANDSCAPE; CANCER GENOMICS; BIOCONDUCTOR; SOFTWARE;
DOI
10.1093/nar/gkv1507
Chinese Library Classification number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Discipline classification code
071010; 081704;
Abstract
The Cancer Genome Atlas (TCGA) research network has made public a large collection of clinical and molecular phenotypes of more than 10 000 tumor patients across 33 different tumor types. Using this cohort, TCGA has published over 20 marker papers detailing the genomic and epigenomic alterations associated with these tumor types. Although many important discoveries have been made by TCGA's research network, opportunities still exist to implement novel methods, thereby elucidating new biological pathways and diagnostic markers. However, mining the TCGA data presents several bioinformatics challenges, such as data retrieval and integration with clinical data and other molecular data types (e.g. RNA and DNA methylation). We developed an R/Bioconductor package called TCGAbiolinks to address these challenges and offer bioinformatics solutions by using a guided workflow to allow users to query, download and perform integrative analyses of TCGA data. We combined methods from computer science and statistics into the pipeline and incorporated methodologies developed in previous TCGA marker studies and in our own group. Using four different TCGA tumor types (Kidney, Brain, Breast and Colon) as examples, we provide case studies to illustrate examples of reproducibility, integrative analysis and utilization of different Bioconductor packages to advance and accelerate novel discoveries.
Pages: 11