Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data

Cited by: 506
Authors
Favero, F. [1]
Joshi, T. [1]
Marquard, A. M. [1]
Birkbak, N. J. [1]
Krzystanek, M. [1]
Li, Q. [1, 2]
Szallasi, Z. [1, 3]
Eklund, A. C. [1]
Affiliations
[1] Tech Univ Denmark, Dept Syst Biol, Ctr Biol Sequence Anal, DK-2800 Lyngby, Denmark
[2] Xiamen Univ, Sch Med, Xiamen, Peoples R China
[3] Harvard Univ, Sch Med, Harvard MIT Div Hlth Sci & Technol CHIP HST, Childrens Hosp,Informat Program, Boston, MA USA
Funding
European Union Seventh Framework Programme
Keywords
cancer genomics; copy number alterations; mutations; next-generation sequencing; software; CANCER; EVOLUTION; PURITY;
DOI
10.1093/annonc/mdu479
Chinese Library Classification
R73 [Oncology]
Discipline classification code
100214
Abstract
Exome or whole-genome deep sequencing of tumor DNA along with paired normal DNA can potentially provide a detailed picture of the somatic mutations that characterize the tumor. However, analysis of such sequence data can be complicated by the presence of normal cells in the tumor specimen, by intratumor heterogeneity, and by the sheer size of the raw data. In particular, determination of copy number variations from exome sequencing data alone has proven difficult; thus, single nucleotide polymorphism (SNP) arrays have often been used for this task. Recently, algorithms to estimate absolute, but not allele-specific, copy number profiles from tumor sequencing data have been described. We developed Sequenza, a software package that uses paired tumor-normal DNA sequencing data to estimate tumor cellularity and ploidy, and to calculate allele-specific copy number profiles and mutation profiles. We applied Sequenza, as well as two previously published algorithms, to exome sequence data from 30 tumors from The Cancer Genome Atlas. We assessed the performance of these algorithms by comparing their results with those generated using matched SNP arrays and processed by the allele-specific copy number analysis of tumors (ASCAT) algorithm. Comparison between Sequenza/exome and SNP/ASCAT revealed strong correlation in cellularity estimates (Pearson's r = 0.90) and, after manual inspection of alternative solutions, in ploidy estimates (r = 0.94; r = 0.42 before inspection). This performance was noticeably superior to that of the previously published algorithms. In addition, in artificial data simulating normal-tumor admixtures, Sequenza detected the correct ploidy in samples with tumor content as low as 30%. The agreement between Sequenza and SNP array-based copy number profiles suggests that exome sequencing alone is sufficient not only for identifying small-scale mutations but also for estimating cellularity and inferring DNA copy number aberrations.
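To make the roles of cellularity and ploidy concrete, the following is a minimal sketch of the textbook tumor-normal admixture arithmetic that estimators of this kind typically rely on. It is an illustration under stated assumptions, not the implementation described in the paper: the function names `expected_depth_ratio` and `expected_baf`, the diploid normal genome, and the heterozygous normal genotype are all assumptions introduced here.

```python
def expected_depth_ratio(cellularity, ploidy, tumor_cn, normal_cn=2):
    """Expected tumor/normal depth ratio of a segment (hypothetical helper).

    Assumes the sample is a mix of tumor cells (fraction `cellularity`,
    average copy number `ploidy`) and normal cells with `normal_cn` copies.
    """
    # Average copy number of this segment across all cells in the sample.
    segment_cn = cellularity * tumor_cn + (1 - cellularity) * normal_cn
    # Average copy number of the whole genome across all cells in the sample.
    genome_cn = cellularity * ploidy + (1 - cellularity) * normal_cn
    return segment_cn / genome_cn


def expected_baf(cellularity, tumor_cn, minor_cn, normal_cn=2, normal_minor=1):
    """Expected B-allele frequency at a germline heterozygous site
    (hypothetical helper; assumes a heterozygous normal genotype)."""
    minor = cellularity * minor_cn + (1 - cellularity) * normal_minor
    total = cellularity * tumor_cn + (1 - cellularity) * normal_cn
    return minor / total


# A single-copy gain (3 copies, minor allele 1) in a diploid tumor
# at 30% tumor content, the lowest content mentioned in the abstract.
ratio = expected_depth_ratio(cellularity=0.3, ploidy=2, tumor_cn=3)
baf = expected_baf(cellularity=0.3, tumor_cn=3, minor_cn=1)
print(round(ratio, 3), round(baf, 3))
```

At low tumor content the expected signals move only slightly away from the normal values (depth ratio 1, B-allele frequency 0.5), which illustrates why low-cellularity samples are harder to resolve.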
Pages: 64-70
Number of pages: 7