A highly sensitive and specific workflow for detecting rare copy-number variants from exome sequencing data

被引:38
|
作者
Rajagopalan, Ramakrishnan [1 ,2 ]
Murrell, Jill R. [1 ,3 ]
Luo, Minjie [1 ,3 ]
Conlin, Laura K. [1 ,3 ]
机构
[1] Childrens Hosp Philadelphia, Dept Pathol & Lab Med, Div Genom Diagnost, Philadelphia, PA 19104 USA
[2] Drexel Univ, Sch Biomed Engn Sci & Hlth Syst, Philadelphia, PA 19104 USA
[3] Univ Penn, Dept Pathol & Lab Med, Perelman Sch Med, Philadelphia, PA 19104 USA
基金
美国国家卫生研究院;
关键词
Clinical exome sequencing; Copy-number variation; DETECTION TOOLS; DISCOVERY; RESOURCE; GENES; MODEL; SNP;
D O I
10.1186/s13073-020-0712-0
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background Exome sequencing (ES) is a first-tier diagnostic test for many suspected Mendelian disorders. While it is routine to detect small sequence variants, it is not a standard practice in clinical settings to detect germline copy-number variants (CNVs) from ES data due to several reasons relating to performance. In this work, we comprehensively characterized one of the most sensitive ES-based CNV tools, ExomeDepth, against SNP array, a standard of care test in clinical settings to detect genome-wide CNVs. Methods We propose a modified ExomeDepth workflow by excluding exons with low mappability prior to variant calling to drastically reduce the false positives originating from the repetitive regions of the genome, and an iterative variant calling framework to assess the reproducibility. We used a cohort of 307 individuals with clinical ES data and clinical SNP array to estimate the sensitivity and false discovery rate of the CNV detection using exome sequencing. Further, we performed targeted testing of the STRC gene in 1972 individuals. To reduce the number of variants for downstream analysis, we performed a large-scale iterative variant calling process with random control cohorts to assess the reproducibility of the CNVs. Results The modified workflow presented in this paper reduced the number of total variants identified by one third while retaining a higher sensitivity of 97% and resulted in an improved false discovery rate of 11.4% compared to the default ExomeDepth pipeline. The exclusion of exons with low mappability removes 4.5% of the exons, including a subset of exons (0.6%) in disease-associated genes which are intractable by short-read next-generation sequencing (NGS). Results from the reproducibility analysis showed that the clinically reported variants were reproducible 100% of the time and that the modified workflow can be used to rank variants from high to low confidence. Targeted testing of 30 CNVs identified in STRC, a challenging gene to ascertain by NGS, showed a 100% validation rate. Conclusions In summary, we introduced a modification to the default ExomeDepth workflow to reduce the false positives originating from the repetitive regions of the genome, created a large-scale iterative variant calling framework for reproducibility, and provided recommendations for implementation in clinical settings.
引用
收藏
页数:11
相关论文
共 50 条
  • [41] Exome sequencing-based copy-number variation and loss of heterozygosity detection: ExomeCNV
    Sathirapongsasuti, Jarupon Fah
    Lee, Hane
    Horst, Basil A. J.
    Brunner, Georg
    Cochran, Alistair J.
    Binder, Scott
    Quackenbush, John
    Nelson, Stanley F.
    BIOINFORMATICS, 2011, 27 (19) : 2648 - 2654
  • [42] Gene discovery and functional assessment of rare copy-number variants in neurodevelopmental disorders
    Iyer, Janani
    Girirajan, Santhosh
    BRIEFINGS IN FUNCTIONAL GENOMICS, 2015, 14 (05) : 315 - 328
  • [43] Exome sequencing data analysis to chracterize rare germline copy number variants involved in colorectal cancer and serrated polyposis syndrome predisposition
    Franch-Exposito, S.
    Esteban-Jurado, C.
    Garre, P.
    Quintanilla, I.
    Duran, S.
    Hernandez-Illan, E.
    Cuatrecasas, M.
    Samper, E.
    Munoz, J.
    Diaz-Gay, M.
    Ocana, T.
    Carballal, S.
    Castells, A.
    Vila-Casadesus, M.
    Serra, E.
    Derdak, S.
    Laurie, S.
    Beltran, S.
    Carvajal, J.
    Bujanda, L.
    Ruiz-Ponte, C.
    Camps, J.
    Gironella, M.
    Lozano, J.
    Balaguer, F.
    Cubiella, J.
    Caldes, T.
    Castellvi-Bel, S.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2018, 26 : 551 - 552
  • [44] Enhanced copy number variants detection from whole-exome sequencing data using EXCAVATOR2
    D'Aurizio, Romina
    Pippucci, Tommaso
    Tattini, Lorenzo
    Giusti, Betti
    Pellegrini, Marco
    Magi, Alberto
    NUCLEIC ACIDS RESEARCH, 2016, 44 (20)
  • [45] Detection of copy number variants and loss of heterozygosity from impure tumor samples using whole exome sequencing data
    Liu, Xiaocheng
    Li, Ao
    Xi, Jianing
    Feng, Huanqing
    Wang, Minghui
    ONCOLOGY LETTERS, 2018, 16 (04) : 4713 - 4720
  • [46] cnvOffSeq: detecting intergenic copy number variation using off-target exome sequencing data
    Bellos, Evangelos
    Coin, Lachlan J. M.
    BIOINFORMATICS, 2014, 30 (17) : I639 - I645
  • [47] A machine-learning approach for accurate detection of copy number variants from exome sequencing
    Pounraja, Vijay Kumar
    Jayakar, Gopal
    Jensen, Matthew
    Kelkar, Neil
    Girirajan, Santhosh
    GENOME RESEARCH, 2019, 29 (07) : 1134 - 1143
  • [48] RBV: Allele-specific copy-number validation of whole genome sequence and whole exome sequence data
    Whitford, W.
    Lehnert, K.
    Snell, R. G.
    Jacobsen, J. C.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2018, 26 : 705 - 706
  • [49] Allele-specific copy-number based deconvolution of bulk tumour RNA sequencing data from the TRACERx study
    Castignani, Carla
    Demeulemeester, Jonas
    Cadieux, Elizabeth Larose
    Hynds, Robert E.
    Pearce, David R.
    Dentro, Stefan C.
    Van Loo, Peter
    Swanton, Charles
    Consortium, Tracerx
    CANCER RESEARCH, 2022, 82 (12)
  • [50] Detection of Clinically Relevant Copy Number Variants with Whole-Exome Sequencing
    de Ligt, Joep
    Boone, Philip M.
    Pfundt, Rolph
    Vissers, Lisenka E. L. M.
    Richmond, Todd
    Geoghegan, Joel
    O'Moore, Kathleen
    de Leeuw, Nicole
    Shaw, Christine
    Brunner, Han G.
    Lupski, James R.
    Veltman, Joris A.
    Hehir-Kwa, Jayne Y.
    HUMAN MUTATION, 2013, 34 (10) : 1439 - 1448