A highly sensitive and specific workflow for detecting rare copy-number variants from exome sequencing data

Cited by: 38
Authors
Rajagopalan, Ramakrishnan [1, 2]
Murrell, Jill R. [1, 3]
Luo, Minjie [1, 3]
Conlin, Laura K. [1, 3]
Affiliations
[1] Childrens Hosp Philadelphia, Dept Pathol & Lab Med, Div Genom Diagnost, Philadelphia, PA 19104 USA
[2] Drexel Univ, Sch Biomed Engn Sci & Hlth Syst, Philadelphia, PA 19104 USA
[3] Univ Penn, Dept Pathol & Lab Med, Perelman Sch Med, Philadelphia, PA 19104 USA
Funding
National Institutes of Health (USA);
Keywords
Clinical exome sequencing; Copy-number variation; DETECTION TOOLS; DISCOVERY; RESOURCE; GENES; MODEL; SNP;
DOI
10.1186/s13073-020-0712-0
Chinese Library Classification number
Q3 [Genetics];
Subject classification code
071007 ; 090102 ;
Abstract
Background: Exome sequencing (ES) is a first-tier diagnostic test for many suspected Mendelian disorders. Although detection of small sequence variants is routine, detecting germline copy-number variants (CNVs) from ES data is not standard practice in clinical settings, for several performance-related reasons. In this work, we comprehensively characterized one of the most sensitive ES-based CNV tools, ExomeDepth, against SNP array, a standard-of-care test for detecting genome-wide CNVs in clinical settings.
Methods: We propose a modified ExomeDepth workflow that excludes exons with low mappability prior to variant calling, to drastically reduce the false positives originating from the repetitive regions of the genome, together with an iterative variant calling framework to assess reproducibility. We used a cohort of 307 individuals with clinical ES data and clinical SNP array data to estimate the sensitivity and false discovery rate of CNV detection using exome sequencing. Further, we performed targeted testing of the STRC gene in 1972 individuals. To reduce the number of variants for downstream analysis, we performed a large-scale iterative variant calling process with random control cohorts to assess the reproducibility of the CNVs.
Results: The modified workflow presented in this paper reduced the total number of variants identified by one third while retaining a high sensitivity of 97%, and resulted in an improved false discovery rate of 11.4% compared to the default ExomeDepth pipeline. Excluding exons with low mappability removes 4.5% of the exons, including a subset of exons (0.6%) in disease-associated genes that are intractable by short-read next-generation sequencing (NGS). Results from the reproducibility analysis showed that the clinically reported variants were reproducible 100% of the time and that the modified workflow can be used to rank variants from high to low confidence. Targeted testing of 30 CNVs identified in STRC, a challenging gene to ascertain by NGS, showed a 100% validation rate.
Conclusions: In summary, we introduced a modification to the default ExomeDepth workflow to reduce the false positives originating from the repetitive regions of the genome, created a large-scale iterative variant calling framework for reproducibility, and provided recommendations for implementation in clinical settings.
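The three ideas named in the abstract (excluding low-mappability exons before calling, repeating variant calling against random control cohorts, and scoring calls against SNP array) can be illustrated with a minimal Python sketch. This is not the authors' implementation and does not call ExomeDepth, which is an R package. The function names, the `min_score` threshold, the per-exon mappability scores, and the `call_cnvs` callback are all hypothetical stand-ins introduced only for illustration; the sensitivity and false discovery rate use the standard definitions, which may differ in detail from how the paper matched calls to array findings.

```python
import random
from collections import Counter
from typing import Callable, Dict, Iterable, List, Sequence, Set, Tuple

Exon = Tuple[str, int, int]  # (chromosome, start, end)


def filter_exons(exons: Iterable[Exon],
                 mappability: Dict[Exon, float],
                 min_score: float = 1.0) -> List[Exon]:
    """Keep exons whose mappability score meets an ASSUMED threshold.

    Exons with no score are treated as unmappable and dropped.
    """
    return [e for e in exons if mappability.get(e, 0.0) >= min_score]


def reproducibility(call_cnvs: Callable[[Sequence[str]], Iterable[str]],
                    controls: Sequence[str],
                    n_controls: int,
                    n_iter: int,
                    seed: int = 0) -> Dict[str, float]:
    """Fraction of iterations in which each CNV is called.

    Each iteration draws a random control cohort and re-runs the caller
    (`call_cnvs` is a hypothetical stand-in for the real CNV caller).
    """
    rng = random.Random(seed)
    counts: Counter = Counter()
    for _ in range(n_iter):
        cohort = rng.sample(list(controls), n_controls)
        counts.update(set(call_cnvs(cohort)))
    return {cnv: n / n_iter for cnv, n in counts.items()}


def sensitivity_and_fdr(called: Set[str], truth: Set[str]) -> Tuple[float, float]:
    """Sensitivity = TP / truth; false discovery rate = FP / called."""
    tp = len(called & truth)
    sensitivity = tp / len(truth) if truth else 0.0
    fdr = (len(called) - tp) / len(called) if called else 0.0
    return sensitivity, fdr


if __name__ == "__main__":
    # Toy caller: one CNV is always found, one depends on the control cohort.
    def toy_caller(cohort: Sequence[str]) -> List[str]:
        return ["del_A", "dup_B"] if "c1" in cohort else ["del_A"]

    scores = reproducibility(toy_caller, ["c1", "c2", "c3", "c4"], 2, 50)
    # Rank calls from high to low confidence by reproducibility.
    for cnv, score in sorted(scores.items(), key=lambda kv: -kv[1]):
        print(cnv, round(score, 2))
```

Sorting calls by the returned fraction gives a ranking from high to low confidence, in the spirit of the abstract; a call that appears in every iteration scores 1.0.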
Pages: 11
Related papers
50 in total
  • [31] Exome sequencing identified rare recurrent copy number variants and hereditary breast cancer susceptibility
    Mantere, Tuomo
    Kumpula, Timo
    Vorimo, Sandra
    Mattila, Taneli
    O'Gorman, Luke
    Astuti, Galuh
    Tervasmaki, Anna
    Koivuluoma, Susanna
    Mattila, Tiina
    Grip, Mervi
    Winqvist, Robert
    Kuismin, Outi
    Moilanen, Jukka
    Hoischen, Alexander
    Gilissen, Christian
    Pylkas, Katri
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 1250 - 1250
  • [32] Exome sequencing identified rare recurrent copy number variants and hereditary breast cancer susceptibility
    Kumpula, Timo A.
    Vorimo, Sandra
    Mattila, Taneli T.
    O'Gorman, Luke
    Astuti, Galuh
    Tervasmaki, Anna
    Koivuluoma, Susanna
    Mattila, Tiina M.
    Grip, Mervi
    Winqvist, Robert
    Kuismin, Outi
    Moilanen, Jukka
    Hoischen, Alexander
    Gilissen, Christian
    Mantere, Tuomo
    Pylkas, Katri
    PLOS GENETICS, 2023, 19 (08):
  • [33] Rare germline copy number variants in colorectal cancer predisposition characterized by exome sequencing analysis
    Franch-Exposito, Sebastia
    Esteban-Jurado, Clara
    Garre, Pilar
    Quintanilla, Isabel
    Duran-Sanchon, Saray
    Diaz-Gay, Marcos
    Bonjoch, Laia
    Cuatrecasas, Miriam
    Samper, Esther
    Munoz, Jenifer
    Ocana, Teresa
    Carballal, Sabela
    Lopez-Ceron, Maria
    Castells, Antoni
    Vila-Casadesus, Maria
    Derdak, Sophia
    Laurie, Steven
    Beltran, Sergi
    Carvajal, Jaime
    Bujanda, Luis
    Ruiz-Ponte, Clara
    Camps, Jordi
    Gironella, Meritxell
    Jose Lozano, Juan
    Balaguer, Francesc
    Cubiella, Joaquin
    Caldes, Trinidad
    Castellvi-Bel, Sergi
    JOURNAL OF GENETICS AND GENOMICS, 2018, 45 (01) : 41 - 45
  • [34] Discovery and Statistical Genotyping of Copy-Number Variation from Whole-Exome Sequencing Depth
    Fromer, Menachem
    Moran, Jennifer L.
    Chambert, Kimberly
    Banks, Eric
    Bergen, Sarah E.
    Ruderfer, Douglas M.
    Handsaker, Robert E.
    McCarroll, Steven A.
    O'Donovan, Michael C.
    Owen, Michael J.
    Kirov, George
    Sullivan, Patrick F.
    Hultman, Christina M.
    Sklar, Pamela
    Purcell, Shaun M.
    AMERICAN JOURNAL OF HUMAN GENETICS, 2012, 91 (04) : 597 - 607
  • [35] A Comparison of Tools for Copy-Number Variation Detection in Germline Whole Exome and Whole Genome Sequencing Data
    Gabrielaite, Migle
    Torp, Mathias Husted
    Rasmussen, Malthe Sebro
    Andreu-Sanchez, Sergio
    Vieira, Filipe Garrett
    Pedersen, Christina Bligaard
    Kinalis, Savvas
    Madsen, Majbritt Busk
    Kodama, Miyako
    Demircan, Guel Sude
    Simonyan, Arman
    Yde, Christina Westmose
    Olsen, Lars Ronn
    Marvig, Rasmus L.
    Ostrup, Olga
    Rossing, Maria
    Nielsen, Finn Cilius
    Winther, Ole
    Bagger, Frederik Otzen
    CANCERS, 2021, 13 (24)
  • [36] VisCap: inference and visualization of germ-line copy-number variants from targeted clinical sequencing data
    Pugh, Trevor J.
    Amr, Sami S.
    Bowser, Mark J.
    Gowrisankar, Sivakumar
    Hynes, Elizabeth
    Mahanta, Lisa M.
    Rehm, Heidi L.
    Funke, Birgit
    Lebo, Matthew S.
    GENETICS IN MEDICINE, 2016, 18 (07) : 712 - 719
  • [37] Application of whole-exome sequencing for detecting copy number variants in CMT1A/HNPP
    Jo, H. -Y.
    Park, M. -H.
    Woo, H. -M.
    Han, M. H.
    Kim, B. -Y.
    Choi, B. -O.
    Chung, K. W.
    Koo, S. K.
    CLINICAL GENETICS, 2016, 90 (02) : 177 - 181
  • [38] Copy Number Analysis of Whole Exome Sequencing Data
    Madubata, Chinwe
    Bi, Xin
    Pang, Jiuhong
    Gu, Yue
    Koganti, Lahari
    Liao, Jun
    Hsiao, Susan
    Aggarwal, Vimla
    Mansukhani, Mahesh
    Jobanputra, Vaidehi
    AMERICAN JOURNAL OF CLINICAL PATHOLOGY, 2024, 162 : S158 - S159
  • [39] An Expanded Association Approach for Rare Germline Variants with Copy-Number Alternation
    Geng, Yu
    Zhao, Zhongmeng
    Cui, Daibin
    Zheng, Tian
    Zhang, Xuanping
    Xiao, Xiao
    Wang, Jiayin
    BIOINFORMATICS AND BIOMEDICAL ENGINEERING, IWBBIO 2017, PT II, 2017, 10209 : 81 - 94
  • [40] COMBINATORIAL ANALYSIS OF EXOME SEQUENCING DATA AND COPY NUMBER VARIANTS IN CONGENITAL HEART DISEASE PATIENTS
    Fotiou, Elisavet
    Williams, Simon
    Keavney, Bernard
    HEART, 2017, 103 : A115 - A116