Improvement of large copy number variant detection by whole genome nanopore sequencing

被引:6
|
作者
Cuenca-Guardiola, Javier [1 ]
de la Morena-Barrio, Belen [2 ,4 ]
Garcia, Juan L. [3 ]
Sanchis-Juan, Alba [4 ,5 ]
Corral, Javier [2 ]
Fernandez-Breis, Jesualdo T. [1 ]
机构
[1] Univ Murcia, Fac Informat, Dept Informat & Sistemas, CEIR Campus Mare Nostrum,IMIB Arrixaca, Campus Espinardo, Murcia 30100, Spain
[2] Univ Murcia, Hosp Univ Morales Meseguer, Ctr Reg Hemodonac, Serv Hematol & Oncol Med,IMIB Arrixaca,CIBERER, Ronda Garay S-N, Murcia 30003, Spain
[3] Univ Salamanca, Univ Hosp Salamanca, Dept Hematol, Inst Invest Biomed IBSAL,Dept Med,Canc Res Ctr IBM, Salamanca, Spain
[4] Univ Cambridge, Dept Haematol, Cambridge Biomed Campus, Cambridge CB2 0PT, England
[5] Cambridge Univ Hosp NHS Fdn, NIHR BioResource, Cambridge Biomed Campus, Cambridge CB2 0QQ, England
关键词
Nanopore; Structural variant; Third-generation sequencing; SERPINC1; STRUCTURAL VARIATION; HYBRIDIZATION; BROWSER;
D O I
10.1016/j.jare.2022.10.012
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Introduction: Whole-genome sequencing using nanopore technologies can uncover structural variants, which are DNA rearrangements larger than 50 base pairs. Nanopore technologies can also characterize their boundaries with single-base accuracy, owing to the kilobase-long reads that encompass either full variants or their junctions. Other methods, such as next-generation short read sequencing or PCR assays, are limited in their capabilities to detect or characterize structural variants. However, the existing software for nanopore sequencing data analysis still reports incomplete variant sets, which also contain erroneous calls, a considerable obstacle for the molecular diagnosis or accurate genotyping of populations. Methods: We compared multiple factors affecting variant calling, such as reference genome version, aligner (minimap2, NGMLR, and lra) choice, and variant caller combinations (Sniffles, CuteSV, SVIM, and NanoVar), to find the optimal group of tools for calling large (>50 kb) deletions and duplications, using data from seven patients exhibiting gross gene defects on SERPINC1 and from a reference variant set as the control. The goal was to obtain the most complete, yet reasonably specific group of large variants using a single cell of PromethION sequencing, which yielded lower depth coverage than short-read sequencing. We also used a custom method for the statistical analysis of the coverage value to refine the resulting datasets.Results: We found that for large deletions and duplications (>50 kb), the existing software performed worse than for smaller ones, in terms of both sensitivity and specificity, and newer tools had not improved this. Our novel software, disCoverage, could polish variant callers' results, improving specificity by up to 62% and sensitivity by 15%, the latter requiring other data or samples.Conclusion: We analyzed the current situation of >50-kb copy number variants with nanopore sequencing, which could be improved. The methods presented in this work could help to identify the known deletions and duplications in a set of patients, while also helping to filter out erroneous calls for these variants, which might aid the efforts to characterize a not-yet well-known fraction of genetic variability in the human genome.& COPY; 2023 The Authors. Published by Elsevier B.V. on behalf of Cairo University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:145 / 158
页数:14
相关论文
共 50 条
  • [21] ConanVarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data
    Gudkov, Mikhail
    Thibaut, Loic
    Khushi, Matloob
    Blue, Gillian M.
    Winlaw, David S.
    Dunwoodie, Sally L.
    Giannoulatou, Eleni
    BMC BIOINFORMATICS, 2023, 24 (01)
  • [22] ConanVarvar: a versatile tool for the detection of large syndromic copy number variation from whole-genome sequencing data
    Mikhail Gudkov
    Loïc Thibaut
    Matloob Khushi
    Gillian M. Blue
    David S. Winlaw
    Sally L. Dunwoodie
    Eleni Giannoulatou
    BMC Bioinformatics, 24
  • [23] Whole exome sequencing-based copy number variant detection in inherited retinal disease
    Coppieters, Frauke
    Van de Sompele, Stijn
    Van Schil, Kristof
    Van Laethem, Thalia
    Six, Rani
    De Jaegere, Sarah
    Meire, Francoise
    De Vries, Meindert
    Balikova, Irina
    De Zaeytijd, Julie
    Leroy, Bart P.
    Rosseel, Toon
    De Baere, Elfride
    INVESTIGATIVE OPHTHALMOLOGY & VISUAL SCIENCE, 2019, 60 (09)
  • [24] Combining callers improves the detection of copy number variants from whole-genome sequencing
    Coutelier, Marie
    Holtgrewe, Manuel
    Jaeger, Marten
    Floettman, Ricarda
    Mensah, Martin A.
    Spielmann, Malte
    Krawitz, Peter
    Horn, Denise
    Beule, Dieter
    Mundlos, Stefan
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2022, 30 (02) : 178 - 186
  • [25] Combining callers improves the detection of copy number variants from whole-genome sequencing
    Marie Coutelier
    Manuel Holtgrewe
    Marten Jäger
    Ricarda Flöttman
    Martin A. Mensah
    Malte Spielmann
    Peter Krawitz
    Denise Horn
    Dieter Beule
    Stefan Mundlos
    European Journal of Human Genetics, 2022, 30 : 178 - 186
  • [26] A Comparison of Tools for Copy-Number Variation Detection in Germline Whole Exome and Whole Genome Sequencing Data
    Gabrielaite, Migle
    Torp, Mathias Husted
    Rasmussen, Malthe Sebro
    Andreu-Sanchez, Sergio
    Vieira, Filipe Garrett
    Pedersen, Christina Bligaard
    Kinalis, Savvas
    Madsen, Majbritt Busk
    Kodama, Miyako
    Demircan, Guel Sude
    Simonyan, Arman
    Yde, Christina Westmose
    Olsen, Lars Ronn
    Marvig, Rasmus L.
    ostrup, Olga
    Rossing, Maria
    Nielsen, Finn Cilius
    Winther, Ole
    Bagger, Frederik Otzen
    CANCERS, 2021, 13 (24)
  • [27] Performance of copy number variants detection based on whole-genome sequencing by DNBSEQ platforms
    Rao, Junhua
    Peng, Lihua
    Liang, Xinming
    Jiang, Hui
    Geng, Chunyu
    Zhao, Xia
    Liu, Xin
    Fan, Guangyi
    Chen, Fang
    Mu, Feng
    BMC BIOINFORMATICS, 2020, 21 (01)
  • [28] Performance of copy number variants detection based on whole-genome sequencing by DNBSEQ platforms
    Junhua Rao
    Lihua Peng
    Xinming Liang
    Hui Jiang
    Chunyu Geng
    Xia Zhao
    Xin Liu
    Guangyi Fan
    Fang Chen
    Feng Mu
    BMC Bioinformatics, 21
  • [29] A validation study of copy number variant (CNVs) detection to replace constitutional microarray from low resolution whole genome sequencing data
    Chaubey, Alka
    Valencia, C. Alexander
    Zeqiang, Ma
    Irzyk, Gerard
    Szekeres, Edward, Jr.
    Markovic, Zdenek
    Wang, Yang
    Tanner, Alice
    Collins, Christin
    Hegde, Madhuri
    CANCER GENETICS, 2018, 224 : 56 - 56
  • [30] Genome-wide detection of copy number variation in American mink using whole-genome sequencing
    Pourya Davoudi
    Duy Ngoc Do
    Bruce Rathgeber
    Stefanie M. Colombo
    Mehdi Sargolzaei
    Graham Plastow
    Zhiquan Wang
    Karim Karimi
    Guoyu Hu
    Shafagh Valipour
    Younes Miar
    BMC Genomics, 23