Improvement of large copy number variant detection by whole genome nanopore sequencing

被引:6
|
作者
Cuenca-Guardiola, Javier [1 ]
de la Morena-Barrio, Belen [2 ,4 ]
Garcia, Juan L. [3 ]
Sanchis-Juan, Alba [4 ,5 ]
Corral, Javier [2 ]
Fernandez-Breis, Jesualdo T. [1 ]
机构
[1] Univ Murcia, Fac Informat, Dept Informat & Sistemas, CEIR Campus Mare Nostrum,IMIB Arrixaca, Campus Espinardo, Murcia 30100, Spain
[2] Univ Murcia, Hosp Univ Morales Meseguer, Ctr Reg Hemodonac, Serv Hematol & Oncol Med,IMIB Arrixaca,CIBERER, Ronda Garay S-N, Murcia 30003, Spain
[3] Univ Salamanca, Univ Hosp Salamanca, Dept Hematol, Inst Invest Biomed IBSAL,Dept Med,Canc Res Ctr IBM, Salamanca, Spain
[4] Univ Cambridge, Dept Haematol, Cambridge Biomed Campus, Cambridge CB2 0PT, England
[5] Cambridge Univ Hosp NHS Fdn, NIHR BioResource, Cambridge Biomed Campus, Cambridge CB2 0QQ, England
关键词
Nanopore; Structural variant; Third-generation sequencing; SERPINC1; STRUCTURAL VARIATION; HYBRIDIZATION; BROWSER;
D O I
10.1016/j.jare.2022.10.012
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Introduction: Whole-genome sequencing using nanopore technologies can uncover structural variants, which are DNA rearrangements larger than 50 base pairs. Nanopore technologies can also characterize their boundaries with single-base accuracy, owing to the kilobase-long reads that encompass either full variants or their junctions. Other methods, such as next-generation short read sequencing or PCR assays, are limited in their capabilities to detect or characterize structural variants. However, the existing software for nanopore sequencing data analysis still reports incomplete variant sets, which also contain erroneous calls, a considerable obstacle for the molecular diagnosis or accurate genotyping of populations. Methods: We compared multiple factors affecting variant calling, such as reference genome version, aligner (minimap2, NGMLR, and lra) choice, and variant caller combinations (Sniffles, CuteSV, SVIM, and NanoVar), to find the optimal group of tools for calling large (>50 kb) deletions and duplications, using data from seven patients exhibiting gross gene defects on SERPINC1 and from a reference variant set as the control. The goal was to obtain the most complete, yet reasonably specific group of large variants using a single cell of PromethION sequencing, which yielded lower depth coverage than short-read sequencing. We also used a custom method for the statistical analysis of the coverage value to refine the resulting datasets.Results: We found that for large deletions and duplications (>50 kb), the existing software performed worse than for smaller ones, in terms of both sensitivity and specificity, and newer tools had not improved this. Our novel software, disCoverage, could polish variant callers' results, improving specificity by up to 62% and sensitivity by 15%, the latter requiring other data or samples.Conclusion: We analyzed the current situation of >50-kb copy number variants with nanopore sequencing, which could be improved. The methods presented in this work could help to identify the known deletions and duplications in a set of patients, while also helping to filter out erroneous calls for these variants, which might aid the efforts to characterize a not-yet well-known fraction of genetic variability in the human genome.& COPY; 2023 The Authors. Published by Elsevier B.V. on behalf of Cairo University. This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
引用
收藏
页码:145 / 158
页数:14
相关论文
共 50 条
  • [1] Long insert whole genome sequencing for copy number variant and translocation detection
    Liang, Winnie S.
    Aldrich, Jessica
    Tembe, Waibhav
    Kurdoglu, Ahmet
    Cherni, Irene
    Phillips, Lori
    Reiman, Rebecca
    Baker, Angela
    Weiss, Glen J.
    Carpten, John D.
    Craig, David W.
    NUCLEIC ACIDS RESEARCH, 2014, 42 (02)
  • [2] Clinical utility of copy number variant (CNV) detection by whole genome sequencing (WGS)
    Lundie, B.
    Minoche, A. E.
    Gayevskiy, V.
    Lee, E.
    Ewans, L.
    Hollway, G.
    Ohnesorg, T.
    Sherstyuk, A.
    Dinger, M.
    Cowley, M. J.
    Burnett, L.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2019, 27 : 502 - 502
  • [3] Exome sequencing and whole genome sequencing for the detection of copy number variation
    Hehir-Kwa, Jayne Y.
    Pfundt, Rolph
    Veltman, Joris A.
    EXPERT REVIEW OF MOLECULAR DIAGNOSTICS, 2015, 15 (08) : 1023 - 1032
  • [4] Whole genome sequencing for copy number variation and structural variant analyses
    Chatron, Nicolas
    Bernard, Virginie
    Richard, Celine
    Salaun, Gaelle
    Gouas, Laetitia
    Michel-Calemard, Laurence
    Charret, Quentin
    Klein, Valentin
    Fancello, Laura
    Nicolas, Laury
    Viari, Alain
    Ferrari, Anthony
    Blay, Jean Yves
    Till, Marianne
    Touraine, Renaud
    Ramond, Francis
    Harzallah, Ines
    Harbuz, Radu
    Satre, Veronique
    Schluth-Bolard, Caroline
    Vago, Philippe
    Coutton, Charles
    Vinciguerra, Christine
    Sanlaville, Damien
    Thevenon, Julien
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2023, 31 : 619 - 620
  • [5] Whole genome sequencing for copy number variant detection to improve diagnosis and management of rare diseases
    Bowman, Pamela
    Grimes, Hannah
    Dallosso, Anthony R.
    Berry, Ian
    Mullin, Stephen
    Rankin, Julia
    Low, Karen J.
    DEVELOPMENTAL MEDICINE AND CHILD NEUROLOGY, 2025, 67 (01): : 126 - 131
  • [6] ClinSV: clinical grade structural and copy number variant detection from whole genome sequencing data
    Andre E. Minoche
    Ben Lundie
    Greg B. Peters
    Thomas Ohnesorg
    Mark Pinese
    David M. Thomas
    Andreas Zankl
    Tony Roscioli
    Nicole Schonrock
    Sarah Kummerfeld
    Leslie Burnett
    Marcel E. Dinger
    Mark J. Cowley
    Genome Medicine, 13
  • [7] ClinSV: clinical grade structural and copy number variant detection from whole genome sequencing data
    Minoche, Andre E.
    Lundie, Ben
    Peters, Greg B.
    Ohnesorg, Thomas
    Pinese, Mark
    Thomas, David M.
    Zankl, Andreas
    Roscioli, Tony
    Schonrock, Nicole
    Kummerfeld, Sarah
    Burnett, Leslie
    Dinger, Marcel E.
    Cowley, Mark J.
    GENOME MEDICINE, 2021, 13 (01)
  • [8] Multiplex structural variant detection by whole-genome mapping and nanopore sequencing
    Lahari Uppuluri
    Yilin Wang
    Eleanor Young
    Jessica S. Wong
    Heba Z. Abid
    Ming Xiao
    Scientific Reports, 12
  • [9] Multiplex structural variant detection by whole-genome mapping and nanopore sequencing
    Uppuluri, Lahari
    Wang, Yilin
    Young, Eleanor
    Wong, Jessica S.
    Abid, Heba Z.
    Xiao, Ming
    SCIENTIFIC REPORTS, 2022, 12 (01)
  • [10] Evaluation of the performance of copy number variant prediction tools for the detection of deletions from whole genome sequencing data
    Whitford, Whitney
    Lehnert, Klaus
    Snell, Russell G.
    Jacobsen, Jessie C.
    JOURNAL OF BIOMEDICAL INFORMATICS, 2019, 94