Direct comparison of performance of single nucleotide variant calling in human genome with alignment-based and assembly-based approaches

Authors
Leihong Wu
Gokhan Yavas
Huixiao Hong
Weida Tong
Wenming Xiao
Affiliations
[1] National Center for Toxicological Research
[2] US Food and Drug Administration
Source
Scientific Reports, Volume 7
Abstract
Recent studies have shown that de novo assembled genomes can reveal many novel variants, complementing reference-based variant detection. To evaluate the effect of read coverage and the accuracy of assembly-based variant calling, we simulated short reads containing more than 3 million single nucleotide variants (SNVs) from the whole human genome and compared the efficiency of SNV calling between the assembly-based and alignment-based approaches. We assessed the quality of the assembled contigs and found that a minimum of 30X short-read coverage was needed to ensure reliable SNV calling and to generate assembled contigs with good coverage of the genome and genes. In addition, we observed that the assembly-based approach had much lower recall and precision than the alignment-based approach, which recovered 99% of imputed SNVs. We observed similar results with experimental reads for NA24385, an individual whose germline variants are well characterized. Although it adds value for SNV detection, the assembly-based approach carries a great risk of false discovery of novel SNVs. Further improvement of de novo assembly algorithms is needed to ensure good genome completeness with resolved haplotypes and high fidelity of the assembled sequences.
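The recall and precision figures mentioned in the abstract can be illustrated with a minimal sketch. It assumes each SNV is represented as a (chromosome, position, reference allele, alternate allele) tuple and that a call counts as correct only on an exact match with the truth set. The function name `snv_precision_recall` and the toy variants are hypothetical and are not taken from the paper, whose actual evaluation pipeline is not described here.

```python
from typing import Iterable, Tuple

# Hypothetical representation of an SNV: (chromosome, 1-based position, ref, alt).
Variant = Tuple[str, int, str, str]


def snv_precision_recall(
    called: Iterable[Variant], truth: Iterable[Variant]
) -> Tuple[float, float]:
    """Return (precision, recall) of called SNVs against a truth set.

    Assumption: a call is a true positive only if chromosome, position,
    reference allele, and alternate allele all match exactly.
    """
    called_set = set(called)
    truth_set = set(truth)
    true_positives = len(called_set & truth_set)
    # Precision: fraction of calls that are in the truth set.
    precision = true_positives / len(called_set) if called_set else 0.0
    # Recall: fraction of truth-set SNVs that were called.
    recall = true_positives / len(truth_set) if truth_set else 0.0
    return precision, recall


# Toy data for illustration only (not from the study).
truth_snvs = [
    ("chr1", 100, "A", "G"),
    ("chr1", 250, "C", "T"),
    ("chr2", 75, "G", "A"),
    ("chr2", 900, "T", "C"),
]
called_snvs = [
    ("chr1", 100, "A", "G"),  # matches the truth set
    ("chr2", 75, "G", "A"),   # matches the truth set
    ("chr3", 10, "A", "C"),   # not in the truth set (false positive)
]

precision, recall = snv_precision_recall(called_snvs, truth_snvs)
print(f"precision={precision:.3f} recall={recall:.3f}")
```

In this toy case two of three calls are correct and two of four true SNVs are recovered. A low precision of this kind is what the abstract describes as a risk of false discovery of novel SNVs.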