Replicate whole-genome next-generation sequencing data derived from Caucasian donor saliva samples

Authors
Hansen, Marcus Hay [1, 2, 3, 4]
Nyvold, Charlotte Guldborg [1, 2, 3, 4]
Affiliations
[1] Univ Southern Denmark, Res Unit Haematol, Haematol Pathol Res Lab, Odense, Denmark
[2] Univ Southern Denmark, Res Unit Pathol, Odense, Denmark
[3] Odense Univ Hosp, Odense, Denmark
[4] Odense Univ Hosp, Dept Hematol, Odense, Denmark
Source
Data in Brief | 2021 / Vol. 38
Keywords
Whole-genome; Homo Sapiens genome; Next-generation sequencing (NGS); DNA sequencing; Raw data replicate;
DOI
10.1016/j.dib.2021.107349
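Chinese Library Classification (CLC)
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences];
Discipline classification codes
07 ; 0710 ; 09 ;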
Abstract
Next-generation sequencing (NGS) of whole genomes has become more accessible to biomedical researchers as the sequencing price continues to drop, and more laboratories have NGS facilities or have access to a core facility. However, the rapid and robust development of practical bioinformatics pipelines partly depends on convenient access to data for the testing of algorithms. Publicly available data sets constitute a part of this strategy. Here, we provide a triplicate whole-genome paired-end sequencing data set, consisting of 1.38 billion raw sequencing reads derived from saliva DNA from a single anonymous male Caucasian donor, with the average sequencing depths aimed at 30x for two of the samples and 4x for a low-coverage sample. The raw number of single nucleotide variants was 3.3-4 million per sample, and the median variant read depths of GATK4-passed variants in the three samples were 22, 18, and 10. Of all variants, 81% were found in two or three of the samples, whereas 19% were singletons. The karyotype was evaluated as 46,XY with no apparent copy-number variation. The data set is provided without restrictions for research, educational or commercial purposes. © 2021 The Author(s). Published by Elsevier Inc.
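The shared-versus-singleton split reported in the abstract can be illustrated with a minimal sketch. This is not the authors' pipeline: the function name `replicate_concordance`, the choice to key a variant by (chromosome, position, ref, alt), and the toy call sets below are all assumptions made for illustration, and the numbers are made up rather than taken from the data set.

```python
from collections import Counter


def replicate_concordance(call_sets):
    """Return (shared_fraction, singleton_fraction) over the union of variants.

    A variant is "shared" if it appears in two or more replicates and a
    "singleton" if it appears in exactly one.
    """
    counts = Counter()
    for calls in call_sets:
        counts.update(set(calls))  # each replicate counts a variant at most once
    total = len(counts)
    if total == 0:
        return 0.0, 0.0
    singletons = sum(1 for n in counts.values() if n == 1)
    return (total - singletons) / total, singletons / total


# Hypothetical toy call sets; variants keyed as (chromosome, position, ref, alt).
rep_a = {("chr1", 100, "A", "G"), ("chr1", 200, "C", "T"), ("chr2", 50, "G", "A")}
rep_b = {("chr1", 100, "A", "G"), ("chr1", 200, "C", "T"), ("chr3", 75, "T", "C")}
rep_c = {("chr1", 100, "A", "G"), ("chr2", 50, "G", "A")}

shared, single = replicate_concordance([rep_a, rep_b, rep_c])
print(f"shared: {shared:.0%}, singletons: {single:.0%}")
# prints "shared: 75%, singletons: 25%"
```

In practice the call sets would be read from each replicate's VCF file, and the result depends on how variants are normalized and filtered before comparison.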
Pages: 5