An accurate assignment test for extremely low-coverage whole-genome sequence data

被引:9
|
作者
Ferrari, Giada [1 ]
Atmore, Lane M. [1 ]
Jentoft, Sissel [1 ]
Jakobsen, Kjetill S. [1 ]
Makowiecki, Daniel [2 ]
Barrett, James H. [3 ,4 ]
Star, Bastiaan [1 ]
机构
[1] Univ Oslo, Ctr Ecol & Evolutionary Synth, Dept Biosci, Oslo, Norway
[2] Nicolaus Copernicus Univ, Inst Archaeol, Dept Environm Archaeol & Human Paleoecol, Torun, Poland
[3] Univ Cambridge, McDonald Inst Archaeol Res, Dept Archaeol, Cambridge, England
[4] NTNU Univ Museum, Dept Archaeol & Cultural Hist, Trondheim, Norway
关键词
chromosomal inversion; ecotype; genome skimming; haplotype; population assignment; ATLANTIC COD; ADAPTIVE EVOLUTION; LOCAL ADAPTATION; ANCIENT DNA; INVERSION POLYMORPHISMS; REPRODUCTIVE ISOLATION; DIVERGENCE; ASSOCIATION; MIMICRY; DIFFERENTIATION;
D O I
10.1111/1755-0998.13551
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Genomic assignment tests can provide important diagnostic biological characteristics, such as population of origin or ecotype. Yet, assignment tests often rely on moderate- to high-coverage sequence data that can be difficult to obtain for fields such as molecular ecology and ancient DNA. We have developed a novel approach that efficiently assigns biologically relevant information (i.e., population identity or structural variants such as inversions) in extremely low-coverage sequence data. First, we generate databases from existing reference data using a subset of diagnostic single nucleotide polymorphisms (SNPs) associated with a biological characteristic. Low-coverage alignment files are subsequently compared to these databases to ascertain allelic state, yielding a joint probability for each association. To assess the efficacy of this approach, we assigned haplotypes and population identity in Heliconius butterflies, Atlantic herring, and Atlantic cod using chromosomal inversion sites and whole-genome data. We scored both modern and ancient specimens, including the first whole-genome sequence data recovered from ancient Atlantic herring bones. The method accurately assigns biological characteristics, including population membership, using extremely low-coverage data (as low as 0.0001x) based on genome-wide SNPs. This approach will therefore increase the number of samples in evolutionary, ecological and archaeological research for which relevant biological information can be obtained.
引用
收藏
页码:1330 / 1344
页数:15
相关论文
共 50 条
  • [31] Low-coverage and cost-effective whole-genome sequencing assay for glioma risk stratification
    Qin, Jia-Jun
    Xue, Fei
    Shen, Zhao-Li
    Chen, Xian-Zhen
    JOURNAL OF CANCER RESEARCH AND CLINICAL ONCOLOGY, 2023, 149 (11) : 8359 - 8367
  • [32] LoCo-Pipe: an automated pipeline for population genomics with low-coverage whole-genome sequencing
    Zhou, Zehua
    Sudmant, Peter
    Lou, Runyang Nicolas
    INTEGRATIVE AND COMPARATIVE BIOLOGY, 2024, 64 : S580 - S580
  • [33] A method for positive forensic identification of samples from extremely low-coverage sequence data
    Vohr, Samuel H.
    Najar, Carlos Fernando Buen Abad
    Shapiro, Beth
    Green, Richard E.
    BMC GENOMICS, 2015, 16
  • [34] A method for positive forensic identification of samples from extremely low-coverage sequence data
    Samuel H. Vohr
    Carlos Fernando Buen Abad Najar
    Beth Shapiro
    Richard E. Green
    BMC Genomics, 16
  • [35] Screening of triploid with low-coverage whole-genome sequencing by a single-nucleotide polymorphism-based test in miscarriage tissue
    Geng, Qian
    Cui, Xiaoli
    Zhang, Yaqi
    Zhang, Lijuan
    Zhang, Cai
    Wang, Kai
    Chen, Jianguo
    Zhu, Qingyan
    Xie, Jiansheng
    Xu, Zhiyong
    Liu, Yang
    Zhang, MengMeng
    Ding, Lijie
    Zhang, Wenyong
    Yang, Chuanchun
    JOURNAL OF ASSISTED REPRODUCTION AND GENETICS, 2019, 36 (12) : 2525 - 2531
  • [36] Screening of triploid with low-coverage whole-genome sequencing by a single-nucleotide polymorphism-based test in miscarriage tissue
    Qian Geng
    Xiaoli Cui
    Yaqi Zhang
    Lijuan Zhang
    Cai Zhang
    Kai Wang
    Jianguo Chen
    Qingyan Zhu
    Jiansheng Xie
    Zhiyong Xu
    Yang Liu
    MengMeng Zhang
    Lijie Ding
    Wenyong Zhang
    Chuanchun Yang
    Journal of Assisted Reproduction and Genetics, 2019, 36 : 2525 - 2531
  • [37] An improved approach for accurate and efficient calling of structural variations with low-coverage sequence data
    Jin Zhang
    Jiayin Wang
    Yufeng Wu
    BMC Bioinformatics, 13
  • [38] An improved approach for accurate and efficient calling of structural variations with low-coverage sequence data
    Zhang, Jin
    Wang, Jiayin
    Wu, Yufeng
    BMC BIOINFORMATICS, 2012, 13
  • [39] Genome-wide association study of Cannabis dependence in a native American population using low-coverage whole genome sequence data
    Gizer, Ian
    Bizon, Chris
    Wilhelmsen, Kirk
    Ehlers, Cindy
    BEHAVIOR GENETICS, 2014, 44 (06) : 660 - 660
  • [40] LOW-COVERAGE, WHOLE-GENOME SEQUENCING OF ARTOCARPUS CAMANSI (MORACEAE) FOR PHYLOGENETIC MARKER DEVELOPMENT AND GENE DISCOVERY
    Gardner, Elliot M.
    Johnson, Matthew G.
    Ragone, Diane
    Wickett, Norman J.
    Zerega, Nyree J. C.
    APPLICATIONS IN PLANT SCIENCES, 2016, 4 (07):