An accurate assignment test for extremely low-coverage whole-genome sequence data

被引:9
|
作者
Ferrari, Giada [1 ]
Atmore, Lane M. [1 ]
Jentoft, Sissel [1 ]
Jakobsen, Kjetill S. [1 ]
Makowiecki, Daniel [2 ]
Barrett, James H. [3 ,4 ]
Star, Bastiaan [1 ]
机构
[1] Univ Oslo, Ctr Ecol & Evolutionary Synth, Dept Biosci, Oslo, Norway
[2] Nicolaus Copernicus Univ, Inst Archaeol, Dept Environm Archaeol & Human Paleoecol, Torun, Poland
[3] Univ Cambridge, McDonald Inst Archaeol Res, Dept Archaeol, Cambridge, England
[4] NTNU Univ Museum, Dept Archaeol & Cultural Hist, Trondheim, Norway
关键词
chromosomal inversion; ecotype; genome skimming; haplotype; population assignment; ATLANTIC COD; ADAPTIVE EVOLUTION; LOCAL ADAPTATION; ANCIENT DNA; INVERSION POLYMORPHISMS; REPRODUCTIVE ISOLATION; DIVERGENCE; ASSOCIATION; MIMICRY; DIFFERENTIATION;
D O I
10.1111/1755-0998.13551
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Genomic assignment tests can provide important diagnostic biological characteristics, such as population of origin or ecotype. Yet, assignment tests often rely on moderate- to high-coverage sequence data that can be difficult to obtain for fields such as molecular ecology and ancient DNA. We have developed a novel approach that efficiently assigns biologically relevant information (i.e., population identity or structural variants such as inversions) in extremely low-coverage sequence data. First, we generate databases from existing reference data using a subset of diagnostic single nucleotide polymorphisms (SNPs) associated with a biological characteristic. Low-coverage alignment files are subsequently compared to these databases to ascertain allelic state, yielding a joint probability for each association. To assess the efficacy of this approach, we assigned haplotypes and population identity in Heliconius butterflies, Atlantic herring, and Atlantic cod using chromosomal inversion sites and whole-genome data. We scored both modern and ancient specimens, including the first whole-genome sequence data recovered from ancient Atlantic herring bones. The method accurately assigns biological characteristics, including population membership, using extremely low-coverage data (as low as 0.0001x) based on genome-wide SNPs. This approach will therefore increase the number of samples in evolutionary, ecological and archaeological research for which relevant biological information can be obtained.
引用
收藏
页码:1330 / 1344
页数:15
相关论文
共 50 条
  • [41] dpGMM: A Dirichlet Process Gaussian Mixture Model for Copy Number Variation Detection in Low-Coverage Whole-Genome Sequencing Data
    Li, Yaoyao
    Zhang, Junying
    Yuan, Xiguo
    Li, Junping
    IEEE ACCESS, 2020, 8 : 27973 - 27985
  • [42] Non-Invasive Detection of Breast Cancer by Low-Coverage Whole-Genome Sequencing from Plasma
    Peng, Li
    Yao, Ru
    Gao, Sihang
    Qu, Yang
    Qu, Li
    Zhang, Jingbo
    Zhou, Yidong
    CLINICAL AND EXPERIMENTAL OBSTETRICS & GYNECOLOGY, 2023, 50 (07):
  • [43] Cost-effective low-coverage whole-genome sequencing assay for the risk stratification of gastric cancer
    Ye, Li-Ping
    Mao, Xin-Li
    Zhou, Xian-Bin
    Wang, Yi
    Xu, Shi-Wen
    He, Sai-Qin
    Qian, Zi-Liang
    Zhang, Xiao-Gang
    Zhai, Li-Juan
    Peng, Jin-Bang
    Gu, Bin-Bin
    Jin, Xiu-Xiu
    Song, Ya-Qi
    Li, Shao-Wei
    WORLD JOURNAL OF GASTROINTESTINAL ONCOLOGY, 2022, 14 (03) : 690 - 702
  • [44] Cost-effective low-coverage whole-genome sequencing assay for the risk stratification of gastric cancer
    Li-Ping Ye
    Xin-Li Mao
    Xian-Bin Zhou
    Yi Wang
    Shi-Wen Xu
    Sai-Qin He
    Zi-Liang Qian
    Xiao-Gang Zhang
    Li-Juan Zhai
    Jin-Bang Peng
    Bin-Bin Gu
    Xiu-Xiu Jin
    Ya-Qi Song
    Shao-Wei Li
    World Journal of Gastrointestinal Oncology, 2022, (03) : 690 - 702
  • [45] Low-coverage whole-genome sequencing of extracellular vesicle-associated DNA in patients with metastatic cancer
    Nguyen, Bella
    Wong, Nicholas C.
    Semple, Tim
    Clark, Michael
    Wong, Stephen Q.
    Leslie, Connull
    Mirzai, Bob
    Millward, Michael
    Meehan, Katie
    Lim, Annette M.
    SCIENTIFIC REPORTS, 2021, 11 (01)
  • [46] Kinship Estimation Based on Extremely Low-Coverage Sequencing Data
    Dou, Jinzhuang
    Chothani, Sonia
    Sim, Xueling
    Hughes, Jason D.
    Reilly, Dermot F.
    Tai, E. Shyong
    Liu, Jianjun
    Wang, Chaolong
    GENETIC EPIDEMIOLOGY, 2016, 40 (07) : 619 - 620
  • [47] Induction and recovery of copy number variation in banana through gamma irradiation and low-coverage whole-genome sequencing
    Datta, Sneha
    Jankowicz-Cieslak, Joanna
    Nielen, Stephan
    Ingelbrecht, Ivan
    Till, Bradley J.
    PLANT BIOTECHNOLOGY JOURNAL, 2018, 16 (09) : 1644 - 1653
  • [48] Accurate Genotype Imputation in Multiparental Populations from Low-Coverage Sequence
    Zheng, Chaozhi
    Boer, Martin P.
    van Eeuwijk, Fred A.
    GENETICS, 2018, 210 (01) : 71 - 82
  • [49] A comparison of existing global DNA methylation assays to low-coverage whole-genome bisulfite sequencing for epidemiological studies
    Crary-Dooley, Florence K.
    Tam, Mitchell E.
    Dunaway, Keith W.
    Hertz-Picciotto, Irva
    Schmidt, Rebecca J.
    LaSalle, Janine M.
    EPIGENETICS, 2017, 12 (03) : 206 - 214
  • [50] A Novel Approach to Estimating Heterozygosity from Low-Coverage Genome Sequence
    Bryc, Katarzyna
    Patterson, Nick
    Reich, David
    GENETICS, 2013, 195 (02) : 553 - 561