Accurate genotype imputation from low-coverage whole-genome sequencing data of rainbow trout

被引:2
|
作者
Liu, Sixin [1 ]
Martin, Kyle E. [2 ]
Snelling, Warren M. [3 ]
Long, Roseanna [1 ]
Leeds, Timothy D. [1 ]
Vallejo, Roger L. [1 ]
Wiens, Gregory D. [1 ]
Palti, Yniv [1 ]
机构
[1] Agr Res Serv, Natl Ctr Cool & Cold Water Aquaculture, United States Dept Agr, 11861 Leetown Rd, Kearneysville, WV 25430 USA
[2] Troutlodge Inc, Sumner, WA 98390 USA
[3] Agr Res Serv, US Meat Anim Res Ctr, United States Dept Agr, Clay Ctr, NE 68933 USA
来源
G3-GENES GENOMES GENETICS | 2024年 / 14卷 / 09期
关键词
rainbow trout; low-coverage whole-genome sequencing; genotype imputation; haplotype reference panel; SNP; GLIMPSE2; low-pass sequencing; FORMAT;
D O I
10.1093/g3journal/jkae168
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
With the rapid and significant cost reduction of next-generation sequencing, low-coverage whole-genome sequencing (lcWGS), followed by genotype imputation, is becoming a cost-effective alternative to single-nucleotide polymorphism (SNP)-array genotyping. The objectives of this study were 2-fold: (1) construct a haplotype reference panel for genotype imputation from lcWGS data in rainbow trout (Oncorhynchus mykiss); and (2) evaluate the concordance between imputed genotypes and SNP-array genotypes in 2 breeding populations. Medium-coverage (12x) whole-genome sequences were obtained from a total of 410 fish representing 5 breeding populations with various spawning dates. The short-read sequences were mapped to the rainbow trout reference genome, and genetic variants were identified using GATK. After data filtering, 20,434,612 biallelic SNPs were retained. The reference panel was phased with SHAPEIT5 and was used as a reference to impute genotypes from lcWGS data employing GLIMPSE2. A total of 90 fish from the Troutlodge November breeding population were sequenced with an average coverage of 1.3x, and these fish were also genotyped with the Axiom 57K rainbow trout SNP array. The concordance between array-based genotypes and imputed genotypes was 99.1%. After downsampling the coverage to 0.5x, 0.2x, and 0.1x, the concordance between array-based genotypes and imputed genotypes was 98.7, 97.8, and 96.7%, respectively. In the USDA odd-year breeding population, the concordance between array-based genotypes and imputed genotypes was 97.8% for 109 fish downsampled to 0.5x coverage. Therefore, the reference haplotype panel reported in this study can be used to accurately impute genotypes from lcWGS data in rainbow trout breeding populations.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] Comparison of Genotype Imputation for SNP Array and Low-Coverage Whole-Genome Sequencing Data
    Deng, Tianyu
    Zhang, Pengfei
    Garrick, Dorian
    Gao, Huijiang
    Wang, Lixian
    Zhao, Fuping
    FRONTIERS IN GENETICS, 2022, 12
  • [2] Population assignment from genotype likelihoods for low-coverage whole-genome sequencing data
    Desaix, Matthew G.
    Rodriguez, Marina D.
    Ruegg, Kristen C.
    Anderson, Eric C.
    METHODS IN ECOLOGY AND EVOLUTION, 2024, 15 (03): : 493 - 510
  • [3] Phylogenomics from low-coverage whole-genome sequencing
    Zhang, Feng
    Ding, Yinhuan
    Zhu, Chao-Dong
    Zhou, Xin
    Orr, Michael C.
    Scheu, Stefan
    Luan, Yun-Xia
    METHODS IN ECOLOGY AND EVOLUTION, 2019, 10 (04): : 507 - 517
  • [4] An accurate assignment test for extremely low-coverage whole-genome sequence data
    Ferrari, Giada
    Atmore, Lane M.
    Jentoft, Sissel
    Jakobsen, Kjetill S.
    Makowiecki, Daniel
    Barrett, James H.
    Star, Bastiaan
    MOLECULAR ECOLOGY RESOURCES, 2022, 22 (04) : 1330 - 1344
  • [5] ACE: absolute copy number estimation from low-coverage whole-genome sequencing data
    Poell, Jos B.
    Mendeville, Matias
    Sie, Daoud
    Brink, Arjen
    Brakenhoff, Ruud H.
    Ylstra, Bauke
    BIOINFORMATICS, 2019, 35 (16) : 2847 - 2849
  • [6] Imputation strategies for low-coverage whole-genome sequencing data and their effects on genomic prediction and genome-wide association studies in pigs
    Wang, X. Q.
    Wang, L. G.
    Shi, L. . Y.
    Tian, J. J.
    Li, M. . Y.
    Wang, L. . X.
    Zhao, F. P.
    ANIMAL, 2024, 18 (09)
  • [7] Best practices for genotype imputation from low-coverage sequencing data in natural populations
    Watowich, Marina M.
    Chiou, Kenneth L.
    Graves, Brian
    Montague, Michael J.
    Brent, Lauren J. N.
    Higham, James P.
    Horvath, Julie E.
    Lu, Amy
    Martinez, Melween I.
    Platt, Michael L.
    Schneider-Crease, India A.
    Lea, Amanda J.
    Snyder-Mackler, Noah
    MOLECULAR ECOLOGY RESOURCES, 2023,
  • [8] Accurate Genotype Imputation in Multiparental Populations from Low-Coverage Sequence
    Zheng, Chaozhi
    Boer, Martin P.
    van Eeuwijk, Fred A.
    GENETICS, 2018, 210 (01) : 71 - 82
  • [9] A protocol for applying low-coverage whole-genome sequencing data in structural variation studies
    Liu, Qi
    Xie, Bo
    Gao, Yang
    Xu, Shuhua
    Lu, Yan
    STAR PROTOCOLS, 2023, 4 (03):
  • [10] A Systematic Evaluation of Low-Coverage Whole Genome Sequencing Imputation across Human Populations
    Rubinacci, Simone
    Delaneau, Olivier
    HUMAN HEREDITY, 2021, 85 (02) : 90 - 91