Accurate genotype imputation from low-coverage whole-genome sequencing data of rainbow trout

被引:2
|
作者
Liu, Sixin [1 ]
Martin, Kyle E. [2 ]
Snelling, Warren M. [3 ]
Long, Roseanna [1 ]
Leeds, Timothy D. [1 ]
Vallejo, Roger L. [1 ]
Wiens, Gregory D. [1 ]
Palti, Yniv [1 ]
机构
[1] Agr Res Serv, Natl Ctr Cool & Cold Water Aquaculture, United States Dept Agr, 11861 Leetown Rd, Kearneysville, WV 25430 USA
[2] Troutlodge Inc, Sumner, WA 98390 USA
[3] Agr Res Serv, US Meat Anim Res Ctr, United States Dept Agr, Clay Ctr, NE 68933 USA
来源
G3-GENES GENOMES GENETICS | 2024年 / 14卷 / 09期
关键词
rainbow trout; low-coverage whole-genome sequencing; genotype imputation; haplotype reference panel; SNP; GLIMPSE2; low-pass sequencing; FORMAT;
D O I
10.1093/g3journal/jkae168
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
With the rapid and significant cost reduction of next-generation sequencing, low-coverage whole-genome sequencing (lcWGS), followed by genotype imputation, is becoming a cost-effective alternative to single-nucleotide polymorphism (SNP)-array genotyping. The objectives of this study were 2-fold: (1) construct a haplotype reference panel for genotype imputation from lcWGS data in rainbow trout (Oncorhynchus mykiss); and (2) evaluate the concordance between imputed genotypes and SNP-array genotypes in 2 breeding populations. Medium-coverage (12x) whole-genome sequences were obtained from a total of 410 fish representing 5 breeding populations with various spawning dates. The short-read sequences were mapped to the rainbow trout reference genome, and genetic variants were identified using GATK. After data filtering, 20,434,612 biallelic SNPs were retained. The reference panel was phased with SHAPEIT5 and was used as a reference to impute genotypes from lcWGS data employing GLIMPSE2. A total of 90 fish from the Troutlodge November breeding population were sequenced with an average coverage of 1.3x, and these fish were also genotyped with the Axiom 57K rainbow trout SNP array. The concordance between array-based genotypes and imputed genotypes was 99.1%. After downsampling the coverage to 0.5x, 0.2x, and 0.1x, the concordance between array-based genotypes and imputed genotypes was 98.7, 97.8, and 96.7%, respectively. In the USDA odd-year breeding population, the concordance between array-based genotypes and imputed genotypes was 97.8% for 109 fish downsampled to 0.5x coverage. Therefore, the reference haplotype panel reported in this study can be used to accurately impute genotypes from lcWGS data in rainbow trout breeding populations.
引用
收藏
页数:6
相关论文
共 50 条
  • [41] Low-coverage genotyping-by-sequencing with accurate long HiFi reads and optimized imputation
    Eberle, Michael
    Busby, George
    Kintzle, Jen
    Di Domenico, Paolo
    Conception, Gregory
    Henno, Geoff
    Botta, Giordano
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 607 - 607
  • [42] dpGMM: A Dirichlet Process Gaussian Mixture Model for Copy Number Variation Detection in Low-Coverage Whole-Genome Sequencing Data
    Li, Yaoyao
    Zhang, Junying
    Yuan, Xiguo
    Li, Junping
    IEEE ACCESS, 2020, 8 : 27973 - 27985
  • [43] Toward genomic prediction from whole-genome sequence data: impact of sequencing design on genotype imputation and accuracy of predictions
    Druet, T.
    Macleod, I. M.
    Hayes, B. J.
    HEREDITY, 2014, 112 (01) : 39 - 47
  • [44] Toward genomic prediction from whole-genome sequence data: impact of sequencing design on genotype imputation and accuracy of predictions
    T Druet
    I M Macleod
    B J Hayes
    Heredity, 2014, 112 : 39 - 47
  • [45] Imputation of low-coverage sequencing data from 150,119 UK Biobank genomes
    Rubinacci, Simone
    Hofmeister, Robin J.
    da Mota, Barbara Sousa
    Delaneau, Olivier
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 50 - 50
  • [46] Imputation of low-coverage sequencing data from 150,119 UK Biobank genomes
    Simone Rubinacci
    Robin J. Hofmeister
    Bárbara Sousa da Mota
    Olivier Delaneau
    Nature Genetics, 2023, 55 : 1088 - 1090
  • [47] Induction and recovery of copy number variation in banana through gamma irradiation and low-coverage whole-genome sequencing
    Datta, Sneha
    Jankowicz-Cieslak, Joanna
    Nielen, Stephan
    Ingelbrecht, Ivan
    Till, Bradley J.
    PLANT BIOTECHNOLOGY JOURNAL, 2018, 16 (09) : 1644 - 1653
  • [48] Imputation of low-coverage sequencing data from 150,119 UK Biobank genomes
    Rubinacci, Simone
    Hofmeister, Robin J.
    da Mota, Barbara Sousa
    Delaneau, Olivier
    NATURE GENETICS, 2023, 55 (07) : 1088 - +
  • [49] A comparison of existing global DNA methylation assays to low-coverage whole-genome bisulfite sequencing for epidemiological studies
    Crary-Dooley, Florence K.
    Tam, Mitchell E.
    Dunaway, Keith W.
    Hertz-Picciotto, Irva
    Schmidt, Rebecca J.
    LaSalle, Janine M.
    EPIGENETICS, 2017, 12 (03) : 206 - 214
  • [50] Ultra low-coverage whole genome sequencing for precision oncology in solid tumors
    Tarawneh, T. S.
    Rodepeter, F.
    Ross, P.
    Teply-Szymanski, J.
    Koch, V.
    Thoelken, C.
    Knorrenschild, J. Riera
    Wuendisch, T.
    Denkert, C.
    Neubauer, A.
    Mack, E.
    ANNALS OF ONCOLOGY, 2022, 33 (08) : S1415 - S1416