Linkage disequilibrium vs. pedigree: Genomic selection prediction accuracy in conifer species

被引:26
|
作者
Thistlethwaite, Frances R. [1 ]
El-Dien, Omnia Gamal [1 ,2 ]
Ratcliffe, Blaise [1 ]
Klapste, Jaroslav [3 ]
Porth, Ilga [4 ]
Chen, Charles [5 ]
Stoehr, Michael U. [6 ]
Ingvarsson, Par K. [7 ]
El-Kassaby, Yousry A. [1 ]
机构
[1] Univ British Columbia, Fac Forestry, Dept Forest & Conservat Sci, Vancouver, BC, Canada
[2] Alexandria Univ, Fac Pharm, Pharmacognosy Dept, Alexandria, Egypt
[3] Scion New Zealand Forest Res Inst Ltd, Rotorua, New Zealand
[4] Univ Laval, Dept Sci Bois & Foret, Quebec City, PQ, Canada
[5] Oklahoma State Univ, Dept Biochem & Mol Biol, Stillwater, OK 74078 USA
[6] British Columbia Minist Forests, Lands & Nat Resource Operat, Victoria, BC, Canada
[7] Swedish Univ Agr Sci, Linnean Ctr Plant Biol, Dept Plant Biol, Uppsala, Sweden
来源
PLOS ONE | 2020年 / 15卷 / 06期
基金
加拿大自然科学与工程研究理事会;
关键词
NUCLEOTIDE DIVERSITY; QUANTITATIVE TRAITS; DEMOGRAPHIC HISTORY; RELATIONSHIP MATRIX; CLONED POPULATION; COMPLEX TRAITS; MODELS; ASSOCIATIONS; PATTERNS; VALUES;
D O I
10.1371/journal.pone.0232201
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Background The presupposition of genomic selection (GS) is that predictive accuracies should be based on population-wide linkage disequilibrium (LD). However, in species with large, highly complex genomes the limitation of marker density may preclude the ability to resolve LD accurately enough for GS. Here we investigate such an effect in two conifer species with similar to 20 Gbp genomes, Douglas-fir (Pseudotsuga menziesiiMirb. (Franco)) and Interior spruce (Picea glauca(Moench) Voss xPicea engelmanniiParry ex Engelm.). Random sampling of markers was performed to obtain SNP sets with totals in the range of 200-50,000, this was replicated 10 times. Ridge Regression Best Linear Unbiased Predictor (RR-BLUP) was deployed as the GS method to test these SNP sets, and 10-fold cross-validation was performed on 1,321 Douglas-fir trees, representing 37 full-sib F(1)families and on 1,126 Interior spruce trees, representing 25 open-pollinated (half-sib) families. Both trials are located on 3 sites in British Columbia, Canada. Results As marker number increased, so did GS predictive accuracy for both conifer species. However, a plateau in the gain of accuracy became apparent around 10,000-15,000 markers for both Douglas-fir and Interior spruce. Despite random marker selection, little variation in predictive accuracy was observed across replications. On average, Douglas-fir prediction accuracies were higher than those of Interior spruce, reflecting the difference between full- and half-sib families for Douglas-fir and Interior spruce populations, respectively, as well as their respective effective population size. Conclusions Although possibly advantageous within an advanced breeding population, reducing marker density cannot be recommended for carrying out GS in conifers. Significant LD between markers and putative causal variants was not detected using 50,000 SNPS, and GS was enabled only through the tracking of relatedness in the populations studied. Dramatically increasing marker density would enable said markers to better track LD with causal variants in these large, genetically diverse genomes; as well as providing a model that could be used across populations, breeding programs, and traits.
引用
收藏
页数:14
相关论文
共 50 条
  • [1] Contributions of linkage disequilibrium and co-segregation information to the accuracy of genomic prediction
    Xiaochen Sun
    Rohan Fernando
    Jack Dekkers
    Genetics Selection Evolution, 48
  • [2] Contributions of linkage disequilibrium and co-segregation information to the accuracy of genomic prediction
    Sun, Xiaochen
    Fernando, Rohan
    Dekkers, Jack
    GENETICS SELECTION EVOLUTION, 2016, 48 : 1 - 18
  • [3] Accuracy of genomic selection in simulated populations mimicking the extent of linkage disequilibrium in beef cattle
    Fernanda V Brito
    José Braccini Neto
    Mehdi Sargolzaei
    Jaime A Cobuci
    Flavio S Schenkel
    BMC Genetics, 12
  • [4] Accuracy of genomic selection in simulated populations mimicking the extent of linkage disequilibrium in beef cattle
    Brito, Fernanda V.
    Braccini Neto, Jose
    Sargolzaei, Mehdi
    Cobuci, Jaime A.
    Schenkel, Flavio S.
    BMC GENETICS, 2011, 12
  • [5] The Impact of Genetic Relationship and Linkage Disequilibrium on Genomic Selection
    Liu, Hongjun
    Zhou, Huangkai
    Wu, Yongsheng
    Li, Xiao
    Zhao, Jing
    Zuo, Tao
    Zhang, Xuan
    Zhang, Yongzhong
    Liu, Sisi
    Shen, Yaou
    Lin, Haijian
    Zhang, Zhiming
    Huang, Kaijian
    Luebberstedt, Thomas
    Pan, Guangtang
    PLOS ONE, 2015, 10 (07):
  • [6] Accuracy of Genomic Prediction in Synthetic Populations Depending on the Number of Parents, Relatedness, and Ancestral Linkage Disequilibrium
    Schopp, Pascal
    Mueller, Dominik
    Technow, Frank
    Melchinger, Albrecht E.
    GENETICS, 2017, 205 (01) : 441 - +
  • [7] Genomic Prediction Accuracy Using Haplotypes Defined by Size and Hierarchical Clustering Based on Linkage Disequilibrium
    Won, Sohyoung
    Park, Jong-Eun
    Son, Ju-Hwan
    Lee, Seung-Hwan
    Park, Byeong Ho
    Park, Mina
    Park, Won-Chul
    Chai, Han-Ha
    Kim, Heebal
    Lee, Jungjae
    Lim, Dajeong
    FRONTIERS IN GENETICS, 2020, 11
  • [8] Accuracy of Genomic Prediction in Switchgrass (Panicum virgatum L.) Improved by Accounting for Linkage Disequilibrium
    Ramstein, Guillaume P.
    Evans, Joseph
    Kaeppler, Shawn M.
    Mitchell, Robert B.
    Vogel, Kenneth P.
    Buell, C. Robin
    Casler, Michael D.
    G3-GENES GENOMES GENETICS, 2016, 6 (04): : 1049 - 1062
  • [9] The Effect of Linkage Disequilibrium and Family Relationships on the Reliability of Genomic Prediction
    Wientjes, Yvonne C. J.
    Veerkamp, Roel F.
    Calus, Mario P. L.
    GENETICS, 2013, 193 (02) : 621 - +
  • [10] Prediction in high-dimensional linear models and application to genomic selection under imperfect linkage disequilibrium
    Rabier, Charles-Elie
    Grusea, Simona
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2021, 70 (04) : 1001 - 1026