Genomic Prediction Accounting for Residual Heteroskedasticity

Cited by: 7
Authors
Ou, Zhining [1]
Tempelman, Robert J. [2]
Steibel, Juan P. [2, 3]
Ernst, Catherine W. [2]
Bates, Ronald O. [2]
Bello, Nora M. [1]
Affiliations
[1] Kansas State Univ, Dept Stat, Manhattan, KS 66506 USA
[2] Michigan State Univ, Dept Anim Sci, E Lansing, MI 48824 USA
[3] Michigan State Univ, Dept Fisheries & Wildlife, E Lansing, MI 48824 USA
Source
G3-GENES GENOMES GENETICS | 2016 / Vol. 6 / Issue 01
Funding
National Institute of Food and Agriculture (USA); National Science Foundation (USA);
Keywords
whole-genome prediction; heteroskedastic errors; genomic breeding values; hierarchical Bayesian model; genPred; shared data resource; PIETRAIN RESOURCE POPULATION; HETEROGENEOUS VARIANCES; BAYESIAN ALPHABET; GENETIC-ANALYSIS; BREEDING VALUES; SELECTION; REGRESSION; ACCURACY; MODELS; DISTRIBUTIONS;
DOI
10.1534/g3.115.022897
Chinese Library Classification number
Q3 [Genetics];
Discipline classification code
071007 ; 090102 ;
Abstract
Whole-genome prediction (WGP) models that use single-nucleotide polymorphism marker information to predict genetic merit of animals and plants typically assume homogeneous residual variance. However, variability is often heterogeneous across agricultural production systems and may subsequently bias WGP-based inferences. This study extends classical WGP models based on normality, heavy-tailed specifications and variable selection to explicitly account for environmentally driven residual heteroskedasticity under a hierarchical Bayesian mixed-models framework. WGP models assuming homogeneous or heterogeneous residual variances were fitted to training data generated under simulation scenarios reflecting a gradient of increasing heteroskedasticity. Model fit was based on pseudo-Bayes factors and also on prediction accuracy of genomic breeding values computed on a validation data subset one generation removed from the simulated training dataset. Homogeneous vs. heterogeneous residual variance WGP models were also fitted to two quantitative traits, namely 45-min postmortem carcass temperature and loin muscle pH, recorded in a swine resource population dataset prescreened for high and mild residual heteroskedasticity, respectively. Fit of competing WGP models was compared using pseudo-Bayes factors. Predictive ability, defined as the correlation between predicted and observed phenotypes in validation sets of a five-fold cross-validation, was also computed. Heteroskedastic error WGP models showed improved model fit and enhanced prediction accuracy compared to homoskedastic error WGP models, although the magnitude of the improvement was small (less than two percentage points net gain in prediction accuracy). Nevertheless, accounting for residual heteroskedasticity did improve accuracy of selection, especially on individuals of extreme genetic merit.
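As an illustration only, the predictive-ability measure named in the abstract (correlation between predicted and observed phenotypes across the validation sets of a five-fold cross-validation) might be sketched as follows. This is not the authors' method: a plain ridge regression on marker genotypes stands in for the hierarchical Bayesian WGP models, and the simulated data, the two-environment residual structure, and all function and variable names (`ridge_fit`, `cv_predictive_ability`, `lam`) are assumptions made for the example.

```python
import numpy as np


def ridge_fit(X, y, lam):
    """Fit ridge regression on centered markers (a stand-in, not the paper's Bayesian model)."""
    x_mean = X.mean(axis=0)
    mu = y.mean()
    Xc = X - x_mean
    p = X.shape[1]
    beta = np.linalg.solve(Xc.T @ Xc + lam * np.eye(p), Xc.T @ (y - mu))
    return mu, x_mean, beta


def cv_predictive_ability(X, y, k=5, lam=1.0, seed=0):
    """Return the mean and per-fold correlation between predicted and observed phenotypes."""
    rng = np.random.default_rng(seed)
    idx = rng.permutation(len(y))
    folds = np.array_split(idx, k)
    cors = []
    for fold in folds:
        train = np.setdiff1d(idx, fold)  # all records not in the validation fold
        mu, x_mean, beta = ridge_fit(X[train], y[train], lam)
        pred = mu + (X[fold] - x_mean) @ beta
        cors.append(float(np.corrcoef(pred, y[fold])[0, 1]))
    return float(np.mean(cors)), cors


# Hypothetical simulated data: 300 individuals, 50 markers coded 0/1/2,
# and two environments with different residual standard deviations.
rng = np.random.default_rng(1)
n, p = 300, 50
X = rng.integers(0, 3, size=(n, p)).astype(float)
beta_true = rng.normal(0.0, 0.3, size=p)
env = rng.integers(0, 2, size=n)
resid_sd = np.where(env == 0, 0.5, 2.0)  # heteroskedastic residuals
y = X @ beta_true + rng.normal(0.0, resid_sd)

mean_r, fold_r = cv_predictive_ability(X, y, k=5, lam=1.0)
print("predictive ability per fold:", np.round(fold_r, 3))
print("mean predictive ability:", round(mean_r, 3))
```

The ridge model here assumes a single residual variance, so it corresponds loosely to the homoskedastic baseline; a heteroskedastic counterpart would need environment-specific residual variances, which this sketch does not attempt.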
Pages: 1-13
Number of pages: 13
Related papers
50 in total
  • [21] Improved Autoregressive conditional heteroskedasticity model for Prediction of Housing Prices
    Business School, Sichuan University, Chengdu 610064, China
    Unknown
    Gu, X. (guxin@scu.edu.cn), 2013, CESER Publications, Post Box No. 113, Roorkee, 247667, India (49):
  • [22] Circular prediction regions for miss distance models under heteroskedasticity
    Johnson, Thomas H.
    Haman, John T.
    Wojton, Heather
    Freeman, Laura
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2021, 37 (07) : 2991 - 3003
  • [23] A Function Accounting for Training Set Size and Marker Density to Model the Average Accuracy of Genomic Prediction
    Erbe, Malena
    Gredler, Birgit
    Seefried, Franz Reinhold
    Bapst, Beat
    Simianer, Henner
    PLOS ONE, 2013, 8 (12):
  • [24] Accuracy of Genomic Prediction in Switchgrass (Panicum virgatum L.) Improved by Accounting for Linkage Disequilibrium
    Ramstein, Guillaume P.
    Evans, Joseph
    Kaeppler, Shawn M.
    Mitchell, Robert B.
    Vogel, Kenneth P.
    Buell, C. Robin
    Casler, Michael D.
    G3-GENES GENOMES GENETICS, 2016, 6 (04) : 1049 - 1062
  • [25] Impact of residual covariance structures on genomic prediction ability in multi-environment trials
    Mathew, Boby
    Leon, Jens
    Sillanpaa, Mikko J.
    PLOS ONE, 2018, 13 (07):
  • [26] Prediction ability of an alternative multi-trait genomic evaluation for residual feed intake
    Pravia, Maria Isabel
    Navajas, Elly Ana
    Aguilar, Ignacio
    Ravagnolo, Olga
    JOURNAL OF ANIMAL BREEDING AND GENETICS, 2023, 140 (05) : 508 - 518
  • [27] Feature Selection Stability and Accuracy of Prediction Models for Genomic Prediction of Residual Feed Intake in Pigs Using Machine Learning
    Piles, Miriam
    Bergsma, Rob
    Gianola, Daniel
    Gilbert, Helene
    Tusell, Llibertat
    FRONTIERS IN GENETICS, 2021, 12
  • [28] Accounting for outliers and heteroskedasticity in multibreed genetic evaluations of postweaning gain of Nelore-Hereford cattle
    Cardoso, F. F.
    Rosa, G. J. M.
    Tempelman, R. J.
    JOURNAL OF ANIMAL SCIENCE, 2007, 85 (04) : 909 - 918
  • [29] Prediction in the two-way random-effect model with heteroskedasticity
    Kouassi, Eugene
    Kymn, Kern O.
    JOURNAL OF FORECASTING, 2008, 27 (05) : 451 - 463
  • [30] Development of an accurate genomic ancestry prediction strategy to enable the accounting of Australian and Japanese historical military remains
    Ghaiyed, A. P.
    Chaseling, J.
    Lea, R. A.
    Bernie, A.
    Haupt, L. M.
    Griffiths, L. R.
    Wright, K. M.
    AUSTRALIAN JOURNAL OF FORENSIC SCIENCES, 2022, 54 (03) : 416 - 436