An Improved Bayesian Shrinkage Regression Algorithm for Genomic Selection

被引:1
|
作者
Zhang, Jin [1 ]
Li, Ling [1 ,2 ]
Lv, Mingming [1 ]
Wang, Yidi [1 ]
Qiu, Wenzhe [1 ]
An, Yuan [1 ]
Zhang, Ye [1 ]
Wan, Yuxuan [3 ]
Xu, Yu [4 ]
Chen, Juncong [5 ]
机构
[1] Nanjing Agr Univ, Coll Sci, Nanjing 210095, Peoples R China
[2] Sun Yat sen Univ, Sch Publ Hlth Shenzhen, Shenzhen 518107, Peoples R China
[3] Jiangxi Univ Finance & Econ, Sch Business Adm, Nanchang 330013, Peoples R China
[4] Freshwater Fisheries Res Inst Jiangsu Prov, Nanjing 210017, Peoples R China
[5] Nanjing Agr Univ, Coll Finance, Nanjing 210095, Peoples R China
基金
中国国家自然科学基金;
关键词
genomic selection; polygenic background; Bayesian; mixed linear model; GEBV; WIDE ASSOCIATION; PREDICTIONS;
D O I
10.3390/genes13122193
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Currently a hot topic, genomic selection (GS) has consistently provided powerful support for breeding studies and achieved more comprehensive and reliable selection in animal and plant breeding. GS estimates the effects of all single nucleotide polymorphisms (SNPs) and thereby predicts the genomic estimation of breeding value (GEBV), accelerating breeding progress and overcoming the limitations of conventional breeding. The successful application of GS primarily depends on the accuracy of the GEBV. Adopting appropriate advanced algorithms to improve the accuracy of the GEBV is time-saving and efficient for breeders, and the available algorithms can be further improved in the big data era. In this study, we develop a new algorithm under the Bayesian Shrinkage Regression (BSR, which is called BayesA) framework, an improved expectation-maximization algorithm for BayesA (emBAI). The emBAI algorithm first corrects the polygenic and environmental noise and then calculates the GEBV by emBayesA. We conduct two simulation experiments and a real dataset analysis for flowering time-related Arabidopsis phenotypes to validate the new algorithm. Compared to established methods, emBAI is more powerful in terms of prediction accuracy, mean square error (MSE), mean absolute error (MAE), the area under the receiver operating characteristic curve (AUC) and correlation of prediction in simulation studies. In addition, emBAI performs well under the increasing genetic background. The analysis of the Arabidopsis real dataset further illustrates the benefits of emBAI for genomic prediction according to prediction accuracy, MSE, MAE and correlation of prediction. Furthermore, the new method shows the advantages of significant loci detection and effect coefficient estimation, which are confirmed by The Arabidopsis Information Resource (TAIR) gene bank. In conclusion, the emBAI algorithm provides powerful support for GS in high-dimensional genomic datasets.
引用
收藏
页数:12
相关论文
共 50 条
  • [31] Bayesian variable selection for regression models
    Kuo, L
    Mallick, B
    AMERICAN STATISTICAL ASSOCIATION - 1996 PROCEEDINGS OF THE SECTION ON BAYESIAN STATISTICAL SCIENCE, 1996, : 170 - 175
  • [32] Bayesian variable selection in quantile regression
    Yu, Keming
    Chen, Cathy W. S.
    Reed, Craig
    Dunson, David B.
    STATISTICS AND ITS INTERFACE, 2013, 6 (02) : 261 - 274
  • [33] Bayesian variable selection for logistic regression
    Tian, Yiqing
    Bondell, Howard D.
    Wilson, Alyson
    STATISTICAL ANALYSIS AND DATA MINING, 2019, 12 (05) : 378 - 393
  • [34] On Bayesian lasso variable selection and the specification of the shrinkage parameter
    Anastasia Lykou
    Ioannis Ntzoufras
    Statistics and Computing, 2013, 23 : 361 - 390
  • [35] On Bayesian lasso variable selection and the specification of the shrinkage parameter
    Lykou, Anastasia
    Ntzoufras, Ioannis
    STATISTICS AND COMPUTING, 2013, 23 (03) : 361 - 390
  • [36] Nearly optimal Bayesian shrinkage for high-dimensional regression
    Qifan Song
    Faming Liang
    ScienceChina(Mathematics), 2023, 66 (02) : 409 - 442
  • [37] Nearly optimal Bayesian shrinkage for high-dimensional regression
    Song, Qifan
    Liang, Faming
    SCIENCE CHINA-MATHEMATICS, 2023, 66 (02) : 409 - 442
  • [38] A Pseudo-Bayesian Shrinkage Approach to Regression with Missing Covariates
    Zhang, Nanhua
    Little, Roderick J.
    BIOMETRICS, 2012, 68 (03) : 933 - 942
  • [39] Improved Lasso for genomic selection
    Legarra, Andres
    Robert-Granie, Christele
    Croiseau, Pascal
    Guillaume, Francois
    Fritz, Sebastien
    GENETICS RESEARCH, 2011, 93 (01) : 77 - 87
  • [40] Polygenic prediction via Bayesian regression and continuous shrinkage priors
    Ge, Tian
    Chen, Chia-Yen
    Ni, Yang
    Feng, Yen-Chen Anne
    Smoller, Jordan W.
    NATURE COMMUNICATIONS, 2019, 10 (1)