An Improved Bayesian Shrinkage Regression Algorithm for Genomic Selection

被引:1
|
作者
Zhang, Jin [1 ]
Li, Ling [1 ,2 ]
Lv, Mingming [1 ]
Wang, Yidi [1 ]
Qiu, Wenzhe [1 ]
An, Yuan [1 ]
Zhang, Ye [1 ]
Wan, Yuxuan [3 ]
Xu, Yu [4 ]
Chen, Juncong [5 ]
机构
[1] Nanjing Agr Univ, Coll Sci, Nanjing 210095, Peoples R China
[2] Sun Yat sen Univ, Sch Publ Hlth Shenzhen, Shenzhen 518107, Peoples R China
[3] Jiangxi Univ Finance & Econ, Sch Business Adm, Nanchang 330013, Peoples R China
[4] Freshwater Fisheries Res Inst Jiangsu Prov, Nanjing 210017, Peoples R China
[5] Nanjing Agr Univ, Coll Finance, Nanjing 210095, Peoples R China
基金
中国国家自然科学基金;
关键词
genomic selection; polygenic background; Bayesian; mixed linear model; GEBV; WIDE ASSOCIATION; PREDICTIONS;
D O I
10.3390/genes13122193
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Currently a hot topic, genomic selection (GS) has consistently provided powerful support for breeding studies and achieved more comprehensive and reliable selection in animal and plant breeding. GS estimates the effects of all single nucleotide polymorphisms (SNPs) and thereby predicts the genomic estimation of breeding value (GEBV), accelerating breeding progress and overcoming the limitations of conventional breeding. The successful application of GS primarily depends on the accuracy of the GEBV. Adopting appropriate advanced algorithms to improve the accuracy of the GEBV is time-saving and efficient for breeders, and the available algorithms can be further improved in the big data era. In this study, we develop a new algorithm under the Bayesian Shrinkage Regression (BSR, which is called BayesA) framework, an improved expectation-maximization algorithm for BayesA (emBAI). The emBAI algorithm first corrects the polygenic and environmental noise and then calculates the GEBV by emBayesA. We conduct two simulation experiments and a real dataset analysis for flowering time-related Arabidopsis phenotypes to validate the new algorithm. Compared to established methods, emBAI is more powerful in terms of prediction accuracy, mean square error (MSE), mean absolute error (MAE), the area under the receiver operating characteristic curve (AUC) and correlation of prediction in simulation studies. In addition, emBAI performs well under the increasing genetic background. The analysis of the Arabidopsis real dataset further illustrates the benefits of emBAI for genomic prediction according to prediction accuracy, MSE, MAE and correlation of prediction. Furthermore, the new method shows the advantages of significant loci detection and effect coefficient estimation, which are confirmed by The Arabidopsis Information Resource (TAIR) gene bank. In conclusion, the emBAI algorithm provides powerful support for GS in high-dimensional genomic datasets.
引用
收藏
页数:12
相关论文
共 50 条
  • [21] Quantile regression shrinkage and selection via the Lqsso
    Daneshvar, Alireza
    Golalizadeh, Mousa
    JOURNAL OF BIOPHARMACEUTICAL STATISTICS, 2024, 34 (03) : 297 - 322
  • [22] Model selection with componentwise shrinkage in orthogonal regression
    Hagiwara, K
    IEICE TRANSACTIONS ON FUNDAMENTALS OF ELECTRONICS COMMUNICATIONS AND COMPUTER SCIENCES, 2003, E86A (07) : 1749 - 1758
  • [23] Shrinkage estimation and selection for a logistic regression model
    Hossain, Shakhawat
    Ahmed, S. Ejaz
    PERSPECTIVES ON BIG DATA ANALYSIS: METHODOLOGIES AND APPLICATIONS, 2014, 622 : 159 - 176
  • [24] Input selection and shrinkage in multiresponse linear regression
    Simila, Timo
    Tikka, Jarkko
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2007, 52 (01) : 406 - 422
  • [25] Regression shrinkage and selection via the lasso: a retrospective
    Tibshirani, Robert
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2011, 73 : 273 - 282
  • [26] Research on Regression Test Case Selection Based on Improved Genetic Algorithm
    Huang, Ming
    Guo, Shujie
    Liang, Xu
    Jiao, Xuan
    2013 3RD INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND NETWORK TECHNOLOGY (ICCSNT), 2013, : 256 - 259
  • [27] Improved Test Case Selection Algorithm to Reduce Time in Regression Testing
    Ghani, Israr
    Wan-Kadir, Wan M. N.
    Arbain, Adila Firdaus
    Ibrahim, Noraini
    CMC-COMPUTERS MATERIALS & CONTINUA, 2022, 72 (01): : 635 - 650
  • [28] Extension of the bayesian alphabet for genomic selection
    Habier, David
    Fernando, Rohan L.
    Kizilkaya, Kadir
    Garrick, Dorian J.
    BMC BIOINFORMATICS, 2011, 12
  • [29] Extension of the bayesian alphabet for genomic selection
    David Habier
    Rohan L Fernando
    Kadir Kizilkaya
    Dorian J Garrick
    BMC Bioinformatics, 12
  • [30] An Improved Forward Regression Variable Selection Algorithm for High-Dimensional Linear Regression Models
    Xie, Yanxi
    Li, Yuewen
    Xia, Zhijie
    Yan, Ruixia
    IEEE ACCESS, 2020, 8 (08): : 129032 - 129042