An Improved Bayesian Shrinkage Regression Algorithm for Genomic Selection

被引:1
|
作者
Zhang, Jin [1 ]
Li, Ling [1 ,2 ]
Lv, Mingming [1 ]
Wang, Yidi [1 ]
Qiu, Wenzhe [1 ]
An, Yuan [1 ]
Zhang, Ye [1 ]
Wan, Yuxuan [3 ]
Xu, Yu [4 ]
Chen, Juncong [5 ]
机构
[1] Nanjing Agr Univ, Coll Sci, Nanjing 210095, Peoples R China
[2] Sun Yat sen Univ, Sch Publ Hlth Shenzhen, Shenzhen 518107, Peoples R China
[3] Jiangxi Univ Finance & Econ, Sch Business Adm, Nanchang 330013, Peoples R China
[4] Freshwater Fisheries Res Inst Jiangsu Prov, Nanjing 210017, Peoples R China
[5] Nanjing Agr Univ, Coll Finance, Nanjing 210095, Peoples R China
基金
中国国家自然科学基金;
关键词
genomic selection; polygenic background; Bayesian; mixed linear model; GEBV; WIDE ASSOCIATION; PREDICTIONS;
D O I
10.3390/genes13122193
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Currently a hot topic, genomic selection (GS) has consistently provided powerful support for breeding studies and achieved more comprehensive and reliable selection in animal and plant breeding. GS estimates the effects of all single nucleotide polymorphisms (SNPs) and thereby predicts the genomic estimation of breeding value (GEBV), accelerating breeding progress and overcoming the limitations of conventional breeding. The successful application of GS primarily depends on the accuracy of the GEBV. Adopting appropriate advanced algorithms to improve the accuracy of the GEBV is time-saving and efficient for breeders, and the available algorithms can be further improved in the big data era. In this study, we develop a new algorithm under the Bayesian Shrinkage Regression (BSR, which is called BayesA) framework, an improved expectation-maximization algorithm for BayesA (emBAI). The emBAI algorithm first corrects the polygenic and environmental noise and then calculates the GEBV by emBayesA. We conduct two simulation experiments and a real dataset analysis for flowering time-related Arabidopsis phenotypes to validate the new algorithm. Compared to established methods, emBAI is more powerful in terms of prediction accuracy, mean square error (MSE), mean absolute error (MAE), the area under the receiver operating characteristic curve (AUC) and correlation of prediction in simulation studies. In addition, emBAI performs well under the increasing genetic background. The analysis of the Arabidopsis real dataset further illustrates the benefits of emBAI for genomic prediction according to prediction accuracy, MSE, MAE and correlation of prediction. Furthermore, the new method shows the advantages of significant loci detection and effect coefficient estimation, which are confirmed by The Arabidopsis Information Resource (TAIR) gene bank. In conclusion, the emBAI algorithm provides powerful support for GS in high-dimensional genomic datasets.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] Polygenic prediction via Bayesian regression and continuous shrinkage priors
    Tian Ge
    Chia-Yen Chen
    Yang Ni
    Yen-Chen Anne Feng
    Jordan W. Smoller
    Nature Communications, 10
  • [42] Nearly optimal Bayesian shrinkage for high-dimensional regression
    Qifan Song
    Faming Liang
    Science China Mathematics, 2023, 66 : 409 - 442
  • [43] Using the EM algorithm for Bayesian variable selection in logistic regression models with related covariates
    Koslovsky, M. D.
    Swartz, M. D.
    Leon-Novelo, L.
    Chan, W.
    Wilkinson, A. V.
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2018, 88 (03) : 575 - 596
  • [44] An improved weighted naive Bayesian classification algorithm based on multivariable linear regression model
    Wang, Xingang
    Sun, Xiu
    PROCEEDINGS OF 2016 9TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2016, : 219 - 222
  • [45] Multivariate regression shrinkage and selection by canonical correlation analysis
    An, Baiguo
    Guo, Jianhua
    Wang, Hansheng
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2013, 62 : 93 - 107
  • [46] An improved successive projections algorithm version to variable selection in multiple linear regression
    Canova, Luciana dos Santos
    Vallese, Federico Danilo
    Pistonesi, Marcelo Fabian
    Gomes, Adriano de Araujo
    ANALYTICA CHIMICA ACTA, 2023, 1274
  • [47] A Bayesian Decision Theory Approach for Genomic Selection
    Jesus Villar-Hernandez, Bartolode
    Perez-Elizalde, Sergio
    Crossa, Jose
    Perez-Rodriguez, Paulino
    Toledo, Fernando H.
    Burgueno, Juan
    G3-GENES GENOMES GENETICS, 2018, 8 (09): : 3019 - 3037
  • [48] Sparse bayesian learning for genomic selection in yeast
    Ayat, Maryam
    Domaratzki, Mike
    FRONTIERS IN BIOINFORMATICS, 2022, 2
  • [49] Bayesian discrete lognormal regression model for genomic prediction
    Abelardo Montesinos-López
    Humberto Gutiérrez-Pulido
    Sofía Ramos-Pulido
    José Cricelio Montesinos-López
    Osval A. Montesinos-López
    José Crossa
    Theoretical and Applied Genetics, 2024, 137
  • [50] Bayesian discrete lognormal regression model for genomic prediction
    Montesinos-Lopez, Abelardo
    Gutierrez-Pulido, Humberto
    Ramos-Pulido, Sofia
    Montesinos-Lopez, Jose Cricelio
    Montesinos-Lopez, Osval A.
    Crossa, Jose
    THEORETICAL AND APPLIED GENETICS, 2024, 137 (01)