Fast and accurate Bayesian polygenic risk modeling with variational inference

Cited by: 8
Authors
Zabad, Shadi [1]
Gravel, Simon [2]
Li, Yue [1]
Affiliations
[1] McGill Univ, Sch Comp Sci, Montreal, PQ, Canada
[2] McGill Univ, Dept Human Genet, Montreal, PQ, Canada
Funding
Natural Sciences and Engineering Research Council of Canada
Keywords
GENOME-WIDE ASSOCIATION; HUMAN COMPLEX TRAITS; UK BIOBANK; VARIABLE SELECTION; MIXED-MODEL; PREDICTION; SCORES; RARE; REGRESSION; VARIANTS;
DOI
10.1016/j.ajhg.2023.03.009
Chinese Library Classification number
Q3 [Genetics]
Discipline classification code
071007 ; 090102 ;
Abstract
The advent of large-scale genome-wide association studies (GWASs) has motivated the development of statistical methods for phenotype prediction with single-nucleotide polymorphism (SNP) array data. These polygenic risk score (PRS) methods use a multiple linear regression framework to infer joint effect sizes of all genetic variants on the trait. Among the subset of PRS methods that operate on GWAS summary statistics, sparse Bayesian methods have shown competitive predictive ability. However, most existing Bayesian approaches employ Markov chain Monte Carlo (MCMC) algorithms for posterior inference, which are computationally inefficient and do not scale favorably to higher dimensions. Here, we introduce variational inference of polygenic risk scores (VIPRS), a Bayesian summary statistics-based PRS method that utilizes variational inference techniques to approximate the posterior distribution for the effect sizes. Our experiments with 36 simulation configurations and 12 real phenotypes from the UK Biobank dataset demonstrated that VIPRS is consistently competitive with the state-of-the-art in prediction accuracy while being more than twice as fast as popular MCMC-based approaches. This performance advantage is robust across a variety of genetic architectures, SNP heritabilities, and independent GWAS cohorts. In addition to its competitive accuracy on the "White British" samples, VIPRS showed improved transferability when applied to other ethnic groups, with up to 1.7-fold increase in R² among individuals of Nigerian ancestry for low-density lipoprotein (LDL) cholesterol. To illustrate its scalability, we applied VIPRS to a dataset of 9.6 million genetic markers, which conferred further improvements in prediction accuracy for highly polygenic traits, such as height.
Pages: 741-761
Page count: 22