Fast and accurate Bayesian polygenic risk modeling with variational inference

Cited by: 8
Authors
Zabad, Shadi [1]
Gravel, Simon [2]
Li, Yue [1]
Affiliations
[1] McGill Univ, Sch Comp Sci, Montreal, PQ, Canada
[2] McGill Univ, Dept Human Genet, Montreal, PQ, Canada
Funding
Natural Sciences and Engineering Research Council of Canada;
Keywords
GENOME-WIDE ASSOCIATION; HUMAN COMPLEX TRAITS; UK BIOBANK; VARIABLE SELECTION; MIXED-MODEL; PREDICTION; SCORES; RARE; REGRESSION; VARIANTS;
DOI
10.1016/j.ajhg.2023.03.009
Chinese Library Classification number
Q3 [Genetics];
Discipline classification code
071007 ; 090102 ;
Abstract
The advent of large-scale genome-wide association studies (GWASs) has motivated the development of statistical methods for phenotype prediction with single-nucleotide polymorphism (SNP) array data. These polygenic risk score (PRS) methods use a multiple linear regression framework to infer joint effect sizes of all genetic variants on the trait. Among the subset of PRS methods that operate on GWAS summary statistics, sparse Bayesian methods have shown competitive predictive ability. However, most existing Bayesian approaches employ Markov chain Monte Carlo (MCMC) algorithms for posterior inference, which are computationally inefficient and do not scale favorably to higher dimensions. Here, we introduce variational inference of polygenic risk scores (VIPRS), a Bayesian summary statistics-based PRS method that utilizes variational inference techniques to approximate the posterior distribution for the effect sizes. Our experiments with 36 simulation configurations and 12 real phenotypes from the UK Biobank dataset demonstrated that VIPRS is consistently competitive with the state-of-the-art in prediction accuracy while being more than twice as fast as popular MCMC-based approaches. This performance advantage is robust across a variety of genetic architectures, SNP heritabilities, and independent GWAS cohorts. In addition to its competitive accuracy on the "White British" samples, VIPRS showed improved transferability when applied to other ethnic groups, with up to 1.7-fold increase in R² among individuals of Nigerian ancestry for low-density lipoprotein (LDL) cholesterol. To illustrate its scalability, we applied VIPRS to a dataset of 9.6 million genetic markers, which conferred further improvements in prediction accuracy for highly polygenic traits, such as height.
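As a rough illustration of the linear framework the abstract refers to, the sketch below computes a polygenic score as a weighted sum of genotype dosages and evaluates it with R² (here taken as the squared Pearson correlation with the phenotype). It is not the VIPRS algorithm: the function names `polygenic_score` and `prediction_r2`, the simulated genotypes, and the effect sizes are all assumptions made for the example, and the effect sizes are simulated rather than inferred.

```python
import numpy as np


def polygenic_score(genotypes, effect_sizes):
    """Linear PRS: dot product of each individual's genotype dosages
    (rows) with the per-variant effect sizes."""
    return np.asarray(genotypes, dtype=float) @ np.asarray(effect_sizes, dtype=float)


def prediction_r2(phenotype, score):
    """Squared Pearson correlation between observed phenotype and score."""
    r = np.corrcoef(phenotype, score)[0, 1]
    return r ** 2


# Hypothetical toy data: 500 individuals, 200 variants, sparse effects.
rng = np.random.default_rng(0)
n_individuals, n_variants = 500, 200

# Dosages in {0, 1, 2}, drawn with an assumed allele frequency of 0.3.
genotypes = rng.binomial(2, 0.3, size=(n_individuals, n_variants))

# Roughly 10% of variants are causal; the rest have zero effect.
is_causal = rng.random(n_variants) < 0.1
true_beta = np.where(is_causal, rng.normal(0.0, 0.5, n_variants), 0.0)

# Phenotype = genetic component + unit-variance environmental noise.
phenotype = genotypes @ true_beta + rng.normal(0.0, 1.0, n_individuals)

score = polygenic_score(genotypes, true_beta)
r2 = prediction_r2(phenotype, score)
print(f"R^2 of the toy score: {r2:.3f}")
```

In practice the effect sizes would come from a PRS method fitted on GWAS summary statistics, and R² would be measured on held-out individuals.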
Pages: 741-761
Number of pages: 22
Related papers
50 in total
  • [31] Probabilistic model updating via variational Bayesian inference and adaptive Gaussian process modeling
    Ni, Pinghe
    Li, Jun
    Hao, Hong
    Han, Qiang
    Du, Xiuli
    COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2021, 383 (383)
  • [32] Variational Bayesian inference for fMRI time series
    Penny, W
    Kiebel, S
    Friston, KJ
    NEUROIMAGE, 2003, 19 (03) : 727 - 741
  • [33] BayesPy: Variational Bayesian Inference in Python
    Luttinen, Jaakko
    JOURNAL OF MACHINE LEARNING RESEARCH, 2016, 17
  • [34] An Introduction to Bayesian Inference via Variational Approximations
    Grimmer, Justin
    POLITICAL ANALYSIS, 2011, 19 (01) : 32 - 47
  • [35] Sparse Audio Inpainting with Variational Bayesian Inference
    Chantas, Giannis
    Nikolopoulos, Spiros
    Kompatsiaris, Ioannis
    2018 IEEE INTERNATIONAL CONFERENCE ON CONSUMER ELECTRONICS (ICCE), 2018,
  • [36] Variational prior replacement in Bayesian inference and inversion
    Zhao, Xuebin
    Curtis, Andrew
    GEOPHYSICAL JOURNAL INTERNATIONAL, 2024, 239 (02) : 1236 - 1256
  • [37] Robust, Accurate Stochastic Optimization for Variational Inference
    Dhaka, Akash Kumar
    Catalina, Alejandro
    Andersen, Michael Riis
    Magnusson, Mans
    Huggins, Jonathan H.
    Vehtari, Aki
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 33, NEURIPS 2020, 2020, 33
  • [38] VARIATIONAL BAYESIAN INFERENCE FOR STEREO OBJECT TRACKING
    Chantas, Giannis
    Nikolaidis, Nikos
    Pitas, Ioannis
    2013 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2013, : 2439 - 2443
  • [39] Streaming, Distributed Variational Inference for Bayesian Nonparametrics
    Campbell, Trevor
    Straub, Julian
    Fisher, John W., III
    How, Jonathan P.
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 28 (NIPS 2015), 2015, 28
  • [40] Variational inference for Bayesian mixtures of factor analysers
    Ghahramani, Z
    Beal, MJ
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 12, 2000, 12 : 449 - 455