Fast and accurate Bayesian polygenic risk modeling with variational inference

被引:8
|
作者
Zabad, Shadi [1 ]
Gravel, Simon [2 ]
Li, Yue [1 ]
机构
[1] McGill Univ, Sch Comp Sci, Montreal, PQ, Canada
[2] McGill Univ, Dept Human Genet, Montreal, PQ, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
GENOME-WIDE ASSOCIATION; HUMAN COMPLEX TRAITS; UK BIOBANK; VARIABLE SELECTION; MIXED-MODEL; PREDICTION; SCORES; RARE; REGRESSION; VARIANTS;
D O I
10.1016/j.ajhg.2023.03.009
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
The advent of large-scale genome-wide association studies (GWASs) has motivated the development of statistical methods for phenotype prediction with single-nucleotide polymorphism (SNP) array data. These polygenic risk score (PRS) methods use a multiple linear regres-sion framework to infer joint effect sizes of all genetic variants on the trait. Among the subset of PRS methods that operate on GWAS summary statistics, sparse Bayesian methods have shown competitive predictive ability. However, most existing Bayesian approaches employ Markov chain Monte Carlo (MCMC) algorithms, which are computationally inefficient , do not scale favorably to higher di-mensions, for posterior inference. Here, we introduce variational inference of polygenic risk scores (VIPRS), a Bayesian summary statis-tics-based PRS method that utilizes variational inference techniques to approximate the posterior distribution for the effect sizes. Our experiments with 36 simulation configurations and 12 real phenotypes from the UK Biobank dataset demonstrated that VIPRS is consis-tently competitive with the state-of-the-art in prediction accuracy while being more than twice as fast as popular MCMC-based ap-proaches. This performance advantage is robust across a variety of genetic architectures, SNP heritabilities , independent GWAS co-horts. In addition to its competitive accuracy on the "White British"samples, VIPRS showed improved transferability when applied to other ethnic groups, with up to 1.7-fold increase in R2 among individuals of Nigerian ancestry for low-density lipoprotein (LDL) cholesterol. To illustrate its scalability, we applied VIPRS to a dataset of 9.6 million genetic markers, which conferred further improvements in prediction accuracy for highly polygenic traits, such as height.
引用
收藏
页码:741 / 761
页数:22
相关论文
共 50 条
  • [21] Variational Bayesian Inference for Crowdsourcing Predictions
    Cai, Desmond
    Duc Thien Nguyen
    Lim, Shiau Hong
    Wynter, Laura
    2020 59TH IEEE CONFERENCE ON DECISION AND CONTROL (CDC), 2020, : 3166 - 3172
  • [22] A Geometric Variational Approach to Bayesian Inference
    Saha, Abhijoy
    Bharath, Karthik
    Kurtek, Sebastian
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (530) : 822 - 835
  • [23] Variational inference for Bayesian bridge regression
    Zanini C.T.P.
    Migon H.S.
    Dias R.
    Statistics and Computing, 2024, 34 (1)
  • [24] Variational Bayesian Inference of Line Spectra
    Badiu, Mihai-Alin
    Hansen, Thomas Lundgaard
    Fleury, Bernard Henri
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2017, 65 (09) : 2247 - 2261
  • [25] Dirichlet process mixture model based nonparametric Bayesian modeling and variational inference
    Fei, Zhengshun
    Liu, Kangling
    Huang, Bingqiang
    Zheng, Yongping
    Xiang, Xinjian
    2019 CHINESE AUTOMATION CONGRESS (CAC2019), 2019, : 3048 - 3051
  • [26] Fast Variational Inference for Bayesian Factor Analysis in Single and Multi-Study Settings
    Hansen, Blake
    Avalos-Pacheco, Alejandra
    Russo, Massimiliano
    De Vito, Roberta
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2025, 34 (01) : 96 - 108
  • [27] Projected Stein Variational Newton: A Fast and Scalable Bayesian Inference Method in High Dimensions
    Chen, Peng
    Wu, Keyi
    Chen, Joshua
    O'Leary-Roseberry, Thomas
    Ghattas, Omar
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [28] Bayesian inference analyses of the polygenic architecture of rheumatoid arthritis
    Stahl, Eli A.
    Wegmann, Daniel
    Trynka, Gosia
    Gutierrez-Achury, Javier
    Do, Ron
    Voight, Benjamin F.
    Kraft, Peter
    Chen, Robert
    Kallberg, Henrik J.
    Kurreeman, Fina A. S.
    Kathiresan, Sekar
    Wijmenga, Cisca
    Gregersen, Peter K.
    Alfredsson, Lars
    Siminovitch, Katherine A.
    Worthington, Jane
    de Bakker, Paul I. W.
    Raychaudhuri, Soumya
    Plenge, Robert M.
    NATURE GENETICS, 2012, 44 (05) : 483 - +
  • [29] Bayesian inference analyses of the polygenic architecture of rheumatoid arthritis
    Eli A Stahl
    Daniel Wegmann
    Gosia Trynka
    Javier Gutierrez-Achury
    Ron Do
    Benjamin F Voight
    Peter Kraft
    Robert Chen
    Henrik J Kallberg
    Fina A S Kurreeman
    Sekar Kathiresan
    Cisca Wijmenga
    Peter K Gregersen
    Lars Alfredsson
    Katherine A Siminovitch
    Jane Worthington
    Paul I W de Bakker
    Soumya Raychaudhuri
    Robert M Plenge
    Nature Genetics, 2012, 44 : 483 - 489
  • [30] VARIATIONAL BAYESIAN INFERENCE FOR NONLINEAR ACOUSTIC ECHO CANCELLATION USING ADAPTIVE CASCADE MODELING
    Malik, Sarmad
    Enzner, Gerald
    2012 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2012, : 37 - 40