PRSice-2: Polygenic Risk Score software for biobank-scale data

被引:863
|
作者
Choi, Shing Wan [1 ,2 ]
O'Reilly, Paul F. [1 ,2 ]
机构
[1] Kings Coll London, Inst Psychiat Psychol & Neurosci, MRC Social Genet & Dev Psychiat Ctr, De Crespigny Pk,Denmark Hill, London SE5 8AF, England
[2] Icahn Sch Med Mt Sinai, Dept Genet & Genom Sci, 1 Gustave L Levy Pl, New York, NY 10029 USA
来源
GIGASCIENCE | 2019年 / 8卷 / 07期
基金
英国医学研究理事会;
关键词
polygenic risk score; GWAS; imputation; PREDICTION;
D O I
10.1093/gigascience/giz082
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
t Background: Polygenic risk score (PRS) analyses have become an integral part of biomedical research, exploited to gain insights into shared aetiology among traits, to control for genomic profile in experimental studies, and to strengthen causal inference, among a range of applications. Substantial efforts are now devoted to biobank projects to collect large genetic and phenotypic data, providing unprecedented opportunity for genetic discovery and applications. To process the large-scale data provided by such biobank resources, highly efficient and scalable methods and software are required. Results: Here we introduce PRSice-2, an efficient and scalable software program for automating and simplifying PRS analyses on large-scale data. PRSice-2 handles both genotyped and imputed data, provides empirical association P-values free from inflation due to overfitting, supports different inheritance models, and can evaluate multiple continuous and binary target traits simultaneously. We demonstrate that PRSice-2 is dramatically faster and more memory-efficient than PRSice-1 and alternative PRS software, LDpred and lassosum, while having comparable predictive power. Conclusion: PRSice-2's combination of efficiency and power will be increasingly important as data sizes grow and as the applications of PRS become more sophisticated, e.g., when incorporated into high-dimensional or gene set-based analyses. PRSice-2 is written in C++, with an R script for plotting, and is freely available for download from http://PRSice.info.
引用
收藏
页数:6
相关论文
共 50 条
  • [1] PRSice: Polygenic Risk Score software
    Euesden, Jack
    Lewis, Cathryn M.
    O'Reilly, Paul F.
    BIOINFORMATICS, 2015, 31 (09) : 1466 - 1468
  • [2] POLYGENIC RISK SCORE SOFTWARE (PRSICE) AND INCREASING THE PREDICTIVE POWER OF PRS
    Euesden, Jack
    Socrates, Adam
    Lewis, Cathryn
    O'Reilly, Paul
    EUROPEAN NEUROPSYCHOPHARMACOLOGY, 2017, 27 : S110 - S110
  • [3] PRSice 2: POLYGENIC RISK SCORE SOFTWARE (UPDATED) AND ITS APPLICATION TO CROSS-TRAIT ANALYSES
    Choi, Shing Wan
    O'Reilly, Paul
    EUROPEAN NEUROPSYCHOPHARMACOLOGY, 2019, 29 : S832 - S832
  • [4] Haplotype estimation for biobank-scale data sets
    Jared O'Connell
    Kevin Sharp
    Nick Shrine
    Louise Wain
    Ian Hall
    Martin Tobin
    Jean-Francois Zagury
    Olivier Delaneau
    Jonathan Marchini
    Nature Genetics, 2016, 48 : 817 - 820
  • [5] Haplotype estimation for biobank-scale data sets
    O'Connell, Jared
    Sharp, Kevin
    Shrine, Nick
    Wain, Louise
    Hall, Ian
    Tobin, Martin
    Zagury, Jean-Francois
    Delaneau, Olivier
    Marchini, Jonathan
    NATURE GENETICS, 2016, 48 (07) : 817 - +
  • [6] Biobank-scale methods and projections for sparse polygenic prediction from machine learning
    Raben, Timothy G.
    Lello, Louis
    Widen, Erik
    Hsu, Stephen D. H.
    SCIENTIFIC REPORTS, 2023, 13 (01)
  • [7] A scalable estimator of SNP heritability for biobank-scale data
    Wu, Yue
    Sankararaman, Sriram
    BIOINFORMATICS, 2018, 34 (13) : 187 - 194
  • [8] Fast estimation of genetic correlation for biobank-scale data
    Wu, Yue
    Burch, Kathryn S.
    Ganna, Andrea
    Pajukanta, Paivi
    Pasaniuc, Bogdan
    Sankararaman, Sriram
    AMERICAN JOURNAL OF HUMAN GENETICS, 2022, 109 (01) : 24 - 32
  • [9] Biobank-scale methods and projections for sparse polygenic prediction from machine learning
    Timothy G. Raben
    Louis Lello
    Erik Widen
    Stephen D. H. Hsu
    Scientific Reports, 13
  • [10] SIMBSIG: similarity search and clustering for biobank-scale data
    Adamer, Michael F.
    Roellin, Eljas
    Bourguignon, Lucie
    Borgwardt, Karsten
    BIOINFORMATICS, 2023, 39 (01)