PRSice-2: Polygenic Risk Score software for biobank-scale data

被引:863
|
作者
Choi, Shing Wan [1 ,2 ]
O'Reilly, Paul F. [1 ,2 ]
机构
[1] Kings Coll London, Inst Psychiat Psychol & Neurosci, MRC Social Genet & Dev Psychiat Ctr, De Crespigny Pk,Denmark Hill, London SE5 8AF, England
[2] Icahn Sch Med Mt Sinai, Dept Genet & Genom Sci, 1 Gustave L Levy Pl, New York, NY 10029 USA
来源
GIGASCIENCE | 2019年 / 8卷 / 07期
基金
英国医学研究理事会;
关键词
polygenic risk score; GWAS; imputation; PREDICTION;
D O I
10.1093/gigascience/giz082
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
t Background: Polygenic risk score (PRS) analyses have become an integral part of biomedical research, exploited to gain insights into shared aetiology among traits, to control for genomic profile in experimental studies, and to strengthen causal inference, among a range of applications. Substantial efforts are now devoted to biobank projects to collect large genetic and phenotypic data, providing unprecedented opportunity for genetic discovery and applications. To process the large-scale data provided by such biobank resources, highly efficient and scalable methods and software are required. Results: Here we introduce PRSice-2, an efficient and scalable software program for automating and simplifying PRS analyses on large-scale data. PRSice-2 handles both genotyped and imputed data, provides empirical association P-values free from inflation due to overfitting, supports different inheritance models, and can evaluate multiple continuous and binary target traits simultaneously. We demonstrate that PRSice-2 is dramatically faster and more memory-efficient than PRSice-1 and alternative PRS software, LDpred and lassosum, while having comparable predictive power. Conclusion: PRSice-2's combination of efficiency and power will be increasingly important as data sizes grow and as the applications of PRS become more sophisticated, e.g., when incorporated into high-dimensional or gene set-based analyses. PRSice-2 is written in C++, with an R script for plotting, and is freely available for download from http://PRSice.info.
引用
收藏
页数:6
相关论文
共 50 条
  • [21] Efficient polygenic risk scores for biobank scale data by exploiting phenotypes from inferred relatives
    Buu Truong
    Xuan Zhou
    Jisu Shin
    Jiuyong Li
    Julius H. J. van der Werf
    Thuc D. Le
    S. Hong Lee
    Nature Communications, 11
  • [22] Quantifying the contribution of dominance deviation effects to complex trait variation in biobank-scale data
    Pazokitoroudi, Ali
    Chiu, Alec M.
    Burch, Kathryn S.
    Pasaniuc, Bogdan
    Sankararaman, Sriram
    AMERICAN JOURNAL OF HUMAN GENETICS, 2021, 108 (05) : 799 - 808
  • [23] FlashPCA2: principal component analysis of Biobank-scale genotype datasets
    Abraham, Gad
    Qiu, Yixuan
    Inouye, Michael
    BIOINFORMATICS, 2017, 33 (17) : 2776 - 2778
  • [24] Venous Thromboembolism Polygenic Risk Score Associates With Pulmonary Hypertension in the UK Biobank
    Clapham, Katharine R.
    Mesbah Uddin, Md
    Honigberg, Michael C.
    Gilliland, Thomas
    Ruan, Yunfeng
    Natarajan, Pradeep
    CIRCULATION-GENOMIC AND PRECISION MEDICINE, 2022, 15 (06): : 605 - 607
  • [25] Accurate estimation of SNP-heritability from biobank-scale data irrespective of genetic architecture
    Kangcheng Hou
    Kathryn S. Burch
    Arunabha Majumdar
    Huwenbo Shi
    Nicholas Mancuso
    Yue Wu
    Sriram Sankararaman
    Bogdan Pasaniuc
    Nature Genetics, 2019, 51 : 1244 - 1251
  • [26] Accurate estimation of SNP-heritability from biobank-scale data irrespective of genetic architecture
    Hou, Kangcheng
    Burch, Kathryn S.
    Majumdar, Arunabha
    Shi, Huwenbo
    Mancuso, Nicholas
    Wu, Yue
    Sankararaman, Sriram
    Pasaniuc, Bogdan
    NATURE GENETICS, 2019, 51 (08) : 1244 - +
  • [27] MultiSTAAR delivers multi-trait rare variant analysis of biobank-scale sequencing data
    Nature Computational Science, 2025, 5 (2): : 101 - 102
  • [28] PRSet: Pathway-specific polygenic risk score software
    Choi, Shing Wan
    Wu, Hei Man
    O'Reilly, Paul F.
    GENETIC EPIDEMIOLOGY, 2020, 44 (05) : 476 - 476
  • [29] Interaction of obesity polygenic score with lifestyle risk factors in an electronic health record biobank
    Dashti, Hassan S.
    Miranda, Nicole
    Cade, Brian E.
    Huang, Tianyi
    Redline, Susan
    Karlson, Elizabeth W.
    Saxena, Richa
    BMC MEDICINE, 2022, 20 (01)
  • [30] A systematic evaluation of the performance and properties of the UK Biobank Polygenic Risk Score (PRS) Release
    Thompson, Deborah J.
    Wells, Daniel
    Selzam, Saskia
    Peneva, Iliana
    Moore, Rachel
    Sharp, Kevin
    Tarran, William A.
    Beard, Edward J.
    Riveros-Mckay, Fernando
    Giner-Delgado, Carla
    Palmer, Duncan
    Seth, Priyanka
    Harrison, James
    Futema, Marta
    McVean, Gil
    Plagnol, Vincent
    Donnelly, Peter
    Weale, Michael E.
    PLOS ONE, 2024, 19 (09):