Estimating disease prevalence in large datasets using genetic risk scores

被引:0
|
作者
Benjamin D. Evans
Piotr Słowiński
Andrew T. Hattersley
Samuel E. Jones
Seth Sharp
Robert A. Kimmitt
Michael N. Weedon
Richard A. Oram
Krasimira Tsaneva-Atanasova
Nicholas J. Thomas
机构
[1] University of Exeter,Department of Mathematics
[2] University of Exeter,Living Systems Institute, Centre for Biomedical Modelling and Analysis
[3] University of Bristol,School of Psychological Science
[4] University of Exeter,Living Systems Institute, Translational Research Exchange @ Exeter
[5] University of Exeter Medical School,undefined
[6] Institute of Biomedical & Clinical Science,undefined
[7] Royal Devon & Exeter NHS Foundation Trust,undefined
[8] Living Systems Institute,undefined
[9] EPSRC Hub for Quantitative Modelling in Healthcare,undefined
[10] University of Exeter,undefined
来源
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Clinical classification is essential for estimating disease prevalence but is difficult, often requiring complex investigations. The widespread availability of population level genetic data makes novel genetic stratification techniques a highly attractive alternative. We propose a generalizable mathematical framework for determining disease prevalence within a cohort using genetic risk scores. We compare and evaluate methods based on the means of genetic risk scores’ distributions; the Earth Mover’s Distance between distributions; a linear combination of kernel density estimates of distributions; and an Excess method. We demonstrate the performance of genetic stratification to produce robust prevalence estimates. Specifically, we show that robust estimates of prevalence are still possible even with rarer diseases, smaller cohort sizes and less discriminative genetic risk scores, highlighting the general utility of these approaches. Genetic stratification techniques offer exciting new research tools, enabling unbiased insights into disease prevalence and clinical characteristics unhampered by clinical classification criteria.
引用
收藏
相关论文
共 50 条
  • [1] Estimating disease prevalence in large datasets using genetic risk scores
    Evans, Benjamin D.
    Slowinski, Piotr
    Hattersley, Andrew T.
    Jones, Samuel E.
    Sharp, Seth
    Kimmitt, Robert A.
    Weedon, Michael N.
    Oram, Richard A.
    Tsaneva-Atanasova, Krasimira
    Thomas, Nicholas J.
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [2] Estimating the Prevalence and Genetic Risk Mechanisms of ARFID in a Large Autism Cohort
    Koomar, Tanner
    Thomas, Taylor R.
    Pottschmidt, Natalie R.
    Lutter, Michael
    Michaelson, Jacob J.
    FRONTIERS IN PSYCHIATRY, 2021, 12
  • [3] Estimating heritability and genetic correlations from large health datasets in the absence of genetic data
    Jia, Gengjie
    Li, Yu
    Zhang, Hanxin
    Chattopadhyay, Ishanu
    Jensen, Anders Boeck
    Blair, David R.
    Davis, Lea
    Robinson, Peter N.
    Dahlen, Torsten
    Brunak, Soren
    Benson, Mikael
    Edgren, Gustaf
    Cox, Nancy J.
    Gao, Xin
    Rzhetsky, Andrey
    NATURE COMMUNICATIONS, 2019, 10 (1)
  • [4] Estimating heritability and genetic correlations from large health datasets in the absence of genetic data
    Gengjie Jia
    Yu Li
    Hanxin Zhang
    Ishanu Chattopadhyay
    Anders Boeck Jensen
    David R. Blair
    Lea Davis
    Peter N. Robinson
    Torsten Dahlén
    Søren Brunak
    Mikael Benson
    Gustaf Edgren
    Nancy J. Cox
    Xin Gao
    Andrey Rzhetsky
    Nature Communications, 10
  • [5] Calibrated rare variant genetic risk scores for complex disease prediction using large exome sequence repositories
    Ricky Lali
    Michael Chong
    Arghavan Omidi
    Pedrum Mohammadi-Shemirani
    Ann Le
    Edward Cui
    Guillaume Paré
    Nature Communications, 12
  • [6] Calibrated rare variant genetic risk scores for complex disease prediction using large exome sequence repositories
    Lali, Ricky
    Chong, Michael
    Omidi, Arghavan
    Mohammadi-Shemirani, Pedrum
    Le, Ann
    Cui, Edward
    Pare, Guillaume
    NATURE COMMUNICATIONS, 2021, 12 (01)
  • [7] GENETIC RISK SCORES AND CORONARY HEART DISEASE RISK
    Bloch, Michael J.
    JOURNAL OF THE AMERICAN SOCIETY OF HYPERTENSION, 2015, 9 (08) : 580 - 581
  • [8] Prediction of Coronary Artery Disease using Traditional and Genetic Risk Scores for Cardiovascular Risk Factors
    Ramirez, Julia
    van Duijvenboden, Stefan
    Young, William J.
    Tinker, Andrew
    Lambiase, Pier D.
    Orini, Michele
    Munroe, Patricia B.
    GENETIC EPIDEMIOLOGY, 2021, 45 (07) : 783 - 783
  • [9] Estimating prevalence of human traits among populations from polygenic risk scores
    Britney E. Graham
    Brian Plotkin
    Louis Muglia
    Jason H. Moore
    Scott M. Williams
    Human Genomics, 15
  • [10] Estimating prevalence of human traits among populations from polygenic risk scores
    Graham, Britney E.
    Plotkin, Brian
    Muglia, Louis
    Moore, Jason H.
    Williams, Scott M.
    HUMAN GENOMICS, 2021, 15 (01)