Estimating disease prevalence in large datasets using genetic risk scores

被引:0
|
作者
Benjamin D. Evans
Piotr Słowiński
Andrew T. Hattersley
Samuel E. Jones
Seth Sharp
Robert A. Kimmitt
Michael N. Weedon
Richard A. Oram
Krasimira Tsaneva-Atanasova
Nicholas J. Thomas
机构
[1] University of Exeter,Department of Mathematics
[2] University of Exeter,Living Systems Institute, Centre for Biomedical Modelling and Analysis
[3] University of Bristol,School of Psychological Science
[4] University of Exeter,Living Systems Institute, Translational Research Exchange @ Exeter
[5] University of Exeter Medical School,undefined
[6] Institute of Biomedical & Clinical Science,undefined
[7] Royal Devon & Exeter NHS Foundation Trust,undefined
[8] Living Systems Institute,undefined
[9] EPSRC Hub for Quantitative Modelling in Healthcare,undefined
[10] University of Exeter,undefined
来源
Nature Communications | / 12卷
关键词
D O I
暂无
中图分类号
学科分类号
摘要
Clinical classification is essential for estimating disease prevalence but is difficult, often requiring complex investigations. The widespread availability of population level genetic data makes novel genetic stratification techniques a highly attractive alternative. We propose a generalizable mathematical framework for determining disease prevalence within a cohort using genetic risk scores. We compare and evaluate methods based on the means of genetic risk scores’ distributions; the Earth Mover’s Distance between distributions; a linear combination of kernel density estimates of distributions; and an Excess method. We demonstrate the performance of genetic stratification to produce robust prevalence estimates. Specifically, we show that robust estimates of prevalence are still possible even with rarer diseases, smaller cohort sizes and less discriminative genetic risk scores, highlighting the general utility of these approaches. Genetic stratification techniques offer exciting new research tools, enabling unbiased insights into disease prevalence and clinical characteristics unhampered by clinical classification criteria.
引用
收藏
相关论文
共 50 条
  • [31] Risk assessment for hypercholesterolemia using LDL and LPA genetic scores
    Schwaninger, Gunda
    Forer, Lukas
    Ebenbichler, Christoph
    Dieplinger, Hans
    Kronenberg, Florian
    Zschocke, Johannes
    Witsch-Baumgartner, Martina
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2024, 32 : 683 - 684
  • [32] Exploratory analysis of the genetics of impulse control disorders in Parkinson's disease using genetic risk scores
    Faouzi, Johann
    Couvy-Duchesne, Baptiste
    Bekadar, Samir
    Colliot, Olivier
    Corvol, Jean-Christophe
    PARKINSONISM & RELATED DISORDERS, 2021, 86 : 74 - 77
  • [33] On the use of genetic risk scores to predict cardiovascular disease in the general population
    Pereira, Alexandre C.
    HEART, 2016, 102 (20) : 1612 - 1613
  • [34] Age Dependence of Genetic Risk Scores in Relation to Coronary Artery Disease
    Liu, Chang
    Huang, Yunfeng
    Hui, Qin
    Quyyumi, Arshed A.
    Sun, Yan
    CIRCULATION, 2020, 141
  • [35] The evolution of genetic-based risk scores for lipids and cardiovascular disease
    Dron, Jacqueline S.
    Hegele, Robert A.
    CURRENT OPINION IN LIPIDOLOGY, 2019, 30 (02) : 71 - 81
  • [36] Genetic Polymorphisms and Genetic Risk Scores Contribute to the Risk of Coronary Artery Disease (CAD) in a North Indian Population
    Mastana, Sarabjit
    Halai, Kushni Charisma
    Akam, Liz
    Hunter, David John
    Singh, Puneetpal
    INTERNATIONAL JOURNAL OF MOLECULAR SCIENCES, 2024, 25 (15)
  • [37] Prevalence and Effect of Genetic Risk of Thromboembolic Disease in Inflammatory Bowel Disease
    Naito, Takeo
    Botwin, Gregory J.
    Haritunians, Talin
    Li, Dalin
    Yang, Shaohong
    Khrom, Michelle
    Braun, Jonathan
    Abbou, Lisa
    Mengesha, Emebet
    Stevens, Christine
    Masamune, Atsushi
    Daly, Mark
    McGovern, Dermot P. B.
    GASTROENTEROLOGY, 2021, 160 (03) : 771 - +
  • [38] Large Datasets in Healthcare - an Opportunity or a Risk?
    Pigeot, Iris
    Jacobs, Svenja
    Koch-Gromus, Uwe
    BUNDESGESUNDHEITSBLATT-GESUNDHEITSFORSCHUNG-GESUNDHEITSSCHUTZ, 2015, 58 (08) : 785 - 787
  • [39] Scaling genetic programming to large datasets using hierarchical dynamic subset selection
    Curry, Robert
    Lichodzijewski, Peter
    Heywood, Malcolm I.
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (04): : 1065 - 1073
  • [40] Estimating the performance of three cardiovascular disease risk scores: the Estonian Biobank cohort study
    Saar, Aet
    Lall, Kristi
    Alver, Maris
    Marandi, Toomas
    Ainla, Tiia
    Eha, Jaan
    Metspalu, Andres
    Fischer, Krista
    JOURNAL OF EPIDEMIOLOGY AND COMMUNITY HEALTH, 2019, 73 (03) : 272 - 277