Machine Learning to Advance Human Genome-Wide Association Studies

被引:2
|
作者
Sigala, Rafaella E. [1 ]
Lagou, Vasiliki [1 ]
Shmeliov, Aleksey [1 ]
Atito, Sara [2 ,3 ]
Kouchaki, Samaneh [2 ,3 ]
Awais, Muhammad [2 ,3 ]
Prokopenko, Inga [1 ,2 ]
Mahdi, Adam [4 ]
Demirkan, Ayse [1 ,2 ]
机构
[1] Dept Clin & Expt Med, Sect Stat Multiom, Guildford GU2 7XH, Surrey, England
[2] Univ Surrey, Surrey Inst People Centred Artificial Intelligence, Guildford GU2 7XH, Surrey, England
[3] Univ Surrey, Ctr Vis Speech Signal Proc, Guildford GU2 7XH, Surrey, England
[4] Univ Oxford, Oxford Internet Inst, Oxford OX1 3JS, Oxon, England
关键词
genome-wide association; human genetics; machine learning; RISK PREDICTION; GENE; DISEASE; GWAS; PRIORITIZATION; SCHIZOPHRENIA; DISCOVERY; VARIANTS; OBESITY; FTO;
D O I
10.3390/genes15010034
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Machine learning, including deep learning, reinforcement learning, and generative artificial intelligence are revolutionising every area of our lives when data are made available. With the help of these methods, we can decipher information from larger datasets while addressing the complex nature of biological systems in a more efficient way. Although machine learning methods have been introduced to human genetic epidemiological research as early as 2004, those were never used to their full capacity. In this review, we outline some of the main applications of machine learning to assigning human genetic loci to health outcomes. We summarise widely used methods and discuss their advantages and challenges. We also identify several tools, such as Combi, GenNet, and GMSTool, specifically designed to integrate these methods for hypothesis-free analysis of genetic variation data. We elaborate on the additional value and limitations of these tools from a geneticist's perspective. Finally, we discuss the fast-moving field of foundation models and large multi-modal omics biobank initiatives.
引用
收藏
页数:18
相关论文
共 50 条
  • [21] Singleton SNPs in the human genome and implications for genome-wide association studies
    Ke, Xiayi
    Taylor, Martin S.
    Cardon, Lon R.
    EUROPEAN JOURNAL OF HUMAN GENETICS, 2008, 16 (04) : 506 - 515
  • [22] Learning Hierarchical Bayesian Networks for Genome-Wide Association Studies
    Mourad, Raphael
    Sinoquet, Christine
    Leray, Philippe
    COMPSTAT'2010: 19TH INTERNATIONAL CONFERENCE ON COMPUTATIONAL STATISTICS, 2010, : 549 - 556
  • [23] GENOME-WIDE ASSOCIATION STUDIES Validating, augmenting and refining genome-wide association signals
    Ioannidis, John P. A.
    Thomas, Gilles
    Daly, Mark J.
    NATURE REVIEWS GENETICS, 2009, 10 (05) : 318 - 329
  • [24] Pulmonary Function: From Genome-Wide Association Studies to Genome-Wide Interaction Studies
    Christiani, David C.
    AMERICAN JOURNAL OF RESPIRATORY AND CRITICAL CARE MEDICINE, 2019, 199 (05) : 557 - 559
  • [25] Genome-wide association studies are coming for human infectious diseases
    Davila, Sonia
    Hibberd, Martin L.
    GENOME MEDICINE, 2009, 1
  • [26] Genome-wide association studies are coming for human infectious diseases
    Sonia Davila
    Martin L Hibberd
    Genome Medicine, 1
  • [27] Genome-wide association studies of human adiposity: Zooming in on synapses
    Sandholt, Camilla H.
    Grarup, Niels
    Pedersen, Oluf
    Hansen, Torben
    MOLECULAR AND CELLULAR ENDOCRINOLOGY, 2015, 418 : 90 - 100
  • [28] Genome-wide association studies: A new era in human genetics
    Cambien F.
    Current Cardiovascular Risk Reports, 2007, 1 (4) : 271 - 272
  • [29] Genome-Wide Association Studies of Autism
    Glessner J.T.
    Connolly J.J.
    Hakonarson H.
    Current Behavioral Neuroscience Reports, 2014, 1 (4) : 234 - 241
  • [30] Genome-Wide Association Studies in Atherosclerosis
    Sivapalaratnam, S.
    Motazacker, M. M.
    Maiwald, S.
    Hovingh, G. K.
    Kastelein, J. J. P.
    Levi, M.
    Trip, M. D.
    Dallinga-Thie, G. M.
    CURRENT ATHEROSCLEROSIS REPORTS, 2011, 13 (03) : 225 - 232