Machine Learning to Advance Human Genome-Wide Association Studies

被引:2
|
作者
Sigala, Rafaella E. [1 ]
Lagou, Vasiliki [1 ]
Shmeliov, Aleksey [1 ]
Atito, Sara [2 ,3 ]
Kouchaki, Samaneh [2 ,3 ]
Awais, Muhammad [2 ,3 ]
Prokopenko, Inga [1 ,2 ]
Mahdi, Adam [4 ]
Demirkan, Ayse [1 ,2 ]
机构
[1] Dept Clin & Expt Med, Sect Stat Multiom, Guildford GU2 7XH, Surrey, England
[2] Univ Surrey, Surrey Inst People Centred Artificial Intelligence, Guildford GU2 7XH, Surrey, England
[3] Univ Surrey, Ctr Vis Speech Signal Proc, Guildford GU2 7XH, Surrey, England
[4] Univ Oxford, Oxford Internet Inst, Oxford OX1 3JS, Oxon, England
关键词
genome-wide association; human genetics; machine learning; RISK PREDICTION; GENE; DISEASE; GWAS; PRIORITIZATION; SCHIZOPHRENIA; DISCOVERY; VARIANTS; OBESITY; FTO;
D O I
10.3390/genes15010034
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Machine learning, including deep learning, reinforcement learning, and generative artificial intelligence are revolutionising every area of our lives when data are made available. With the help of these methods, we can decipher information from larger datasets while addressing the complex nature of biological systems in a more efficient way. Although machine learning methods have been introduced to human genetic epidemiological research as early as 2004, those were never used to their full capacity. In this review, we outline some of the main applications of machine learning to assigning human genetic loci to health outcomes. We summarise widely used methods and discuss their advantages and challenges. We also identify several tools, such as Combi, GenNet, and GMSTool, specifically designed to integrate these methods for hypothesis-free analysis of genetic variation data. We elaborate on the additional value and limitations of these tools from a geneticist's perspective. Finally, we discuss the fast-moving field of foundation models and large multi-modal omics biobank initiatives.
引用
收藏
页数:18
相关论文
共 50 条
  • [11] Automated machine learning for genome wide association studies
    Lakiotaki, Kleanthi
    Papadovasilakis, Zaharias
    Lagani, Vincenzo
    Fafalios, Stefanos
    Charonyktakis, Paulos
    Tsagris, Michail
    Tsamardinos, Ioannis
    BIOINFORMATICS, 2023, 39 (09)
  • [12] A machine-compiled database of genome-wide association studies
    Kuleshov, Volodymyr
    Ding, Jialin
    Vo, Christopher
    Hancock, Braden
    Ratner, Alexander
    Li, Yang
    Re, Christopher
    Batzoglou, Serafim
    Snyder, Michael
    NATURE COMMUNICATIONS, 2019, 10 (1)
  • [13] Genome-wide association studies
    Nature Reviews Methods Primers, 1
  • [14] A machine-compiled database of genome-wide association studies
    Volodymyr Kuleshov
    Jialin Ding
    Christopher Vo
    Braden Hancock
    Alexander Ratner
    Yang Li
    Christopher Ré
    Serafim Batzoglou
    Michael Snyder
    Nature Communications, 10
  • [15] Genome-wide association studies
    Willson, Joseph
    NATURE REVIEWS METHODS PRIMERS, 2021, 1 (01):
  • [16] Genome-Wide Association Studies
    Guo, Xiuqing
    Rotter, Jerome I.
    JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2019, 322 (17): : 1705 - 1706
  • [17] Genome-Wide Association Studies of Human Growth Traits
    Weedon, Michael N.
    RECENT ADVANCES IN GROWTH RESEARCH: NUTRITIONAL, MOLECULAR AND ENDOCRINE PERSPECTIVES, 2013, 71 : 29 - 38
  • [18] Genome-Wide Association Studies of the Human Gut Microbiota
    Davenport, Emily R.
    Cusanovich, Darren A.
    Michelini, Katelyn
    Barreiro, Luis B.
    Ober, Carole
    Gilad, Yoav
    PLOS ONE, 2015, 10 (11):
  • [19] Progress in Genome-Wide Association Studies of Human Height
    Hirschhorn, Joel N.
    Lettre, Guillaume
    HORMONE RESEARCH, 2009, 71 : 5 - 13
  • [20] Singleton SNPs in the human genome and implications for genome-wide association studies
    Xiayi Ke
    Martin S Taylor
    Lon R Cardon
    European Journal of Human Genetics, 2008, 16 : 506 - 515