Bi-level structured functional analysis for genome-wide association studies

被引:2
|
作者
Wu, Mengyun [1 ]
Wang, Fan [2 ,3 ]
Ge, Yeheng
Ma, Shuangge [4 ]
Li, Yang [2 ,3 ]
机构
[1] Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai, Peoples R China
[2] Renmin Univ China, Ctr Appl Stat, Sch Stat, Beijing, Peoples R China
[3] Renmin Univ China, Stat Consulting Ctr, Beijing, Peoples R China
[4] Yale Sch Publ Hlth, Dept Biostat, New Haven, CT USA
基金
美国国家卫生研究院; 中国国家自然科学基金;
关键词
bi-level selection; functional analysis; genome-wide association study; structured analysis; PENALIZED REGRESSION; VARIABLE SELECTION; ESTIMATOR;
D O I
10.1111/biom.13871
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Genome-wide association studies (GWAS) have led to great successes in identifying genotype-phenotype associations for complex human diseases. In such studies, the high dimensionality of single nucleotide polymorphisms (SNPs) often makes analysis difficult. Functional analysis, which interprets SNPs densely distributed in a chromosomal region as a continuous process rather than discrete observations, has emerged as a promising avenue for overcoming the high dimensionality challenges. However, the majority of the existing functional studies continue to be individual SNP based and are unable to sufficiently account for the intricate underpinning structures of SNP data. SNPs are often found in groups (e.g., genes or pathways) and have a natural group structure. Additionally, these SNP groups can be highly correlated with coordinated biological functions and interact in a network. Motivated by these unique characteristics of SNP data, we develop a novel bi-level structured functional analysis method and investigate disease-associated genetic variants at the SNP level and SNP group level simultaneously. The penalization technique is adopted for bi-level selection and also to accommodate the group-level network structure. Both the estimation and selection consistency properties are rigorously established. The superiority of the proposed method over alternatives is shown through extensive simulation studies. A type 2 diabetes SNP data application yields some biologically intriguing results.
引用
收藏
页码:3359 / 3373
页数:15
相关论文
共 50 条
  • [1] Lessons from Functional Analysis of Genome-Wide Association Studies
    Sur, Inderpreet
    Tuupanen, Sari
    Whitington, Thomas
    Aaltonen, Lauri A.
    Taipale, Jussi
    CANCER RESEARCH, 2013, 73 (14) : 4180 - 4184
  • [2] An Analysis Pipeline for Genome-wide Association Studies
    Stefanov, Stefan
    Lautenberger, James
    Gold, Bert
    CANCER INFORMATICS, 2008, 6 : 455 - +
  • [3] REPLICABILITY ANALYSIS FOR GENOME-WIDE ASSOCIATION STUDIES
    Heller, Ruth
    Yekutieli, Daniel
    ANNALS OF APPLIED STATISTICS, 2014, 8 (01): : 481 - 498
  • [4] Power analysis for genome-wide association studies
    Robert J Klein
    BMC Genetics, 8
  • [5] Power analysis for genome-wide association studies
    Klein, Robert J.
    BMC GENETICS, 2007, 8 (1)
  • [6] SNP2GO: Functional Analysis of Genome-Wide Association Studies
    Szkiba, David
    Kapun, Martin
    von Haeseler, Arndt
    Gallach, Miguel
    GENETICS, 2014, 197 (01) : 285 - 289
  • [7] Functional analysis of candidate genes from genome-wide association studies of hearing
    Ingham, Neil J.
    Rook, Victoria
    Di Domenico, Francesca
    James, Elysia
    Lewis, Morag A.
    Girotto, Giorgia
    Buniello, Annalisa
    Steel, Karen P.
    HEARING RESEARCH, 2020, 387
  • [8] A Powerful Gene Level Association Test for Genome-wide Association Studies
    Zhao, Jingyuan
    Keong, Garrett Teoh Hor
    Thalamuthu, Anbupalam
    GENETIC EPIDEMIOLOGY, 2012, 36 (07) : 761 - 762
  • [9] Genome-wide association studies
    Nature Reviews Methods Primers, 1
  • [10] Genome-wide association studies
    Willson, Joseph
    NATURE REVIEWS METHODS PRIMERS, 2021, 1 (01):