Stability Selection for Genome-Wide Association

被引:46
|
作者
Alexander, David H. [1 ]
Lange, Kenneth [2 ,3 ,4 ]
机构
[1] Univ Calif Los Angeles, David Geffen Sch Med, Dept Biomath, Los Angeles, CA 90095 USA
[2] Univ Calif Los Angeles, Dept Biomath, Los Angeles, CA 90095 USA
[3] Univ Calif Los Angeles, Dept Human Genet, Los Angeles, CA 90095 USA
[4] Univ Calif Los Angeles, Dept Stat, Los Angeles, CA 90095 USA
关键词
genome-wide association; variable selection; stability selection; the lasso; Wellcome Trust Case Control Consortium data; VARIABLE SELECTION; LASSO; LOCI; RISK;
D O I
10.1002/gepi.20623
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
This article applies the recently proposed "stability selection'' procedure of Meinshausen and Buhlmann to the problem of variable selection in genome-wide association. In particular, it explores whether stability selection can identify new regions of interest originally missed or can call into legitimate question regions originally flagged. Our analysis of the seven data sets of the Wellcome Trust Case-Control Consortium suggests that stability selection effectively controls the family-wise error rate but suffers a loss of power. The extensive correlation structure among SNP markers induced by linkage disequilibrium renders the procedure too conservative, causing it to miss regions known to be highly significant from simple marginal analyses. As a remedy one can aggregate nearby SNPs into groups and select groups rather than individual SNPs. The modified procedure can accurately identify the most important regions of genome-wide association, but in a simulation study it still offers less power than simpler and less computationally intensive methods of marginal association testing. Genet. Epidemiol. 35:722-728, 2011. (C) 2011 Wiley Periodicals, Inc.
引用
收藏
页码:722 / 728
页数:7
相关论文
共 50 条
  • [41] Social determinants of health and selection bias in genome-wide association studies
    Riehm, Kira E. E.
    Keyes, Katherine M. M.
    Susser, Ezra S. S.
    WORLD PSYCHIATRY, 2023, 22 (01) : 160 - 161
  • [42] Statistical Power of Model Selection Strategies for Genome-Wide Association Studies
    Wu, Zheyang
    Zhao, Hongyu
    PLOS GENETICS, 2009, 5 (07):
  • [43] Practical Issues in Screening and Variable Selection in Genome-Wide Association Analysis
    Hong, Sungyeon
    Kim, Yongkang
    Park, Taesung
    CANCER INFORMATICS, 2014, 13 : 55 - 65
  • [44] An efficient unified model for genome-wide association studies and genomic selection
    Hengde Li
    Guosheng Su
    Li Jiang
    Zhenmin Bao
    Genetics Selection Evolution, 49
  • [45] An efficient unified model for genome-wide association studies and genomic selection
    Li, Hengde
    Su, Guosheng
    Jiang, Li
    Bao, Zhenmin
    GENETICS SELECTION EVOLUTION, 2017, 49
  • [46] Statistical methods adopted in genome-wide association study and genomic selection
    Hayashi, Takeshi
    GENES & GENETIC SYSTEMS, 2011, 86 (06) : 393 - 393
  • [47] Nonlinear post-selection inference for genome-wide association studies
    Slim, Lotfi
    Chatelain, Clement
    Azencott, Chloe-Agathe
    BIOCOMPUTING 2022, PSB 2022, 2022, : 349 - 360
  • [48] Iterative hard thresholding for model selection in genome-wide association studies
    Keys, Kevin L.
    Chen, Gary K.
    Lange, Kenneth
    GENETIC EPIDEMIOLOGY, 2017, 41 (08) : 756 - 768
  • [49] Genome-wide pathway analysis of a genome-wide association study on multiple sclerosis
    Gwan Gyu Song
    Sung Jae Choi
    Jong Dae Ji
    Young Ho Lee
    Molecular Biology Reports, 2013, 40 : 2557 - 2564
  • [50] Genome-wide pathway analysis of a genome-wide association study on multiple sclerosis
    Song, Gwan Gyu
    Choi, Sung Jae
    Ji, Jong Dae
    Lee, Young Ho
    MOLECULAR BIOLOGY REPORTS, 2013, 40 (03) : 2557 - 2564