Statistical Selection of Biological Models for Genome-Wide Association Analyses

Cited by: 0
Authors
Bi, Wenjian [1]
Kang, Guolian [1]
Pounds, Stanley B. [1]
Affiliations
[1] St Jude Childrens Res Hosp, Dept Biostat, 332 N Lauderdale St, Memphis, TN 38105 USA
Funding
National Institutes of Health (USA);
Keywords
biological models; genome-wide association study; multiple adjusted evidence weights; two-stage discovery validation study; FALSE DISCOVERY RATES; FETAL-HEMOGLOBIN; P-VALUES; IDENTIFICATION; MICROARRAY; PHENOTYPE;
DOI
Not available
Chinese Library Classification (CLC) number
TP39 [Computer applications];
Discipline classification code
081203 ; 0835 ;
Abstract
Genome-wide association studies have discovered many biologically important associations of genes with phenotypes. Typically, genome-wide association analyses formally test the association of each genetic feature (SNP, CNV, etc.) with the phenotype of interest and summarize the results with multiplicity-adjusted p-values. However, even very small p-values provide evidence only against the null hypothesis of no association; they do not indicate which biological model best explains the observed data. Correctly identifying a specific biological model may improve the scientific interpretation and can help investigators select and design a follow-up validation study more effectively. Thus, statistical methodology to identify the correct biological model for a particular genotype-phenotype association can be very useful to investigators. Here, we propose a general statistical method to summarize how accurately each of five biological models (null, additive, dominant, recessive, co-dominant) represents the data observed for each variant in a GWAS. We show that the new method stringently controls the false discovery rate and asymptotically selects the correct biological model. Simulations of two-stage discovery-validation studies show that the new method has these properties and that its validation power is similar to or exceeds that of simple methods that use the same statistical model for all SNPs. Example analyses of three data sets also highlight these advantages of the new method. An R package is freely available at www.stjuderesearch.org/site/depts/biostats/software.
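The five biological models named in the abstract are conventionally distinguished by how a genotype, counted as copies of the minor allele, is coded as a covariate. The Python sketch below illustrates that standard coding only. It is not the authors' evidence-weight method or their R package, and the function name `encode_genotype` is hypothetical.

```python
from typing import Dict, List


def encode_genotype(g: int) -> Dict[str, List[int]]:
    """Illustrative coding of one genotype under five biological models.

    `g` is the number of minor-allele copies (0, 1, or 2). The returned
    lists are the covariate values each model would use for this genotype.
    This is a textbook-style coding assumed for illustration, not taken
    from the paper itself.
    """
    if g not in (0, 1, 2):
        raise ValueError("genotype must be 0, 1, or 2 minor-allele copies")
    return {
        "null": [],                                  # no genotype effect at all
        "additive": [g],                             # effect scales with allele count
        "dominant": [int(g >= 1)],                   # one copy is enough for the effect
        "recessive": [int(g == 2)],                  # two copies are required
        "co-dominant": [int(g == 1), int(g == 2)],   # separate effect per genotype
    }


if __name__ == "__main__":
    for g in (0, 1, 2):
        print(g, encode_genotype(g))
```

Under this coding, a heterozygote (`g = 1`) looks like a carrier under the dominant model and like a non-carrier under the recessive model, which is why the choice of model affects how a follow-up validation study would be designed.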
Pages: 150-157
Number of pages: 8
Related papers
50 in total
  • [21] cgmisc: enhanced genome-wide association analyses and visualization
    Kierczak, Marcin
    Jablonska, Jagoda
    Forsberg, Simon K. G.
    Bianchi, Matteo
    Tengvall, Katarina
    Pettersson, Mats
    Scholz, Veronika
    Meadows, Jennifer R. S.
    Jern, Patric
    Carlborg, Orjan
    Lindblad-Toh, Kerstin
    BIOINFORMATICS, 2015, 31 (23) : 3830 - 3831
  • [22] Enrichment of statistical power for genome-wide association studies
    Li, Meng
    Liu, Xiaolei
    Bradbury, Peter
    Yu, Jianming
    Zhang, Yuan-Ming
    Todhunter, Rory J.
    Buckler, Edward S.
    Zhang, Zhiwu
    BMC BIOLOGY, 2014, 12
  • [23] Enrichment of statistical power for genome-wide association studies
    Meng Li
    Xiaolei Liu
    Peter Bradbury
    Jianming Yu
    Yuan-Ming Zhang
    Rory J Todhunter
    Edward S Buckler
    Zhiwu Zhang
    BMC Biology, 12
  • [24] Statistical genetic issues for genome-wide association studies
    Weir, Bruce S.
    GENOME, 2010, 53 (11) : 869 - 875
  • [25] Towards genome-wide marker assisted breeding: genome-wide association study and genomic selection
    Iwata, Hiroyashi
    GENES & GENETIC SYSTEMS, 2011, 86 (06) : 393 - 393
  • [26] Genome-wide association and genomic selection in animal breeding
    Hayes, Ben
    Goddard, Mike
    GENOME, 2010, 53 (11) : 876 - 883
  • [27] Bayesian Variable Selection with Genome-wide Association Studies
    Bangchang, Kannat Na
    LOBACHEVSKII JOURNAL OF MATHEMATICS, 2024, 45 (02) : 613 - 620
  • [28] Model Selection Strategies in Genome-Wide Association Studies
    Keildson, Sarah L.
    Farrall, Martin
    Morris, Andrew P.
    GENETIC EPIDEMIOLOGY, 2009, 33 (08) : 792 - 792
  • [29] A variable selection method for genome-wide association studies
    He, Qianchuan
    Lin, Dan-Yu
    BIOINFORMATICS, 2011, 27 (01) : 1 - 8
  • [30] Analysing biological pathways in genome-wide association studies
    Wang, Kai
    Li, Mingyao
    Hakonarson, Hakon
    NATURE REVIEWS GENETICS, 2010, 11 (12) : 843 - 854