Weighted SNP Set Analysis in Genome-Wide Association Study

Cited by: 5
Authors
Dai, Hui [1]
Zhao, Yang [1]
Qian, Cheng [1]
Cai, Min [1]
Zhang, Ruyang [1]
Chu, Minjie [1]
Dai, Juncheng [1]
Hu, Zhibin [1, 2, 3]
Shen, Hongbing [1, 2, 3]
Chen, Feng [1]
Affiliations
[1] Nanjing Med Univ, Sch Publ Hlth, Dept Epidemiol & Biostat, Nanjing, Jiangsu, Peoples R China
[2] Nanjing Med Univ, Ctr Canc, Jiangsu Key Lab Canc Biomarkers Prevent & Treatme, Clin Epidemiol Sect, Nanjing, Jiangsu, Peoples R China
[3] Nanjing Med Univ, State Key Lab Reprod Med, Nanjing, Jiangsu, Peoples R China
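Source
PLOS ONE | 2013 / Vol. 8 / Issue 09
Funding
Specialized Research Fund for the Doctoral Program of Higher Education; National Natural Science Foundation of China;
Keywords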
MULTIPLE SNPS; SUSCEPTIBILITY; DISEASE; GENE; TESTS; LOCI;
DOI
10.1371/journal.pone.0075897
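Chinese Library Classification (CLC) number
O [Mathematical Sciences and Chemistry]; P [Astronomy and Earth Sciences]; Q [Biological Sciences]; N [General Natural Sciences];
Discipline classification code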
07 ; 0710 ; 09 ;
Abstract
Genome-wide association studies (GWAS) are widely used to identify genetic variants associated with disease risk. To address the limitations of single-locus association analysis, many approaches have been proposed that test multiple single nucleotide polymorphisms (SNPs) in a region simultaneously. Kernel machine based SNP set analysis, which borrows information from SNPs correlated with causal or tag SNPs, is more powerful than single-locus analysis. Four types of kernel machine functions and the principal component based approach (PCA) have also been compared. However, given the loss of power caused by low minor allele frequencies (MAF), we extended PCA into a new method called weighted PCA (wPCA). We compared wPCA, the logistic kernel machine based test (LKM), and PCA on SNP sets under different MAFs and linkage disequilibrium (LD) structures. We also applied the three methods to two SNP sets extracted from a real GWAS dataset of non-small cell lung cancer in a Han Chinese population. Simulation results show that when the MAF of the causal SNP is low, wPCA and the weighted identity-by-state (IBS) kernel (wIBS) are more powerful than PCA and the other kernel machine functions across different LD structures and different numbers of causal SNPs. Application of the three methods to the real GWAS dataset indicates that wPCA and wIBS perform better than the linear kernel, the IBS kernel, and PCA.
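The abstract names the IBS and weighted IBS kernels without defining them. The following is a minimal sketch of how such kernels are commonly computed, not the authors' implementation. It assumes genotypes coded as minor-allele counts (0/1/2), a per-SNP IBS score of 2 - |g_i - g_j|, and weights of 1/sqrt(MAF) so that rare variants count more. The function names, the weight choice, and the toy data are all illustrative assumptions; the weighting scheme actually used for wPCA and wIBS in the paper is not stated in this record.

```python
from math import sqrt


def minor_allele_freqs(genotypes):
    """Estimate per-SNP MAF from per-subject genotype rows coded 0/1/2."""
    n_subjects = len(genotypes)
    n_snps = len(genotypes[0])
    freqs = []
    for s in range(n_snps):
        allele_freq = sum(row[s] for row in genotypes) / (2 * n_subjects)
        freqs.append(min(allele_freq, 1 - allele_freq))
    return freqs


def inverse_sqrt_maf_weights(freqs, floor=1e-6):
    """Assumed weighting: 1/sqrt(MAF), floored to avoid division by zero."""
    return [1 / sqrt(max(f, floor)) for f in freqs]


def ibs_kernel(genotypes, weights=None):
    """Pairwise (weighted) IBS similarity; unweighted when weights is None."""
    n_snps = len(genotypes[0])
    if weights is None:
        weights = [1.0] * n_snps
    normaliser = 2 * sum(weights)  # similarity of a subject with itself
    kernel = []
    for gi in genotypes:
        row = []
        for gj in genotypes:
            shared = sum(
                w * (2 - abs(a - b)) for w, a, b in zip(weights, gi, gj)
            )
            row.append(shared / normaliser)
        kernel.append(row)
    return kernel


# Toy data (hypothetical): 4 subjects x 3 SNPs; SNP 0 and SNP 2 are rare,
# SNP 1 is common.
genotypes = [
    [0, 1, 0],  # subject 0
    [1, 1, 0],  # differs from subject 0 only at rare SNP 0
    [0, 2, 0],  # differs from subject 0 only at common SNP 1
    [0, 0, 1],
]

maf = minor_allele_freqs(genotypes)
unweighted = ibs_kernel(genotypes)
weighted = ibs_kernel(genotypes, inverse_sqrt_maf_weights(maf))

print("MAF:", maf)
print("unweighted K[0][1], K[0][2]:", unweighted[0][1], unweighted[0][2])
print("weighted   K[0][1], K[0][2]:", weighted[0][1], weighted[0][2])
```

In this toy set, subjects 1 and 2 each differ from subject 0 by one allele, so the unweighted kernel scores both pairs the same. Under the assumed weights, the pair that differs at the rare SNP receives the lower similarity, which illustrates why weighting may help when the causal SNP has a low MAF.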
Pages: 7
Related papers
50 in total
  • [1] SNP Set Association Analysis for Genome-Wide Association Studies
    Cai, Min
    Dai, Hui
    Qiu, Yongyong
    Zhao, Yang
    Zhang, Ruyang
    Chu, Minjie
    Dai, Juncheng
    Hu, Zhibin
    Shen, Hongbing
    Chen, Feng
    PLOS ONE, 2013, 8 (05):
  • [2] An efficient weighted tag SNP-set analytical method in genome-wide association studies
    Bin Yan
    Shudong Wang
    Huaqian Jia
    Xing Liu
    Xinzeng Wang
    BMC Genetics, 16
  • [3] An efficient weighted tag SNP-set analytical method in genome-wide association studies
    Yan, Bin
    Wang, Shudong
    Jia, Huaqian
    Liu, Xing
    Wang, Xinzeng
    BMC GENETICS, 2015, 16
  • [4] Importance of SNP Dependency Correction and Association Integration for Gene Set Analysis in Genome-Wide Association Studies
    Marczyk, Michal
    Macioszek, Agnieszka
    Tobiasz, Joanna
    Polanska, Joanna
    Zyla, Joanna
    FRONTIERS IN GENETICS, 2021, 12
  • [5] Powerful SNP-Set Analysis for Case-Control Genome-wide Association Studies
    Wu, Michael C.
    Kraft, Peter
    Epstein, Michael P.
    Taylor, Deanne M.
    Chanock, Stephen J.
    Hunter, David J.
    Lin, Xihong
    AMERICAN JOURNAL OF HUMAN GENETICS, 2010, 86 (06) : 929 - 942
  • [6] Profiles of causative SNP in a genome-wide association study.
    Misztal, I.
    Pocrnic, I.
    Perez-Enciso, M.
    Lourenco, D. A. L.
    JOURNAL OF DAIRY SCIENCE, 2020, 103 : 114 - 115
  • [7] GSEA-SNP: applying gene set enrichment analysis to SNP data from genome-wide association studies
    Holden, Marit
    Deng, Shiwei
    Wojnowski, Leszek
    Kulle, Bettina
    BIOINFORMATICS, 2008, 24 (23) : 2784 - 2785
  • [8] Genome-wide association studies pipeline (GWASpi): a desktop application for genome-wide SNP analysis and management
    Muniz-Fernandez, Fernando
    Carreno-Torres, Angel
    Morcillo-Suarez, Carlos
    Navarro, Arcadi
    BIOINFORMATICS, 2011, 27 (13) : 1871 - 1872
  • [9] Gene Set analysis in Genome-wide Association Studies
    Tintle, Nathan L.
    GENETIC EPIDEMIOLOGY, 2009, 33 (08) : 805 - 806
  • [10] Multiple SNP Set Analysis for Genome-Wide Association Studies Through Bayesian Latent Variable Selection
    Lu, Zhao-Hua
    Zhu, Hongtu
    Knickmeyer, Rebecca C.
    Sullivan, Patrick F.
    Williams, Stephanie N.
    Zou, Fei
    GENETIC EPIDEMIOLOGY, 2015, 39 (08) : 664 - 677