Adaptive Elastic-Net Sparse Principal Component Analysis for Pathway Association Testing

被引:7
|
作者
Chen, Xi [1 ]
机构
[1] Vanderbilt Univ, Dept Biostat, Nashville, TN 37232 USA
关键词
gene expression; microarray; pathway analysis; sparse principal component analysis; GENE SET ENRICHMENT; SINGULAR-VALUE DECOMPOSITION; WIDE EXPRESSION DATA; MICROARRAY DATA; RECEPTOR; SELECTION; THERAPY; LASSO; GAMMA;
D O I
10.2202/1544-6115.1697
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Pathway or gene set analysis has become an increasingly popular approach for analyzing high-throughput biological experiments such as microarray gene expression studies. The purpose of pathway analysis is to identify differentially expressed pathways associated with outcomes. Important challenges in pathway analysis are selecting a subset of genes contributing most to association with clinical phenotypes and conducting statistical tests of association for the pathways efficiently. We propose a two-stage analysis strategy: (1) extract latent variables representing activities within each pathway using a dimension reduction approach based on adaptive elastic-net sparse principal component analysis; (2) integrate the latent variables with the regression modeling framework to analyze studies with different types of outcomes such as binary, continuous or survival outcomes. Our proposed approach is computationally efficient. For each pathway, because the latent variables are estimated in an unsupervised fashion without using disease outcome information, in the sample label permutation testing procedure, the latent variables only need to be calculated once rather than for each permutation resample. Using both simulated and real datasets, we show our approach performed favorably when compared with five other currently available pathway testing methods.
引用
收藏
页数:22
相关论文
共 50 条
  • [41] Principal Component Analysis With Sparse Fused Loadings
    Guo, Jian
    James, Gareth
    Levina, Elizaveta
    Michailidis, George
    Zhu, Ji
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2010, 19 (04) : 930 - 946
  • [42] Multilevel sparse functional principal component analysis
    Di, Chongzhi
    Crainiceanu, Ciprian M.
    Jank, Wolfgang S.
    STAT, 2014, 3 (01): : 126 - 143
  • [43] Sparse principal component analysis by choice of norm
    Qi, Xin
    Luo, Ruiyan
    Zhao, Hongyu
    JOURNAL OF MULTIVARIATE ANALYSIS, 2013, 114 : 127 - 160
  • [44] Optimal solutions for sparse principal component analysis
    d'Aspremont, Alexandre
    Bach, Francis
    El Ghaoui, Laurent
    JOURNAL OF MACHINE LEARNING RESEARCH, 2008, 9 : 1269 - 1294
  • [45] Approximation bounds for sparse principal component analysis
    d'Aspremont, Alexandre
    Bach, Francis
    El Ghaoui, Laurent
    MATHEMATICAL PROGRAMMING, 2014, 148 (1-2) : 89 - 110
  • [46] Exactly Uncorrelated Sparse Principal Component Analysis
    Kwon, Oh-Ran
    Lu, Zhaosong
    Zou, Hui
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2024, 33 (01) : 231 - 241
  • [47] Stochastic convex sparse principal component analysis
    Baytas, Inci M.
    Lin, Kaixiang
    Wang, Fei
    Jain, Anil K.
    Zhou, Jiayu
    EURASIP JOURNAL ON BIOINFORMATICS AND SYSTEMS BIOLOGY, 2016, (01)
  • [48] Sparse principal component analysis with measurement errors
    Shi, Jianhong
    Song, Weixing
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2016, 175 : 87 - 99
  • [49] Sparse exponential family Principal Component Analysis
    Lu, Meng
    Huang, Jianhua Z.
    Qian, Xiaoning
    PATTERN RECOGNITION, 2016, 60 : 681 - 691
  • [50] Sparse Principal Component Analysis in Hilbert Space
    Qi, Xin
    Luo, Ruiyan
    SCANDINAVIAN JOURNAL OF STATISTICS, 2015, 42 (01) : 270 - 289