NETWORK-REGULARIZED HIGH-DIMENSIONAL COX REGRESSION FOR ANALYSIS OF GENOMIC DATA

被引:55
|
作者
Sun, Hokeun [1 ]
Lin, Wei [2 ]
Feng, Rui [2 ]
Li, Hongzhe [2 ]
机构
[1] Pusan Natl Univ, Dept Stat, Pusan 609735, South Korea
[2] Univ Penn, Perelman Sch Med, Dept Biostat & Epidemiol, Philadelphia, PA 19104 USA
关键词
Laplacian penalty; network analysis; regularization; sparsity; survival data; variable selection; weak oracle property; PROPORTIONAL HAZARDS MODEL; NONCONCAVE PENALIZED LIKELIHOOD; VARIABLE SELECTION; DANTZIG SELECTOR; ADAPTIVE LASSO; EXPRESSION; METASTASIS; SHRINKAGE;
D O I
10.5705/ss.2012.317
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider estimation and variable selection in high-dimensional Cox regression when a prior knowledge of the relationships among the covariates, described by a network or graph, is available. A limitation of the existing methodology for survival analysis with high-dimensional genomic data is that a wealth of structural information about many biological processes, such as regulatory networks and pathways, has often been ignored. In order to incorporate such prior network information into the analysis of genomic data, we propose a network-based regularization method for high-dimensional Cox regression; it uses an l(1)-penalty to induce sparsity of the regression coefficients and a quadratic Laplacian penalty to encourage smoothness between the coefficients of neighboring variables on a given network. The proposed method is implemented by an efficient coordinate descent algorithm. In the setting where the dimensionality p can grow exponentially fast with the sample size n, we establish model selection consistency and estimation bounds for the proposed estimators. The theoretical results provide insights into the gain from taking into account the network structural information. Extensive simulation studies indicate that our method outperforms Lasso and elastic net in terms of variable selection accuracy and stability. We apply our method to a breast cancer gene expression study and identify several biologically plausible subnetworks and pathways that are associated with breast cancer distant metastasis.
引用
收藏
页码:1433 / 1459
页数:27
相关论文
共 50 条
  • [21] Factor Analysis Regression for Predictive Modeling with High-Dimensional Data
    Carter, Randy
    Michael, Netsanet
    JOURNAL OF QUANTITATIVE ECONOMICS, 2022, 20 (SUPPL 1) : 115 - 132
  • [22] High-Dimensional Heteroscedastic Regression with an Application to eQTL Data Analysis
    Daye, Z. John
    Chen, Jinbo
    Li, Hongzhe
    BIOMETRICS, 2012, 68 (01) : 316 - 326
  • [23] Factor Analysis Regression for Predictive Modeling with High-Dimensional Data
    Randy Carter
    Netsanet Michael
    Journal of Quantitative Economics, 2022, 20 : 115 - 132
  • [24] Vanishing deviance problem in high-dimensional penalized Cox regression
    Yao, Sijie
    Li, Tingyi
    Cao, Biwei
    Wang, Xuefeng
    CANCER RESEARCH, 2023, 83 (07)
  • [25] GD-RDA: A New Regularized Discriminant Analysis for High-Dimensional Data
    Zhou, Yan
    Zhang, Baoxue
    Li, Gaorong
    Tong, Tiejun
    Wan, Xiang
    JOURNAL OF COMPUTATIONAL BIOLOGY, 2017, 24 (11) : 1099 - 1111
  • [26] COMPRESSIVE REGULARIZED DISCRIMINANT ANALYSIS OF HIGH-DIMENSIONAL DATA WITH APPLICATIONS TO MICROARRAY STUDIES
    Tabassum, Muhammad Naveed
    Ollila, Esa
    2018 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2018, : 4204 - 4208
  • [27] ONE-STEP REGULARIZED ESTIMATORFOR HIGH-DIMENSIONAL REGRESSION MODELS
    Wang, Yi
    Zeng, Donglin
    Wang, Yuanjia
    Tong, Xingwei
    STATISTICA SINICA, 2024, 34 (04) : 2089 - 2113
  • [28] Prognostic scoring system for osteosarcoma using network- regularized high-dimensional Cox-regression analysis and potential therapeutic targets (vol 234, pg 13851, 2019)
    Goh, T. S.
    Lee, J. S.
    Kim II, J.
    Park, Y. G.
    Pak, K.
    Jeong, D. C.
    Oh, S. O.
    Kim, Y. H.
    JOURNAL OF CELLULAR PHYSIOLOGY, 2023, 238 (07) : 1622 - 1622
  • [29] Individual Data Protected Integrative Regression Analysis of High-Dimensional Heterogeneous Data
    Cai, Tianxi
    Liu, Molei
    Xia, Yin
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (540) : 2105 - 2119
  • [30] Unconditional quantile regression with high-dimensional data
    Sasaki, Yuya
    Ura, Takuya
    Zhang, Yichong
    QUANTITATIVE ECONOMICS, 2022, 13 (03) : 955 - 978