High-Dimensional Variable Selection With Competing Events Using Cooperative Penalized Regression

被引:0
|
作者
Burk, Lukas [1 ,2 ,3 ,4 ]
Bender, Andreas [2 ,4 ]
Wright, Marvin N. [1 ,2 ,5 ]
机构
[1] Leibniz Inst Prevent Res & Epidemiol BIPS, Bremen, Germany
[2] Ludwig Maximilians Univ Munchen, Dept Stat, Munich, Germany
[3] Univ Bremen, Fac Math & Comp Sci, Bremen, Germany
[4] Munich Ctr Machine Learning MCML, Munich, Germany
[5] Univ Copenhagen, Dept Publ Hlth, Copenhagen, Denmark
关键词
competing risks; Cox regression; high-dimensional data analysis; penalized regression; variable selection; REGULARIZATION PATHS; FORESTS;
D O I
10.1002/bimj.70036
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Variable selection is an important step in the analysis of high-dimensional data, yet there are limited options for survival outcomes in the presence of competing risks. Commonly employed penalized Cox regression considers each event type separately through cause-specific models, neglecting possibly shared information between them. We adapt the feature-weighted elastic net (fwelnet), an elastic net generalization, to survival outcomes and competing risks. For two causes, our proposed algorithm fits two alternating cause-specific models, where each model receives the coefficient vector of the complementary model as prior information. We dub this "cooperative penalized regression," as it enables the modeling of competing risk data with cause-specific models while accounting for shared effects between causes. Coefficients that are shrunken toward zero in the model for the first cause will receive larger penalization weights in the model for the second cause and vice versa. Through multiple iterations, this process ensures stronger penalization of uninformative predictors in both models. We demonstrate our method's variable selection capabilities on simulated genomics data and apply it to bladder cancer microarray data. We evaluate selection performance using the positive predictive value for the correct selection of informative features and the false positive rate for the selection of uninformative variables. The benchmark compares results with cause-specific penalized Cox regression, random survival forests, and likelihood-boosted Cox regression. Results indicate that our approach is more effective at selecting informative features and removing uninformative features. In settings without shared effects, variable selection performance is similar to cause-specific penalized Cox regression.
引用
收藏
页数:12
相关论文
共 50 条
  • [41] High-Dimensional Censored Regression via the Penalized Tobit Likelihood
    Jacobson, Tate
    Zou, Hui
    JOURNAL OF BUSINESS & ECONOMIC STATISTICS, 2024, 42 (01) : 286 - 297
  • [42] Orthogonality-projection-based penalized variable selection for high-dimensional partially linear models
    Yang, Yiping
    Zhao, Peixin
    Zhang, Jun
    APPLIED MATHEMATICAL MODELLING, 2025, 138
  • [43] High-Dimensional Variable Selection for Quantile Regression Based on Variational Bayesian Method
    Dai, Dengluan
    Tang, Anmin
    Ye, Jinli
    MATHEMATICS, 2023, 11 (10)
  • [44] Variable selection for high-dimensional regression models with time series and heteroscedastic errors
    Chiou, Hai-Tang
    Guo, Meihui
    Ing, Ching-Kang
    JOURNAL OF ECONOMETRICS, 2020, 216 (01) : 118 - 136
  • [45] Variable selection and identification of high-dimensional nonparametric nonlinear systems by directional regression
    B. Sun
    Q. Y. Cai
    Z. K. Peng
    C. M. Cheng
    F. Wang
    H. Z. Zhang
    Nonlinear Dynamics, 2023, 111 : 12101 - 12112
  • [46] An Additive Sparse Penalty for Variable Selection in High-Dimensional Linear Regression Model
    Lee, Sangin
    COMMUNICATIONS FOR STATISTICAL APPLICATIONS AND METHODS, 2015, 22 (02) : 147 - 157
  • [47] Bayesian variable selection and model averaging in high-dimensional multinomial nonparametric regression
    Yau, P
    Kohn, R
    Wood, S
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2003, 12 (01) : 23 - 54
  • [48] Simultaneous variable selection and smoothing for high-dimensional function-on-scalar regression
    Parodi, Alice
    Reimherr, Matthew
    ELECTRONIC JOURNAL OF STATISTICS, 2018, 12 (02): : 4602 - 4639
  • [49] MCEN: a method of simultaneous variable selection and clustering for high-dimensional multinomial regression
    Ren, Sheng
    Kang, Emily L.
    Lu, Jason L.
    STATISTICS AND COMPUTING, 2020, 30 (02) : 291 - 304
  • [50] Correlation-adjusted regression survival scores for high-dimensional variable selection
    Welchowski, Thomas
    Zuber, Verena
    Schmid, Matthias
    STATISTICS IN MEDICINE, 2019, 38 (13) : 2413 - 2427