A Variable Selection Method for High-Dimensional Survival Data

被引:1
|
作者
Giordano, Francesco [1 ]
Milito, Sara [1 ]
Restaino, Marialuisa [1 ]
机构
[1] Univ Salerno, Via Giovanni Paolo II 132, I-84084 Salerno, Italy
关键词
Variable selection; High-dimension; Survival data;
D O I
10.1007/978-3-030-99638-3_49
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
Survival data with high-dimensional predictors are regularly collected in many studies. Models with a very large number of covariates are both infeasible to fit and likely to incur low predictability due to overfitting. The selection of significant variables plays a crucial role in estimating models. Even if several approaches that identify variables in presence of censored data are available in literature, there is not unanimous consensus on which method outperforms the others. Nonetheless, it is possible to exploit the advantages of methods to get the final set of covariates as good as possible. Therefore, we propose a method that combines different variable selection procedures by using the subsampling technique, for identifying as relevant those covariates that are selected most frequently by the different variable selectors on subsampled data. By a simulation study, we evaluate the performance of the proposed procedure and compare it with other techniques.
引用
收藏
页码:303 / 308
页数:6
相关论文
共 50 条
  • [11] Variable selection and subgroup analysis for high-dimensional censored data
    Zhang, Yu
    Wang, Jiangli
    Zhang, Weiping
    STATISTICAL THEORY AND RELATED FIELDS, 2024, 8 (03) : 211 - 231
  • [12] Scalable Bayesian variable selection for structured high-dimensional data
    Chang, Changgee
    Kundu, Suprateek
    Long, Qi
    BIOMETRICS, 2018, 74 (04) : 1372 - 1382
  • [13] High-dimensional variable selection in regression and classification with missing data
    Gao, Qi
    Lee, Thomas C. M.
    SIGNAL PROCESSING, 2017, 131 : 1 - 7
  • [14] RANKING-BASED VARIABLE SELECTION FOR HIGH-DIMENSIONAL DATA
    Baranowski, Rafal
    Chen, Yining
    Fryzlewicz, Piotr
    STATISTICA SINICA, 2020, 30 (03) : 1485 - 1516
  • [15] Bayesian variable selection in clustering high-dimensional data with substructure
    Michael D. Swartz
    Qianxing Mo
    Mary E. Murphy
    Joanne R. Lupton
    Nancy D. Turner
    Mee Young Hong
    Marina Vannucci
    Journal of Agricultural, Biological, and Environmental Statistics, 2008, 13 : 407 - 423
  • [16] Sparse Bayesian variable selection for classifying high-dimensional data
    Yang, Aijun
    Lian, Heng
    Jiang, Xuejun
    Liu, Pengfei
    STATISTICS AND ITS INTERFACE, 2018, 11 (02) : 385 - 395
  • [17] A Robust Supervised Variable Selection for Noisy High-Dimensional Data
    Kalina, Jan
    Schlenker, Anna
    BIOMED RESEARCH INTERNATIONAL, 2015, 2015
  • [18] Estimation and variable selection for high-dimensional spatial data models
    Hou, Li
    Jin, Baisuo
    Wu, Yuehua
    JOURNAL OF ECONOMETRICS, 2024, 238 (02)
  • [19] Variable selection for longitudinal data with high-dimensional covariates and dropouts
    Zheng, Xueying
    Fu, Bo
    Zhang, Jiajia
    Qin, Guoyou
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2018, 88 (04) : 712 - 725
  • [20] Stochastic variational variable selection for high-dimensional microbiome data
    Tung Dang
    Kie Kumaishi
    Erika Usui
    Shungo Kobori
    Takumi Sato
    Yusuke Toda
    Yuji Yamasaki
    Hisashi Tsujimoto
    Yasunori Ichihashi
    Hiroyoshi Iwata
    Microbiome, 10