A Variable Selection Method for High-Dimensional Survival Data

被引:1
|
作者
Giordano, Francesco [1 ]
Milito, Sara [1 ]
Restaino, Marialuisa [1 ]
机构
[1] Univ Salerno, Via Giovanni Paolo II 132, I-84084 Salerno, Italy
关键词
Variable selection; High-dimension; Survival data;
D O I
10.1007/978-3-030-99638-3_49
中图分类号
F8 [财政、金融];
学科分类号
0202 ;
摘要
Survival data with high-dimensional predictors are regularly collected in many studies. Models with a very large number of covariates are both infeasible to fit and likely to incur low predictability due to overfitting. The selection of significant variables plays a crucial role in estimating models. Even if several approaches that identify variables in presence of censored data are available in literature, there is not unanimous consensus on which method outperforms the others. Nonetheless, it is possible to exploit the advantages of methods to get the final set of covariates as good as possible. Therefore, we propose a method that combines different variable selection procedures by using the subsampling technique, for identifying as relevant those covariates that are selected most frequently by the different variable selectors on subsampled data. By a simulation study, we evaluate the performance of the proposed procedure and compare it with other techniques.
引用
收藏
页码:303 / 308
页数:6
相关论文
共 50 条
  • [21] A robust variable screening method for high-dimensional data
    Wang, Tao
    Zheng, Lin
    Li, Zhonghua
    Liu, Haiyang
    JOURNAL OF APPLIED STATISTICS, 2017, 44 (10) : 1839 - 1855
  • [22] A hybrid feature selection method for high-dimensional data
    Taheri, Nooshin
    Nezamabadi-pour, Hossein
    2014 4TH INTERNATIONAL CONFERENCE ON COMPUTER AND KNOWLEDGE ENGINEERING (ICCKE), 2014, : 141 - 145
  • [23] A class comparison method with filtering-enhanced variable selection for high-dimensional data sets
    Lusa, Lara
    Korn, Edward L.
    McShane, Lisa M.
    STATISTICS IN MEDICINE, 2008, 27 (28) : 5834 - 5849
  • [24] Stable Variable Selection for High-Dimensional Genomic Data with Strong Correlations
    Sarkar R.
    Manage S.
    Gao X.
    Annals of Data Science, 2024, 11 (04) : 1139 - 1164
  • [25] Variable selection techniques after multiple imputation in high-dimensional data
    Zahid, Faisal Maqbool
    Faisal, Shahla
    Heumann, Christian
    STATISTICAL METHODS AND APPLICATIONS, 2020, 29 (03): : 553 - 580
  • [26] PUlasso: High-Dimensional Variable Selection With Presence-Only Data
    Song, Hyebin
    Raskutti, Garvesh
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (529) : 334 - 347
  • [27] Variable Selection in High-Dimensional Partially Linear Models with Longitudinal Data
    Yang Yiping
    Xue Liugen
    RECENT ADVANCE IN STATISTICS APPLICATION AND RELATED AREAS, VOLS I AND II, 2009, : 661 - 667
  • [28] Variable selection via combined penalization for high-dimensional data analysis
    Wang, Xiaoming
    Park, Taesung
    Carriere, K. C.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2010, 54 (10) : 2230 - 2243
  • [29] PALLADIO: a parallel framework for robust variable selection in high-dimensional data
    Barbieri, Matteo
    Fiorini, Samuele
    Tomasi, Federico
    Barla, Annalisa
    PROCEEDINGS OF PYHPC2016: 6TH WORKSHOP ON PYTHON FOR HIGH-PERFORMANCE AND SCIENTIFIC COMPUTING, 2016, : 19 - 26
  • [30] Variable selection techniques after multiple imputation in high-dimensional data
    Faisal Maqbool Zahid
    Shahla Faisal
    Christian Heumann
    Statistical Methods & Applications, 2020, 29 : 553 - 580