A Variable Selection Method for High-Dimensional Survival Data

Cited by: 1

Authors:

Giordano, Francesco [1]

Milito, Sara [1]

Restaino, Marialuisa [1]

Affiliations:

[1] Univ Salerno, Via Giovanni Paolo II 132, I-84084 Salerno, Italy

Source:

MATHEMATICAL AND STATISTICAL METHODS FOR ACTUARIAL SCIENCES AND FINANCE, MAF 2022 | 2022

Keywords:

Variable selection; High-dimension; Survival data

DOI:

10.1007/978-3-030-99638-3_49

Chinese Library Classification (CLC) number:

F8 [Public Finance, Finance]

Discipline classification code:

0202

Abstract:

Survival data with high-dimensional predictors are routinely collected in many studies. Models with a very large number of covariates are both infeasible to fit and likely to predict poorly because of overfitting, so the selection of significant variables plays a crucial role in model estimation. Although several approaches for identifying variables in the presence of censored data are available in the literature, there is no consensus on which method outperforms the others. Nonetheless, the strengths of the individual methods can be exploited together to obtain the best possible final set of covariates. We therefore propose a method that combines different variable selection procedures through subsampling: the covariates identified as relevant are those selected most frequently by the different variable selectors on the subsampled data. In a simulation study, we evaluate the performance of the proposed procedure and compare it with other techniques.
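The abstract gives only the general idea of the procedure, so the following Python sketch is an illustration under stated assumptions and not the authors' implementation. It draws subsamples without replacement, runs every selector on each subsample, and keeps the covariates whose pooled selection frequency reaches a cutoff. The function names, the record layout `(time, event, covariates)`, the pooled frequency rule, the `threshold` parameter, and the correlation-based toy selector are all hypothetical; in practice the selectors would be survival-specific methods such as penalized Cox regression.

```python
import random
from collections import Counter


def aggregate_selectors(data, selectors, n_subsamples=50, fraction=0.5,
                        threshold=0.6, seed=0):
    """Combine several variable selectors through subsampling.

    data      : list of records (time, event, covariates)  [assumed layout]
    selectors : callables mapping a list of records to a set of column indices
    Returns (chosen, freq): the indices whose pooled selection frequency is at
    least `threshold` (a hypothetical cutoff), and the frequencies themselves.
    """
    rng = random.Random(seed)
    size = max(1, int(fraction * len(data)))
    counts = Counter()
    runs = 0
    for _ in range(n_subsamples):
        subsample = rng.sample(data, size)  # subsample without replacement
        for select in selectors:
            counts.update(set(select(subsample)))
            runs += 1
    # Frequency pooled over all (subsample, selector) runs: an assumption.
    freq = {j: c / runs for j, c in counts.items()}
    chosen = sorted(j for j, f in freq.items() if f >= threshold)
    return chosen, freq


def pearson(u, v):
    """Pearson correlation of two equal-length lists (0.0 if degenerate)."""
    n = len(u)
    mu, mv = sum(u) / n, sum(v) / n
    cov = sum((a - mu) * (b - mv) for a, b in zip(u, v))
    su = sum((a - mu) ** 2 for a in u) ** 0.5
    sv = sum((b - mv) ** 2 for b in v) ** 0.5
    return cov / (su * sv) if su > 0 and sv > 0 else 0.0


def top_k_by_correlation(k):
    """Toy stand-in selector, NOT from the paper: keep the k covariates most
    correlated (in absolute value) with time among uncensored records."""
    def select(records):
        events = [(t, x) for t, d, x in records if d == 1]
        if len(events) < 3:
            return set()
        times = [t for t, _ in events]
        n_cov = len(events[0][1])
        scores = [(abs(pearson(times, [x[j] for _, x in events])), j)
                  for j in range(n_cov)]
        scores.sort(reverse=True)
        return {j for _, j in scores[:k]}
    return select


if __name__ == "__main__":
    # Simulated censored data: only covariates 0 and 1 affect the hazard.
    rng = random.Random(1)
    data = []
    for _ in range(200):
        x = [rng.gauss(0.0, 1.0) for _ in range(10)]
        t = rng.expovariate(2.718281828 ** (x[0] - x[1]))  # event time
        c = rng.expovariate(0.3)                            # censoring time
        data.append((min(t, c), int(t <= c), x))
    chosen, freq = aggregate_selectors(
        data, [top_k_by_correlation(2), top_k_by_correlation(4)])
    print("selected covariates:", chosen)
```

Pooling the counts over all selectors treats every method equally. Weighting the selectors, or requiring a minimum frequency per selector, would be equally plausible readings of the abstract.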


Pages: 303-308

Page count: 6
