HIGH-DIMENSIONAL VARIABLE SELECTION WITH RIGHT-CENSORED LENGTH-BIASED DATA

被引:2
|
作者
Di He [1 ,2 ]
Zhou, Yong [3 ]
Zou, Hui [4 ]
机构
[1] Shanghai Univ Finance & Econ, Sch Stat & Management, Shanghai 200433, Peoples R China
[2] Nanjing Univ, Sch Econ, Nanjing 210046, Peoples R China
[3] East China Normal Univ, Acad Stat & Interdisciplinary Sci, Shanghai 200062, Peoples R China
[4] Univ Minnesota, Sch Stat, Minneapolis, MN 55455 USA
基金
中国国家自然科学基金;
关键词
Accelerated failure time model; high-dimensional variable selection; length-biased data; multi-stage penalization; NONCONCAVE PENALIZED LIKELIHOOD; SEMIPARAMETRIC TRANSFORMATION MODELS; NONPARAMETRIC-ESTIMATION; EMPIRICAL DISTRIBUTIONS; QUANTILE REGRESSION; PREVALENT COHORT; ADAPTIVE LASSO; SURVIVAL;
D O I
10.5705/ss.202018.0089
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Length-biased data are common in various fields, including epidemiology and labor economics, and they have attracted considerable attention in survival literature. A crucial goal of a survival analysis is to identify a subset of risk factors and their risk contributions from among a vast number of clinical covariates. However, there is no research on variable selection for length-biased data, owing to the complex nature of such data and the lack of a convenient loss function. Therefore, we propose an estimation method based on penalized estimating equations to obtain a sparse and consistent estimator for length-biased data under an accelerated failure time model. The proposed estimator possesses the selection and estimation consistency property. In particular, we implement our method using a SCAD penalty and a local linear approximation algorithm. We suggest selecting the tuning parameter using the extended BIC in high-dimensional settings. Furthermore, we develop a novel multistage SCAD penalized estimating equation procedure to achieve improved estimation accuracy and sparsity in the variable selection. Simulation studies show that the proposed procedure has high accuracy and almost perfect sparsity. Oscar Awards data are analyzed as an application of the proposed method.
引用
收藏
页码:193 / 215
页数:23
相关论文
共 50 条
  • [21] Monotone rank estimation of transformation models with length-biased and right-censored data
    Chen XiaoPing
    Shi JianHua
    Zhou Yong
    SCIENCE CHINA-MATHEMATICS, 2015, 58 (10) : 2055 - 2068
  • [22] The survival function NPMLE for combined right-censored and length-biased right-censored failure time data: properties and applications
    McVittie, James H.
    Wolfson, David B.
    Stephens, David A.
    INTERNATIONAL JOURNAL OF BIOSTATISTICS, 2024, 20 (02): : 531 - 551
  • [23] Monotone rank estimation of transformation models with length-biased and right-censored data
    CHEN XiaoPing
    SHI JianHua
    ZHOU Yong
    Science China(Mathematics), 2015, 58 (10) : 2055 - 2068
  • [24] Buckley-James-Type Estimator with Right-Censored and Length-Biased Data
    Ning, Jing
    Qin, Jing
    Shen, Yu
    BIOMETRICS, 2011, 67 (04) : 1369 - 1378
  • [25] Semiparametric quantile-difference estimation for length-biased and right-censored data
    Yutao Liu
    Shucong Zhang
    Yong Zhou
    ScienceChina(Mathematics), 2019, 62 (09) : 1823 - 1838
  • [26] A general quantile residual life model for length-biased right-censored data
    Bai, Fangfang
    Chen, Xuerong
    Chen, Yan
    Huang, Tao
    SCANDINAVIAN JOURNAL OF STATISTICS, 2019, 46 (04) : 1191 - 1205
  • [27] Semiparametric quantile-difference estimation for length-biased and right-censored data
    Liu, Yutao
    Zhang, Shucong
    Zhou, Yong
    SCIENCE CHINA-MATHEMATICS, 2019, 62 (09) : 1823 - 1838
  • [28] Semiparametric varying-coefficient model with right-censored and length-biased data
    Lin, Cunjie
    Zhou, Yong
    JOURNAL OF MULTIVARIATE ANALYSIS, 2016, 152 : 119 - 144
  • [29] Acceleration of Expectation-Maximization algorithm for length-biased right-censored data
    Kwun Chuen Gary Chan
    Lifetime Data Analysis, 2017, 23 : 102 - 112
  • [30] Regression Analysis of Length-biased and Right-censored Failure Time Data with Missing Covariates
    Hu, Na
    Chen, Xuerong
    Sun, Jianguo
    SCANDINAVIAN JOURNAL OF STATISTICS, 2015, 42 (02) : 438 - 452