Conditional screening for ultra-high dimensional covariates with survival outcomes

被引:0
|
作者
Hyokyoung G. Hong
Jian Kang
Yi Li
机构
[1] Michigan State University,
[2] University of Michigan,undefined
来源
Lifetime Data Analysis | 2018年 / 24卷
关键词
Conditional screening; Cox model; Diffuse large B-cell lymphoma; High-dimensional variable screening;
D O I
暂无
中图分类号
学科分类号
摘要
Identifying important biomarkers that are predictive for cancer patients’ prognosis is key in gaining better insights into the biological influences on the disease and has become a critical component of precision medicine. The emergence of large-scale biomedical survival studies, which typically involve excessive number of biomarkers, has brought high demand in designing efficient screening tools for selecting predictive biomarkers. The vast amount of biomarkers defies any existing variable selection methods via regularization. The recently developed variable screening methods, though powerful in many practical setting, fail to incorporate prior information on the importance of each biomarker and are less powerful in detecting marginally weak while jointly important signals. We propose a new conditional screening method for survival outcome data by computing the marginal contribution of each biomarker given priorily known biological information. This is based on the premise that some biomarkers are known to be associated with disease outcomes a priori. Our method possesses sure screening properties and a vanishing false selection rate. The utility of the proposal is further confirmed with extensive simulation studies and analysis of a diffuse large B-cell lymphoma dataset. We are pleased to dedicate this work to Jack Kalbfleisch, who has made instrumental contributions to the development of modern methods of analyzing survival data.
引用
收藏
页码:45 / 71
页数:26
相关论文
共 50 条
  • [31] Quantile-adaptive variable screening in ultra-high dimensional varying coefficient models
    Zhang, Junying
    Zhang, Riquan
    Lu, Zhiping
    JOURNAL OF APPLIED STATISTICS, 2016, 43 (04) : 643 - 654
  • [32] Sequential Feature Screening for Generalized Linear Models with Sparse Ultra-High Dimensional Data
    Junying Zhang
    Hang Wang
    Riquan Zhang
    Jiajia Zhang
    Journal of Systems Science and Complexity, 2020, 33 : 510 - 526
  • [33] Model Based Screening Embedded Bayesian Variable Selection for Ultra-high Dimensional Settings
    Li, Dongjin
    Dutta, Somak
    Roy, Vivekananda
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2023, 32 (01) : 61 - 73
  • [34] Model-free feature screening for ultra-high dimensional competing risks data
    Chen, Xiaolin
    Zhang, Yahui
    Liu, Yi
    Chen, Xiaojing
    STATISTICS & PROBABILITY LETTERS, 2020, 164
  • [35] Category-Adaptive Variable Screening for Ultra-High Dimensional Heterogeneous Categorical Data
    Xie, Jinhan
    Lin, Yuanyuan
    Yan, Xiaodong
    Tang, Niansheng
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (530) : 747 - 760
  • [36] Sequential Feature Screening for Generalized Linear Models with Sparse Ultra-High Dimensional Data
    ZHANG Junying
    WANG Hang
    ZHANG Riquan
    ZHANG Jiajia
    Journal of Systems Science & Complexity, 2020, 33 (02) : 510 - 526
  • [37] Sequential Feature Screening for Generalized Linear Models with Sparse Ultra-High Dimensional Data
    Zhang, Junying
    Wang, Hang
    Zhang, Riquan
    Zhang, Jiajia
    JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2020, 33 (02) : 510 - 526
  • [38] PRIOR KNOWLEDGE GUIDED ULTRA-HIGH DIMENSIONAL VARIABLE SCREENING WITH APPLICATION TO NEUROIMAGING DATA
    He, Jie
    Kang, Jian
    STATISTICA SINICA, 2022, 32 : 2095 - 2117
  • [39] A sure independence screening procedure for ultra-high dimensional partially linear additive models
    Kazemi, M.
    Shahsavani, D.
    Arashi, M.
    JOURNAL OF APPLIED STATISTICS, 2019, 46 (08) : 1385 - 1403
  • [40] Robust feature screening for ultra-high dimensional right censored data via distance correlation
    Chen, Xiaolin
    Chen, Xiaojing
    Wang, Hong
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2018, 119 : 118 - 138