ROBUST ELASTIC NET ESTIMATORS FOR VARIABLE SELECTION AND IDENTIFICATION OF PROTEOMIC BIOMARKERS

被引:22
|
作者
Freue, Gabriela V. Cohen [1 ]
Kepplinger, David [1 ]
Salibian-Barrera, Matias [1 ]
Smucler, Ezequiel [2 ]
机构
[1] Univ British Columbia, Dept Stat, 3182-2207 Main Mall, Vancouver, BC V6T 1Z4, Canada
[2] Univ Torcuato Ditella, Dept Math & Stat, Ave Figueroa Alcorta 7350, RA-1428 Buenos Aires, DF, Argentina
来源
ANNALS OF APPLIED STATISTICS | 2019年 / 13卷 / 04期
基金
加拿大自然科学与工程研究理事会;
关键词
Robust estimation; regularized estimation; penalized estimation; elastic net penalty; proteomics biomarkers; HIGH BREAKDOWN-POINT; DIVERGING NUMBER; REGRESSION; REGULARIZATION; ALGORITHM; LASSO;
D O I
10.1214/19-AOAS1269
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In large-scale quantitative proteomic studies, scientists measure the abundance of thousands of proteins from the human proteome in search of novel biomarkers for a given disease. Penalized regression estimators can be used to identify potential biomarkers among a large set of molecular features measured. Yet, the performance and statistical properties of these estimators depend on the loss and penalty functions used to define them. Motivated by a real plasma proteomic biomarkers study, we propose a new class of penalized robust estimators based on the elastic net penalty, which can be tuned to keep groups of correlated variables together in the selected model and maintain robustness against possible outliers. We also propose an efficient algorithm to compute our robust penalized estimators and derive a data-driven method to select the penalty term. Our robust penalized estimators have very good robustness properties and are also consistent under certain regularity conditions. Numerical results show that our robust estimators compare favorably to other robust penalized estimators. Using our proposed methodology for the analysis of the proteomics data, we identify new potentially relevant biomarkers of cardiac allograft vasculopathy that are not found with nonrobust alternatives. The selected model is validated in a new set of 52 test samples and achieves an area under the receiver operating characteristic (AUC) of 0.85.
引用
收藏
页码:2065 / 2090
页数:26
相关论文
共 50 条
  • [21] Grouping Variable Selection by Weight Fused Elastic Net for Multi-Collinear Data
    Fu, Guang-Hui
    Xu, Qing-Song
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2012, 41 (02) : 205 - 221
  • [22] Bayesian Elastic Net variable selection and application for spatial quantile panel autoregressive model
    Yu, Zhuoxi
    Yao, Xue
    Liu, Haiyun
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2025,
  • [23] Adaptive elastic-net selection in a quantile model with diverging number of variable groups
    Ciuperca, Gabriela
    STATISTICS, 2020, 54 (05) : 1147 - 1170
  • [24] Variable selection and regularization via arbitrary rectangle-range generalized elastic net
    Ding, Yujia
    Peng, Qidi
    Song, Zhengming
    Chen, Hansen
    STATISTICS AND COMPUTING, 2023, 33 (03)
  • [25] Robust variable selection and parametric component identification in varying coefficient models
    Yang, Hu
    Lv, Jing
    Guo, Chaohui
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2016, 45 (18) : 5533 - 5549
  • [26] Robust model selection criteria for robust S and LTS estimators
    Cetin, Meral
    HACETTEPE JOURNAL OF MATHEMATICS AND STATISTICS, 2016, 45 (01): : 153 - 164
  • [27] Proteomic identification of biomarkers of vascular injury
    Huang, Ngan F.
    Kurpinski, Kyle
    Fang, Qizhi
    Lee, Randall J.
    Li, Song
    AMERICAN JOURNAL OF TRANSLATIONAL RESEARCH, 2011, 3 (02): : 139 - 148
  • [28] BOOTSTRAP SELECTION PROCEDURES BASED ON ROBUST ESTIMATORS
    SWANEPOEL, JWH
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1983, 12 (18) : 2059 - 2083
  • [29] A robust strategy for proteomic identification of biomarkers of invasive phenotype complexed with extracellular heat shock proteins
    Steven G. Griffiths
    Alan Ezrin
    Emily Jackson
    Lisa Dewey
    Alan A. Doucette
    Cell Stress and Chaperones, 2019, 24 : 1197 - 1209
  • [30] A robust strategy for proteomic identification of biomarkers of invasive phenotype complexed with extracellular heat shock proteins
    Griffiths, Steven G.
    Ezrin, Alan
    Jackson, Emily
    Dewey, Lisa
    Doucette, Alan A.
    CELL STRESS & CHAPERONES, 2019, 24 (06): : 1197 - 1209