ROBUST ELASTIC NET ESTIMATORS FOR VARIABLE SELECTION AND IDENTIFICATION OF PROTEOMIC BIOMARKERS

被引:22
|
作者
Freue, Gabriela V. Cohen [1 ]
Kepplinger, David [1 ]
Salibian-Barrera, Matias [1 ]
Smucler, Ezequiel [2 ]
机构
[1] Univ British Columbia, Dept Stat, 3182-2207 Main Mall, Vancouver, BC V6T 1Z4, Canada
[2] Univ Torcuato Ditella, Dept Math & Stat, Ave Figueroa Alcorta 7350, RA-1428 Buenos Aires, DF, Argentina
来源
ANNALS OF APPLIED STATISTICS | 2019年 / 13卷 / 04期
基金
加拿大自然科学与工程研究理事会;
关键词
Robust estimation; regularized estimation; penalized estimation; elastic net penalty; proteomics biomarkers; HIGH BREAKDOWN-POINT; DIVERGING NUMBER; REGRESSION; REGULARIZATION; ALGORITHM; LASSO;
D O I
10.1214/19-AOAS1269
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In large-scale quantitative proteomic studies, scientists measure the abundance of thousands of proteins from the human proteome in search of novel biomarkers for a given disease. Penalized regression estimators can be used to identify potential biomarkers among a large set of molecular features measured. Yet, the performance and statistical properties of these estimators depend on the loss and penalty functions used to define them. Motivated by a real plasma proteomic biomarkers study, we propose a new class of penalized robust estimators based on the elastic net penalty, which can be tuned to keep groups of correlated variables together in the selected model and maintain robustness against possible outliers. We also propose an efficient algorithm to compute our robust penalized estimators and derive a data-driven method to select the penalty term. Our robust penalized estimators have very good robustness properties and are also consistent under certain regularity conditions. Numerical results show that our robust estimators compare favorably to other robust penalized estimators. Using our proposed methodology for the analysis of the proteomics data, we identify new potentially relevant biomarkers of cardiac allograft vasculopathy that are not found with nonrobust alternatives. The selected model is validated in a new set of 52 test samples and achieves an area under the receiver operating characteristic (AUC) of 0.85.
引用
收藏
页码:2065 / 2090
页数:26
相关论文
共 50 条
  • [41] Robust Elastic-Net Subspace Representation
    Kim, Eunwoo
    Lee, Minsik
    Oh, Songhwai
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 25 (09) : 4245 - 4259
  • [42] Proteomic identification of biomarkers for glomerular diseases.
    Thongboonkerd, V
    Klein, JB
    McLeish, KR
    JOURNAL OF THE AMERICAN SOCIETY OF NEPHROLOGY, 2002, 13 : 120A - 120A
  • [43] Proteomic identification of biomarkers of skeletal muscle disorders
    Ohlendieck, Kay
    BIOMARKERS IN MEDICINE, 2013, 7 (01) : 169 - 186
  • [44] Proteomic identification of plasma biomarkers in uterine leiomyoma
    Lin, Chao-Po
    Chen, Yi-Wen
    Liu, Wen-Hsin
    Chou, Hsiu-Chuan
    Chang, Yi-Ping
    Lin, Szu-Ting
    Li, Ji-Min
    Jian, Shiou-Fen
    Lee, Ying-Ray
    Chan, Hong-Lin
    MOLECULAR BIOSYSTEMS, 2012, 8 (04) : 1136 - 1145
  • [45] Proteomic identification of biomarkers of traumatic brain injury
    Wang, KKW
    Ottens, AK
    Liu, MC
    Lewis, SB
    Meegan, C
    Oli, M
    Tortella, FC
    Hayes, RL
    EXPERT REVIEW OF PROTEOMICS, 2005, 2 (04) : 603 - 614
  • [46] Proteomic approach for the identification of potential biomarkers in schizophrenia
    Yamada, Shinnosuke
    Nagai, Taku
    Yoshimi, Akira
    Ohashi, Mitsuki
    Ito, Yoshihito
    Noda, Yukihiro
    Ozaki, Norio
    JOURNAL OF PHARMACOLOGICAL SCIENCES, 2010, 112 : 63P - 63P
  • [47] Proteomic identification of urinary biomarkers of diabetic nephropathy
    Rao, Paturi V.
    Lu, Xinfang
    Standley, Melissa
    Pattee, Patrick
    Neelima, Gundupalle
    Girisesh, Gudige
    Dakshinamurthy, K. V.
    Roberts, Charles T., Jr.
    Nagalla, Srinivasa R.
    DIABETES CARE, 2007, 30 (03) : 629 - 637
  • [48] Proteomic identification of neural stem cell biomarkers
    Maltman, Daniel J.
    Przyborski, Stefan A.
    JOURNAL OF ANATOMY, 2008, 212 (01) : 89 - 89
  • [49] Correction to: A robust strategy for proteomic identification of biomarkers of invasive phenotype complexed with extracellular heat shock proteins
    Steven G. Griffiths
    Alan Ezrin
    Emily Jackson
    Lisa Dewey
    Alan A. Doucette
    Cell Stress and Chaperones, 2021, 26 : 453 - 453
  • [50] Elastic variable selection approach for calibration
    Giglio, Cannon
    Brown, Steven
    ABSTRACTS OF PAPERS OF THE AMERICAN CHEMICAL SOCIETY, 2016, 252