Robust and consistent variable selection in high-dimensional generalized linear models

Cited by: 22
Authors
Avella-Medina, Marco [1]
Ronchetti, Elvezio [2]
Affiliations
[1] MIT, Sloan Sch Management, 30 Mem Dr, Cambridge, MA 02142 USA
[2] Univ Geneva, Res Ctr Stat, Blvd Pont Arve 40, CH-1205 Geneva, Switzerland
Funding
Swiss National Science Foundation;
Keywords
Contamination neighbourhood; Generalized linear model; Infinitesimal robustness; Lasso; Oracle estimator; Robust quasilikelihood; NONCONCAVE PENALIZED LIKELIHOOD; REGRESSION SHRINKAGE; CONFIDENCE-INTERVALS; ADAPTIVE LASSO; INFERENCE; ESTIMATORS; REGULARIZATION;
DOI
10.1093/biomet/asx070
Chinese Library Classification (CLC) number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Generalized linear models are popular for modelling a large variety of data. We consider variable selection through penalized methods by focusing on resistance issues in the presence of outlying data and other deviations from assumptions. We highlight the weaknesses of widely-used penalized M-estimators, propose a robust penalized quasilikelihood estimator, and show that it enjoys oracle properties in high dimensions and is stable in a neighbourhood of the model. We illustrate its finite-sample performance on simulated and real data.
Pages: 31 - 44
Number of pages: 14
Related papers
50 in total
  • [31] Variable selection in high-dimensional partially linear additive models for composite quantile regression
    Guo, Jie
    Tang, Manlai
    Tian, Maozai
    Zhu, Kai
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2013, 65 : 56 - 67
  • [32] High-dimensional robust inference for censored linear models
    Huang, Jiayu
    Wu, Yuanshan
    SCIENCE CHINA-MATHEMATICS, 2024, 67 (04) : 891 - 918
  • [33] An Improved Forward Regression Variable Selection Algorithm for High-Dimensional Linear Regression Models
    Xie, Yanxi
    Li, Yuewen
    Xia, Zhijie
    Yan, Ruixia
    IEEE ACCESS, 2020, 8 (08) : 129032 - 129042
  • [34] Double penalized variable selection for high-dimensional partial linear mixed effects models
    Yang, Yiping
    Luo, Chuanqin
    Yang, Weiming
    JOURNAL OF MULTIVARIATE ANALYSIS, 2024, 204
  • [35] High-dimensional robust inference for censored linear models
    Jiayu Huang
    Yuanshan Wu
    Science China (Mathematics), 2024, 67 (04) : 891 - 918
  • [36] Partial profile score feature selection in high-dimensional generalized linear interaction models
    Xu, Zengchao
    Luo, Shan
    Chen, Zehua
    STATISTICS AND ITS INTERFACE, 2022, 15 (04) : 433 - 447
  • [37] A Robust Supervised Variable Selection for Noisy High-Dimensional Data
    Kalina, Jan
    Schlenker, Anna
    BIOMED RESEARCH INTERNATIONAL, 2015, 2015
  • [38] Robust Information Criterion for Model Selection in Sparse High-Dimensional Linear Regression Models
    Gohain, Prakash Borpatra
    Jansson, Magnus
    IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2023, 71 : 2251 - 2266
  • [39] A consistent variable selection method in high-dimensional canonical discriminant analysis
    Oda, Ryoya
    Suzuki, Yuya
    Yanagihara, Hirokazu
    Fujikoshi, Yasunori
    JOURNAL OF MULTIVARIATE ANALYSIS, 2020, 175
  • [40] Consistent Variable Selection for High-dimensional Nonparametric Additive Nonlinear Systems
    Mu, Biqiang
    Zheng, Wei Xing
    Bai, Er-Wei
    2016 IEEE 55TH CONFERENCE ON DECISION AND CONTROL (CDC), 2016 : 3066 - 3071