Robust and consistent variable selection in high-dimensional generalized linear models

被引:22
|
作者
Avella-Medina, Marco [1 ]
Ronchetti, Elvezio [2 ]
机构
[1] MIT, Sloan Sch Management, 30 Mem Dr, Cambridge, MA 02142 USA
[2] Univ Geneva, Res Ctr Stat, Blvd Pont Arve 40, CH-1205 Geneva, Switzerland
基金
瑞士国家科学基金会;
关键词
Contamination neighbourhood; Generalized linear model; Infinitesimal robustness; Lasso; Oracle estimator; Robust quasilikelihood; NONCONCAVE PENALIZED LIKELIHOOD; REGRESSION SHRINKAGE; CONFIDENCE-INTERVALS; ADAPTIVE LASSO; INFERENCE; ESTIMATORS; REGULARIZATION;
D O I
10.1093/biomet/asx070
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Generalized linear models are popular for modelling a large variety of data. We consider variable selection through penalized methods by focusing on resistance issues in the presence of outlying data and other deviations from assumptions. We highlight the weaknesses of widely-used penalized M-estimators, propose a robust penalized quasilikelihood estimator, and show that it enjoys oracle properties in high dimensions and is stable in a neighbourhood of the model. We illustrate its finite-sample performance on simulated and real data.
引用
收藏
页码:31 / 44
页数:14
相关论文
共 50 条
  • [41] NONPENALIZED VARIABLE SELECTION IN HIGH-DIMENSIONAL LINEAR MODEL SETTINGS VIA GENERALIZED FIDUCIAL INFERENCE
    Williams, Jonathan P.
    Hannig, Jan
    ANNALS OF STATISTICS, 2019, 47 (03): : 1723 - 1753
  • [42] Robust transfer learning of high-dimensional generalized linear model
    Sun, Fei
    Zhang, Qi
    PHYSICA A-STATISTICAL MECHANICS AND ITS APPLICATIONS, 2023, 618
  • [43] Cluster feature selection in high-dimensional linear models
    Lin, Bingqing
    Pang, Zhen
    Wang, Qihua
    RANDOM MATRICES-THEORY AND APPLICATIONS, 2018, 7 (01)
  • [44] HIGH-DIMENSIONAL VARIABLE SELECTION
    Wasserman, Larry
    Roeder, Kathryn
    ANNALS OF STATISTICS, 2009, 37 (5A): : 2178 - 2201
  • [45] Robust adaptive variable selection in ultra-high dimensional linear regression models
    Ghosh, Abhik
    Jaenada, Maria
    Pardo, Leandro
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2024, 94 (03) : 571 - 603
  • [46] Variable selection and estimation for high-dimensional spatial autoregressive models
    Cai, Liqian
    Maiti, Tapabrata
    SCANDINAVIAN JOURNAL OF STATISTICS, 2020, 47 (02) : 587 - 607
  • [47] Variable selection in high-dimensional quantile varying coefficient models
    Tang, Yanlin
    Song, Xinyuan
    Wang, Huixia Judy
    Zhu, Zhongyi
    JOURNAL OF MULTIVARIATE ANALYSIS, 2013, 122 : 115 - 132
  • [48] Robust variable selection for generalized linear models with a diverging number of parameters
    Guo, Chaohui
    Yang, Hu
    Lv, Jing
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (06) : 2967 - 2981
  • [49] An efficient and robust variable selection method for longitudinal generalized linear models
    Lv, Jing
    Yang, Hu
    Guo, Chaohui
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2015, 82 : 74 - 88
  • [50] FACTOR MODELS AND VARIABLE SELECTION IN HIGH-DIMENSIONAL REGRESSION ANALYSIS
    Kneip, Alois
    Sarda, Pascal
    ANNALS OF STATISTICS, 2011, 39 (05): : 2410 - 2447