Robust Ridge Regression for High-Dimensional Data

被引:70
|
作者
Maronna, Ricardo A. [1 ,2 ]
机构
[1] Univ La Plata, Dept Math, RA-1900 La Plata, Argentina
[2] CICPBA, Buenos Aires, DF, Argentina
关键词
MM estimate; S estimate; Shrinking; SELECTION;
D O I
10.1198/TECH.2010.09114
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Ridge regression, being based on the minimization of a quadratic loss function, is sensitive to outliers. Current proposals for robust ridge-regression estimators are sensitive to "bad leverage observations," cannot be employed when the number of predictors p is larger than the number of observations n, and have a low robustness when the ratio pin is large. In this article a ridge-regression estimate based on repeated M estimation ("MM estimation") is proposed. It is a penalized regression MM estimator, in which the quadratic loss is replaced by an average of rho(r(i)/(sigma) over cap), where r(i) are the residuals and (sigma) over cap the residual scale from an initial estimator, which is a penalized S estimator; and rho is a bounded function. The MM estimator can be computed for p > n and is robust for large p/n. A fast algorithm is proposed. The advantages of the proposed approach over its competitors are demonstrated through both simulated and real data. Supplemental materials are available online.
引用
收藏
页码:44 / 53
页数:10
相关论文
共 50 条
  • [31] Uniform Consistency of Cross-Validation Estimators for High-Dimensional Ridge Regression
    Patil, Pratik
    Wei, Yuting
    Rinaldo, Alessandro
    Tibshirani, Ryan J.
    24TH INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS (AISTATS), 2021, 130
  • [32] HDBRR: a statistical package for high-dimensional Bayesian ridge regression without MCMC
    Perez-Elizalde, Sergio
    Monroy-Castillo, Blanca E.
    Perez-Rodriguez, Paulino
    Crossa, Jose
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2022, 92 (17) : 3679 - 3705
  • [33] g.ridge: An R Package for Generalized Ridge Regression for Sparse and High-Dimensional Linear Models
    Emura, Takeshi
    Matsumoto, Koutarou
    Uozumi, Ryuji
    Michimae, Hirofumi
    SYMMETRY-BASEL, 2024, 16 (02):
  • [34] On Coupling Robust Estimation with Regularization for High-Dimensional Data
    Kalina, Jan
    Hlinka, Jaroslav
    DATA SCIENCE: INNOVATIVE DEVELOPMENTS IN DATA ANALYSIS AND CLUSTERING, 2017, : 15 - 27
  • [35] A robust variable screening method for high-dimensional data
    Wang, Tao
    Zheng, Lin
    Li, Zhonghua
    Liu, Haiyang
    JOURNAL OF APPLIED STATISTICS, 2017, 44 (10) : 1839 - 1855
  • [36] Software Tools for Robust Analysis of High-Dimensional Data
    Todorov, Valentin
    Filzmoser, Peter
    AUSTRIAN JOURNAL OF STATISTICS, 2014, 43 (04) : 255 - 266
  • [37] Robust analysis of cancer heterogeneity for high-dimensional data
    Cheng, Chao
    Feng, Xingdong
    Li, Xiaoguang
    Wu, Mengyun
    STATISTICS IN MEDICINE, 2022, 41 (27) : 5448 - 5462
  • [38] Feature Selection for High-Dimensional Data Based on Ridge Regression and SVM and Its Application in Peptide QSAR Modeling
    Wang Zhi-Ming
    Han Na
    Yuan Zhe-Ming
    Wu Zhao-Hua
    ACTA PHYSICO-CHIMICA SINICA, 2013, 29 (03) : 498 - 507
  • [39] Robust regularized cluster analysis for high-dimensional data
    Kalina, Jan
    Vlckova, Katarina
    MATHEMATICAL METHODS IN ECONOMICS (MME 2014), 2014, : 378 - 383
  • [40] ON ROBUST INFORMATION EXTRACTION FROM HIGH-DIMENSIONAL DATA
    Kalina, Jan
    SERBIAN JOURNAL OF MANAGEMENT, 2014, 9 (01) : 131 - 144