Robust Ridge Regression for High-Dimensional Data

被引:70
|
作者
Maronna, Ricardo A. [1 ,2 ]
机构
[1] Univ La Plata, Dept Math, RA-1900 La Plata, Argentina
[2] CICPBA, Buenos Aires, DF, Argentina
关键词
MM estimate; S estimate; Shrinking; SELECTION;
D O I
10.1198/TECH.2010.09114
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Ridge regression, being based on the minimization of a quadratic loss function, is sensitive to outliers. Current proposals for robust ridge-regression estimators are sensitive to "bad leverage observations," cannot be employed when the number of predictors p is larger than the number of observations n, and have a low robustness when the ratio pin is large. In this article a ridge-regression estimate based on repeated M estimation ("MM estimation") is proposed. It is a penalized regression MM estimator, in which the quadratic loss is replaced by an average of rho(r(i)/(sigma) over cap), where r(i) are the residuals and (sigma) over cap the residual scale from an initial estimator, which is a penalized S estimator; and rho is a bounded function. The MM estimator can be computed for p > n and is robust for large p/n. A fast algorithm is proposed. The advantages of the proposed approach over its competitors are demonstrated through both simulated and real data. Supplemental materials are available online.
引用
收藏
页码:44 / 53
页数:10
相关论文
共 50 条
  • [41] Robust feature screening for high-dimensional survival data
    Hao, Meiling
    Lin, Yuanyuan
    Liu, Xianhui
    Tang, Wenlu
    JOURNAL OF APPLIED STATISTICS, 2019, 46 (06) : 979 - 994
  • [42] A Tuning-free Robust and Efficient Approach to High-dimensional Regression
    Wang, Lan
    Peng, Bo
    Bradic, Jelena
    Li, Runze
    Wu, Yunan
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (532) : 1700 - 1714
  • [43] Heterogeneous robust estimation with the mixed penalty in high-dimensional regression model
    Zhu, Yanling
    Wang, Kai
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2024, 53 (08) : 2730 - 2743
  • [44] Robust and sparse estimation methods for high-dimensional linear and logistic regression
    Kurnaz, Fatma Sevinc
    Hoffmann, Irene
    Filzmoser, Peter
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2018, 172 : 211 - 222
  • [45] COMPUTATIONALLY EFFICIENT AND STATISTICALLY OPTIMAL ROBUST HIGH-DIMENSIONAL LINEAR REGRESSION
    Shen, Yinan
    Li, Jingyang
    Cai, Jian-feng
    Xia, Dong
    ANNALS OF STATISTICS, 2025, 53 (01): : 374 - 399
  • [46] Robust Variable Selection with Optimality Guarantees for High-Dimensional Logistic Regression
    Insolia, Luca
    Kenney, Ana
    Calovi, Martina
    Chiaromonte, Francesca
    STATS, 2021, 4 (03): : 665 - 681
  • [47] Ridge estimation of inverse covariance matrices from high-dimensional data
    van Wieringen, Wessel N.
    Peeters, Carel F. W.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2016, 103 : 284 - 303
  • [48] Regression on High-dimensional Inputs
    Kuleshov, Alexander
    Bernstein, Alexander
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 732 - 739
  • [49] On inference in high-dimensional regression
    Battey, Heather S.
    Reid, Nancy
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2023, 85 (01) : 149 - 175
  • [50] Inverse Matrix Problem in Regression for High-Dimensional Data Sets
    Shakeel, Namra
    Mehmood, Tahir
    Mathematical Problems in Engineering, 2023, 2023