Robust Ridge Regression for High-Dimensional Data

被引:70
|
作者
Maronna, Ricardo A. [1 ,2 ]
机构
[1] Univ La Plata, Dept Math, RA-1900 La Plata, Argentina
[2] CICPBA, Buenos Aires, DF, Argentina
关键词
MM estimate; S estimate; Shrinking; SELECTION;
D O I
10.1198/TECH.2010.09114
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Ridge regression, being based on the minimization of a quadratic loss function, is sensitive to outliers. Current proposals for robust ridge-regression estimators are sensitive to "bad leverage observations," cannot be employed when the number of predictors p is larger than the number of observations n, and have a low robustness when the ratio pin is large. In this article a ridge-regression estimate based on repeated M estimation ("MM estimation") is proposed. It is a penalized regression MM estimator, in which the quadratic loss is replaced by an average of rho(r(i)/(sigma) over cap), where r(i) are the residuals and (sigma) over cap the residual scale from an initial estimator, which is a penalized S estimator; and rho is a bounded function. The MM estimator can be computed for p > n and is robust for large p/n. A fast algorithm is proposed. The advantages of the proposed approach over its competitors are demonstrated through both simulated and real data. Supplemental materials are available online.
引用
收藏
页码:44 / 53
页数:10
相关论文
共 50 条
  • [21] Combination of Nonconvex Penalties and Ridge Regression for High-Dimensional Linear Models
    Xiuli WANG
    Mingqiu WANG
    JournalofMathematicalResearchwithApplications, 2014, 34 (06) : 743 - 753
  • [22] A Bayesian approach with generalized ridge estimation for high-dimensional regression and testing
    Yang, Szu-Peng
    Emura, Takeshi
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2017, 46 (08) : 6083 - 6105
  • [23] Fast Robust Correlation for High-Dimensional Data
    Raymaekers, Jakob
    Rousseeuw, Peter J.
    TECHNOMETRICS, 2021, 63 (02) : 184 - 198
  • [24] Discriminative Ridge Machine: A Classifier for High-Dimensional Data or Imbalanced Data
    Peng, Chong
    Cheng, Qiang
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (06) : 2595 - 2609
  • [25] Robust transfer learning for high-dimensional regression with linear constraints
    Chen, Xuan
    Song, Yunquan
    Wang, Yuanfeng
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2024, 94 (11) : 2462 - 2482
  • [26] High-dimensional robust regression with Lq-loss functions
    Wang, Yibo
    Karunamuni, Rohana J.
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2022, 176
  • [27] High-dimensional robust regression with Lq-loss functions
    Wang, Yibo
    Karunamuni, Rohana J.
    Computational Statistics and Data Analysis, 2022, 176
  • [28] Quantile forward regression for high-dimensional survival data
    Lee, Eun Ryung
    Park, Seyoung
    Lee, Sang Kyu
    Hong, Hyokyoung G.
    LIFETIME DATA ANALYSIS, 2023, 29 (04) : 769 - 806
  • [29] Quantile forward regression for high-dimensional survival data
    Eun Ryung Lee
    Seyoung Park
    Sang Kyu Lee
    Hyokyoung G. Hong
    Lifetime Data Analysis, 2023, 29 : 769 - 806
  • [30] Adaptive Bayesian density regression for high-dimensional data
    Shen, Weining
    Ghosal, Subhashis
    BERNOULLI, 2016, 22 (01) : 396 - 420