Enmsp: an elastic-net multi-step screening procedure for high-dimensional regression

被引:0
|
作者
Xue, Yushan [1 ]
Ren, Jie [2 ]
Yang, Bin [3 ]
机构
[1] Cent Univ Finance & Econ, Sch Stat & Math, Beijing, Peoples R China
[2] HollySys Grp Co Ltd, Beijing, Peoples R China
[3] Res Ctr Int Inspection & Quarantine Stand & Tech R, Beijing, Peoples R China
基金
中国国家自然科学基金;
关键词
High-dimensional data; Correlated effects; Elastic-net; Iterative algorithm; EnMSP; NONCONCAVE PENALIZED LIKELIHOOD; VARIABLE SELECTION; LASSO; OPTIMALITY;
D O I
10.1007/s11222-024-10394-9
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
To improve the estimation efficiency of high-dimensional regression problems, penalized regularization is routinely used. However, accurately estimating the model remains challenging, particularly in the presence of correlated effects, wherein irrelevant covariates exhibit strong correlation with relevant ones. This situation, referred to as correlated data, poses additional complexities for model estimation. In this paper, we propose the elastic-net multi-step screening procedure (EnMSP), an iterative algorithm designed to recover sparse linear models in the context of correlated data. EnMSP uses a small repeated penalty strategy to identify truly relevant covariates in a few iterations. Specifically, in each iteration, EnMSP enhances the adaptive lasso method by adding a weighted l2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$l_2$$\end{document} penalty, which improves the selection of relevant covariates. The method is shown to select the true model and achieve the l2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$l_2$$\end{document}-norm error bound under certain conditions. The effectiveness of EnMSP is demonstrated through numerical comparisons and applications in financial data.
引用
收藏
页数:16
相关论文
共 50 条
  • [21] Interaction screening in high-dimensional multi-response regression via projected distance correlation
    Liu, Lili
    Lin, Lu
    Liu, Lei
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2024,
  • [22] Variable screening in multivariate linear regression with high-dimensional covariates
    Bizuayehu, Shiferaw B.
    Li, Lu
    Xu, Jin
    STATISTICAL THEORY AND RELATED FIELDS, 2022, 6 (03) : 241 - 253
  • [23] ONE-STEP REGULARIZED ESTIMATORFOR HIGH-DIMENSIONAL REGRESSION MODELS
    Wang, Yi
    Zeng, Donglin
    Wang, Yuanjia
    Tong, Xingwei
    STATISTICA SINICA, 2024, 34 (04) : 2089 - 2113
  • [24] Elastic net-based high dimensional data selection for regression
    Chamlal, Hasna
    Benzmane, Asmaa
    Ouaderhman, Tayeb
    EXPERT SYSTEMS WITH APPLICATIONS, 2024, 244
  • [25] Variable selection in high-dimensional regression: a nonparametric procedure for business failure prediction
    Amendola, Alessandra
    Giordano, Francesco
    Parrella, Maria Lucia
    Restaino, Marialuisa
    APPLIED STOCHASTIC MODELS IN BUSINESS AND INDUSTRY, 2017, 33 (04) : 355 - 368
  • [26] High Dimensional Logistic Regression Model using Adjusted Elastic Net Penalty
    Algamal, Zakariya Yahya
    Lee, Muhammad Hisyam
    PAKISTAN JOURNAL OF STATISTICS AND OPERATION RESEARCH, 2015, 11 (04) : 667 - 676
  • [27] An Orthogonal Matching Pursuit Variable Screening Algorithm for High-Dimensional Linear Regression Models
    Xie, Yanxi
    Li, Yuewen
    Shi, Victor
    Lu, Quan
    SCIENTIFIC PROGRAMMING, 2022, 2022
  • [28] Sparsity-promoting elastic net method with rotations for high-dimensional nonlinear inverse problem
    Wang, Yuepeng
    Ren, Lanlan
    Zhang, Zongyuan
    Lin, Guang
    Xu, Chao
    COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING, 2019, 345 : 263 - 282
  • [29] INTERACTION PURSUIT IN HIGH-DIMENSIONAL MULTI-RESPONSE REGRESSION VIA DISTANCE CORRELATION
    Kong, Yinfei
    Li, Daoji
    Fan, Yingying
    Lv, Jinchi
    ANNALS OF STATISTICS, 2017, 45 (02): : 897 - 922
  • [30] DOLDA: a regularized supervised topic model for high-dimensional multi-class regression
    Magnusson, Mans
    Jonsson, Leif
    Villani, Mattias
    COMPUTATIONAL STATISTICS, 2020, 35 (01) : 175 - 201