EnMSP: an elastic-net multi-step screening procedure for high-dimensional regression

Cited by: 0
Authors
Xue, Yushan [1]
Ren, Jie [2]
Yang, Bin [3]
Affiliations
[1] Cent Univ Finance & Econ, Sch Stat & Math, Beijing, Peoples R China
[2] HollySys Grp Co Ltd, Beijing, Peoples R China
[3] Res Ctr Int Inspection & Quarantine Stand & Tech R, Beijing, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
High-dimensional data; Correlated effects; Elastic-net; Iterative algorithm; EnMSP; NONCONCAVE PENALIZED LIKELIHOOD; VARIABLE SELECTION; LASSO; OPTIMALITY;
DOI
10.1007/s11222-024-10394-9
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
To improve the estimation efficiency of high-dimensional regression problems, penalized regularization is routinely used. However, accurately estimating the model remains challenging, particularly in the presence of correlated effects, wherein irrelevant covariates exhibit strong correlation with relevant ones. This situation, referred to as correlated data, poses additional complexities for model estimation. In this paper, we propose the elastic-net multi-step screening procedure (EnMSP), an iterative algorithm designed to recover sparse linear models in the context of correlated data. EnMSP uses a small repeated penalty strategy to identify truly relevant covariates in a few iterations. Specifically, in each iteration, EnMSP enhances the adaptive lasso method by adding a weighted $l_2$ penalty, which improves the selection of relevant covariates. The method is shown to select the true model and achieve the $l_2$-norm error bound under certain conditions. The effectiveness of EnMSP is demonstrated through numerical comparisons and applications in financial data.
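The iteration described in the abstract (an adaptive-lasso step with an added weighted $l_2$ penalty, repeated a few times with a small penalty) can be illustrated with a minimal sketch. This is not the authors' implementation: the objective form, the weight rule (the reciprocal of the previous estimate's magnitude, with zeroed covariates screened out permanently), and the names `weighted_enet`, `multi_step_enet`, `lam1`, `lam2`, `w`, and `v` are all assumptions made for illustration. Each step minimizes (1/(2n))·||y − Xb||² + lam1·Σ w_j·|b_j| + (lam2/2)·Σ v_j·b_j² by coordinate descent.

```python
import math


def soft_threshold(z, t):
    """Soft-thresholding operator: sign(z) * max(|z| - t, 0)."""
    if z > t:
        return z - t
    if z < -t:
        return z + t
    return 0.0


def weighted_enet(X, y, w, v, lam1, lam2, n_sweeps=200):
    """Coordinate descent for a weighted elastic-net objective (assumed form):
    (1/(2n))*||y - Xb||^2 + lam1*sum(w_j*|b_j|) + (lam2/2)*sum(v_j*b_j^2).
    An infinite w_j keeps b_j fixed at zero (covariate screened out)."""
    n, p = len(X), len(X[0])
    beta = [0.0] * p
    resid = list(y)  # residual y - X @ beta, with beta = 0 initially
    col_sq = [sum(X[i][j] ** 2 for i in range(n)) / n for j in range(p)]
    for _ in range(n_sweeps):
        for j in range(p):
            denom = col_sq[j] + lam2 * v[j]
            if math.isinf(w[j]) or denom == 0.0:
                continue
            # Correlation of column j with the partial residual.
            rho = sum(X[i][j] * resid[i] for i in range(n)) / n + col_sq[j] * beta[j]
            new = soft_threshold(rho, lam1 * w[j]) / denom
            delta = new - beta[j]
            if delta != 0.0:
                for i in range(n):
                    resid[i] -= X[i][j] * delta
                beta[j] = new
    return beta


def multi_step_enet(X, y, lam1, lam2, n_steps=3, eps=1e-8):
    """Repeat the weighted elastic-net fit, reweighting after each step.
    Weight rule is an assumption: w_j = v_j = 1/|b_j| for surviving
    covariates, and w_j = inf for covariates estimated as zero."""
    p = len(X[0])
    w = [1.0] * p
    v = [1.0] * p
    beta = [0.0] * p
    for _ in range(n_steps):
        beta = weighted_enet(X, y, w, v, lam1, lam2)
        w = [1.0 / abs(b) if abs(b) > eps else math.inf for b in beta]
        v = [1.0 / abs(b) if abs(b) > eps else 1.0 for b in beta]
    return beta
```

Calling `multi_step_enet(X, y, lam1, lam2)` with `X` as a list of rows and `y` as a list of responses returns the coefficient list after the final step; covariates dropped in an earlier step stay at zero under this assumed weight rule, which is one simple way to mimic a screening behaviour.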
Pages: 16
Related papers
50 in total
  • [31] DOLDA: a regularized supervised topic model for high-dimensional multi-class regression
    Måns Magnusson
    Leif Jonsson
    Mattias Villani
    Computational Statistics, 2020, 35 : 175 - 201
  • [32] A multi-step procedure for enriching limited two-dimensional acoustic far-field pattern measurements
    Barucq, Helene
    Bekkey, Chokri
    Djellouli, Rabia
    JOURNAL OF INVERSE AND ILL-POSED PROBLEMS, 2010, 18 (02): : 189 - 216
  • [33] Fast Cross-validation for Multi-penalty High-dimensional Ridge Regression
    van de Wiel, Mark A.
    van Nee, Mirrelijn M.
    Rauschenberger, Armin
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2021, 30 (04) : 835 - 847
  • [34] Maximal cliques-based hybrid high-dimensional feature selection with interaction screening for regression
    Chamlal, Hasna
    Benzmane, Asmaa
    Ouaderhman, Tayeb
    NEUROCOMPUTING, 2024, 607
  • [35] Development of a multi-step screening procedure for redox active molecules in organic radical polymer anodes and as redox flow anolytes
    Achazi, Andreas J.
    Fataj, Xhesilda
    Rohland, Philip
    Hager, Martin D.
    Schubert, Ulrich S.
    Mollenhauer, Doreen
    JOURNAL OF COMPUTATIONAL CHEMISTRY, 2024, 45 (14) : 1112 - 1129
  • [36] Handling High-Dimensional Regression Problems by Means of an Efficient Multi-Objective Evolutionary Algorithm
    Jose Gacto, Maria
    Alcala, Rafael
    Herrera, Francisco
    2009 9TH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, 2009, : 109 - +
  • [37] High-Dimensional Multi-Task Learning using Multivariate Regression and Generalized Fiducial Inference
    Wei, Zhenyu
    Lee, Thomas C. M.
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2023, 32 (01) : 226 - 240
  • [38] A sequential stepwise screening procedure for sparse recovery in high-dimensional multiresponse models with complex group structures
    Liang, Weixiong
    Yang, Yuehan
    STATISTICS AND ITS INTERFACE, 2025, 18 (03) : 349 - 359
  • [39] Regularized logistic regression with adjusted adaptive elastic net for gene selection in high dimensional cancer classification
    Algamal, Zakariya Yahya
    Lee, Muhammad Hisyam
    COMPUTERS IN BIOLOGY AND MEDICINE, 2015, 67 : 136 - 145
  • [40] CLASSIFICATION OF HIGH-DIMENSIONAL MICROARRAY DATA WITH A TWO-STEP PROCEDURE VIA A WILCOXON CRITERION AND MULTILAYER PERCEPTRON
    Nikulin, Vladimir
    Huang, Tian-Hsiang
    Mclachlan, Geoffrey
    INTERNATIONAL JOURNAL OF COMPUTATIONAL INTELLIGENCE AND APPLICATIONS, 2011, 10 (01) : 1 - 14