HIGH-DIMENSIONAL LINEAR REGRESSION WITH HARD THRESHOLDING REGULARIZATION: THEORY AND ALGORITHM

被引:0
|
作者
Kang, Lican [1 ,2 ]
Lai, Yanming [1 ]
Liu, Yanyan [1 ]
Luo, Yuan [1 ]
Zhang, Jing [3 ]
机构
[1] Wuhan Univ, Sch Math & Stat, Wuhan 430072, Hubei, Peoples R China
[2] Duke NUS Med Sch, Ctr Quantitat Med, Singapore 169857, Singapore
[3] Zhongnan Univ Econ & Law, Sch Math & Stat, Wuhan 430073, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
  Generalized Newton method; hard thresholding regularization; high-dimensional; linear regression model; primal dual active set algorithm; NONCONVEX PENALIZED REGRESSION; ACTIVE SET ALGORITHM; VARIABLE SELECTION; ELASTIC-NET; SPARSITY; LASSO;
D O I
10.3934/jimo.2022034
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Variable selection and parameter estimation are fundamental and important problems in high dimensional data analysis. In this paper, we employ the hard thresholding regularization method [1] to handle these issues under the framework of high-dimensional and sparse linear regression model. Theoretically, we establish a sharp non-asymptotic estimation error for the global solution and further show that the support of the global solution coincides with the target support with high probability. Motivated by the KKT condition, we propose a primal dual active set algorithm (PDAS) to solve the minimization problem, and show that the proposed PDAS algorithm is essentially a generalized Newton method, which guarantees that the proposed PDAS algorithm will converge fast if a good initial value is provided. Furthermore, we propose a sequential version of the PDAS algorithm (SPDAS) with a warm-start strategy to choose the initial value adaptively. The most significant advantage of the proposed procedure is its fast calculation speed. Extensive numerical studies demonstrate that the proposed method performs well on variable selection and estimation accuracy. It has favorable exhibition over the existing methods in terms of computational speed. As an illustration, we apply the proposed method to a breast cancer gene expression data set.
引用
收藏
页码:2104 / 2122
页数:19
相关论文
共 50 条
  • [31] PENALIZED LINEAR REGRESSION WITH HIGH-DIMENSIONAL PAIRWISE SCREENING
    Gong, Siliang
    Zhang, Kai
    Liu, Yufeng
    STATISTICA SINICA, 2021, 31 (01) : 391 - 420
  • [32] Spline-Lasso in High-Dimensional Linear Regression
    Guo, Jianhua
    Hu, Jianchang
    Jing, Bing-Yi
    Zhang, Zhen
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2016, 111 (513) : 288 - 297
  • [33] Group Transfer Learning for High-Dimensional Linear Regression
    Chen, Chen
    Xu, Dawei
    Ding, Juan
    Zhang, Junjian
    Xiong, Wenjun
    STAT, 2025, 14 (01):
  • [34] TESTING HIGH-DIMENSIONAL REGRESSION COEFFICIENTS IN LINEAR MODELS
    Zhao, Alex
    Li, Changcheng
    Li, Runze
    Zhang, Zhe
    ANNALS OF STATISTICS, 2024, 52 (05): : 2034 - 2058
  • [35] Consistent group selection in high-dimensional linear regression
    Wei, Fengrong
    Huang, Jian
    BERNOULLI, 2010, 16 (04) : 1369 - 1384
  • [36] Robust Estimation of High-Dimensional Linear Regression With Changepoints
    Cui, Xiaolong
    Geng, Haoyu
    Wang, Zhaojun
    Zou, Changliang
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2024, 70 (10) : 7297 - 7319
  • [37] Robust linear regression for high-dimensional data: An overview
    Filzmoser, Peter
    Nordhausen, Klaus
    WILEY INTERDISCIPLINARY REVIEWS-COMPUTATIONAL STATISTICS, 2021, 13 (04)
  • [38] A global homogeneity test for high-dimensional linear regression
    Charbonnier, Camille
    Verzelen, Nicolas
    Villers, Fanny
    ELECTRONIC JOURNAL OF STATISTICS, 2015, 9 (01): : 318 - 382
  • [39] Tests for high-dimensional partially linear regression models
    Shi, Hongwei
    Yang, Weichao
    Sun, Bowen
    Guo, Xu
    STATISTICAL PAPERS, 2025, 66 (03)
  • [40] Empirical likelihood for high-dimensional linear regression models
    Hong Guo
    Changliang Zou
    Zhaojun Wang
    Bin Chen
    Metrika, 2014, 77 : 921 - 945