HIGH-DIMENSIONAL LINEAR REGRESSION WITH HARD THRESHOLDING REGULARIZATION: THEORY AND ALGORITHM

被引:0
|
作者
Kang, Lican [1 ,2 ]
Lai, Yanming [1 ]
Liu, Yanyan [1 ]
Luo, Yuan [1 ]
Zhang, Jing [3 ]
机构
[1] Wuhan Univ, Sch Math & Stat, Wuhan 430072, Hubei, Peoples R China
[2] Duke NUS Med Sch, Ctr Quantitat Med, Singapore 169857, Singapore
[3] Zhongnan Univ Econ & Law, Sch Math & Stat, Wuhan 430073, Hubei, Peoples R China
基金
中国国家自然科学基金;
关键词
  Generalized Newton method; hard thresholding regularization; high-dimensional; linear regression model; primal dual active set algorithm; NONCONVEX PENALIZED REGRESSION; ACTIVE SET ALGORITHM; VARIABLE SELECTION; ELASTIC-NET; SPARSITY; LASSO;
D O I
10.3934/jimo.2022034
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Variable selection and parameter estimation are fundamental and important problems in high dimensional data analysis. In this paper, we employ the hard thresholding regularization method [1] to handle these issues under the framework of high-dimensional and sparse linear regression model. Theoretically, we establish a sharp non-asymptotic estimation error for the global solution and further show that the support of the global solution coincides with the target support with high probability. Motivated by the KKT condition, we propose a primal dual active set algorithm (PDAS) to solve the minimization problem, and show that the proposed PDAS algorithm is essentially a generalized Newton method, which guarantees that the proposed PDAS algorithm will converge fast if a good initial value is provided. Furthermore, we propose a sequential version of the PDAS algorithm (SPDAS) with a warm-start strategy to choose the initial value adaptively. The most significant advantage of the proposed procedure is its fast calculation speed. Extensive numerical studies demonstrate that the proposed method performs well on variable selection and estimation accuracy. It has favorable exhibition over the existing methods in terms of computational speed. As an illustration, we apply the proposed method to a breast cancer gene expression data set.
引用
收藏
页码:2104 / 2122
页数:19
相关论文
共 50 条
  • [1] CANONICAL THRESHOLDING FOR NONSPARSE HIGH-DIMENSIONAL LINEAR REGRESSION
    Silin, Igor
    Fan, Jianqing
    ANNALS OF STATISTICS, 2022, 50 (01): : 460 - 486
  • [2] Hard-thresholding regularization method for high-dimensional heterogeneous models
    Wu, Yue
    Zheng, Zemin
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2025,
  • [3] High-dimensional linear regression via implicit regularization
    Zhao, Peng
    Yang, Yun
    He, Qiao-Chu
    BIOMETRIKA, 2022, 109 (04) : 1033 - 1046
  • [4] An algorithm for total variation regularization in high-dimensional linear problems
    Defrise, Michel
    Vanhove, Christian
    Liu, Xuan
    INVERSE PROBLEMS, 2011, 27 (06)
  • [5] High-Dimensional Multivariate Linear Regression with Weighted Nuclear Norm Regularization
    Suh, Namjoon
    Lin, Li-Hsiang
    Huo, Xiaoming
    JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 2024, 33 (04) : 1264 - 1275
  • [6] The Impact of Regularization on High-dimensional Logistic Regression
    Salehi, Fariborz
    Abbasi, Ehsan
    Hassibi, Babak
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 32 (NIPS 2019), 2019, 32
  • [7] Interpretation of high-dimensional linear regression: Effects of nullspace and regularization demonstrated on battery data
    Schaeffer, Joachim
    Lenz, Eric
    Chueh, William C.
    Bazant, Martin Z.
    Findeisen, Rolf
    Braatz, Richard D.
    COMPUTERS & CHEMICAL ENGINEERING, 2024, 180
  • [8] CONVEX REGULARIZATION FOR HIGH-DIMENSIONAL MULTIRESPONSE TENSOR REGRESSION
    Raskutti, Garvesh
    Yuan, Ming
    Chen, Han
    ANNALS OF STATISTICS, 2019, 47 (03): : 1554 - 1584
  • [9] An Improved Forward Regression Variable Selection Algorithm for High-Dimensional Linear Regression Models
    Xie, Yanxi
    Li, Yuewen
    Xia, Zhijie
    Yan, Ruixia
    IEEE ACCESS, 2020, 8 (08): : 129032 - 129042
  • [10] On Iterative Hard Thresholding Methods for High-dimensional M-Estimation
    Jain, Prateek
    Tewari, Ambuj
    Kar, Purushottam
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27