Leveraging independence in high-dimensional mixed linear regression

被引:0
|
作者
Wang, Ning [1 ]
Deng, Kai [2 ]
Mai, Qing [2 ]
Zhang, Xin [2 ]
机构
[1] Beijing Normal Univ, Dept Stat, Zhuhai 519000, Peoples R China
[2] Florida State Univ, Dept Stat, 17 N Woodward Ave, Tallahassee, FL 32312 USA
基金
美国国家科学基金会;
关键词
EM algorithm; finite mixture model; group lasso; latent variable model; EM ALGORITHM; GAUSSIAN MIXTURES;
D O I
10.1093/biomtc/ujae103
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We address the challenge of estimating regression coefficients and selecting relevant predictors in the context of mixed linear regression in high dimensions, where the number of predictors greatly exceeds the sample size. Recent advancements in this field have centered on incorporating sparsity-inducing penalties into the expectation-maximization (EM) algorithm, which seeks to maximize the conditional likelihood of the response given the predictors. However, existing procedures often treat predictors as fixed or overlook their inherent variability. In this paper, we leverage the independence between the predictor and the latent indicator variable of mixtures to facilitate efficient computation and also achieve synergistic variable selection across all mixture components. We establish the non-asymptotic convergence rate of the proposed fast group-penalized EM estimator to the true regression parameters. The effectiveness of our method is demonstrated through extensive simulations and an application to the Cancer Cell Line Encyclopedia dataset for the prediction of anticancer drug sensitivity.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Consistent Risk Estimation in Moderately High-Dimensional Linear Regression
    Xu, Ji
    Maleki, Arian
    Rad, Kamiar Rahnama
    Hsu, Daniel
    IEEE TRANSACTIONS ON INFORMATION THEORY, 2021, 67 (09) : 5997 - 6030
  • [42] Empirical priors for prediction in sparse high-dimensional linear regression
    Martin, Ryan
    Tang, Yiqi
    Journal of Machine Learning Research, 2020, 21
  • [43] Fixed Effects Testing in High-Dimensional Linear Mixed Models
    Bradic, Jelena
    Claeskens, Gerda
    Gueuning, Thomas
    JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (532) : 1835 - 1850
  • [44] Variational Bayesian Inference in High-Dimensional Linear Mixed Models
    Yi, Jieyi
    Tang, Niansheng
    MATHEMATICS, 2022, 10 (03)
  • [45] High-dimensional linear mixed model selection by partial correlation
    Alabiso, Audry
    Shang, Junfeng
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2023, 52 (18) : 6355 - 6380
  • [46] Regression on High-dimensional Inputs
    Kuleshov, Alexander
    Bernstein, Alexander
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 732 - 739
  • [47] Scalable Algorithms for Learning High-Dimensional Linear Mixed Models
    Tan, Zilong
    Roche, Kimberly
    Zhou, Xiang
    Mukherjee, Sayan
    UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2018, : 259 - 268
  • [48] On inference in high-dimensional regression
    Battey, Heather S.
    Reid, Nancy
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2023, 85 (01) : 149 - 175
  • [49] An Improved Forward Regression Variable Selection Algorithm for High-Dimensional Linear Regression Models
    Xie, Yanxi
    Li, Yuewen
    Xia, Zhijie
    Yan, Ruixia
    IEEE ACCESS, 2020, 8 (08): : 129032 - 129042
  • [50] Reduced rank regression with matrix projections for high-dimensional multivariate linear regression model
    Guo, Wenxing
    Balakrishnan, Narayanaswamy
    Bian, Mengjie
    ELECTRONIC JOURNAL OF STATISTICS, 2021, 15 (02): : 4167 - 4191