Leveraging independence in high-dimensional mixed linear regression

被引:0
|
作者
Wang, Ning [1 ]
Deng, Kai [2 ]
Mai, Qing [2 ]
Zhang, Xin [2 ]
机构
[1] Beijing Normal Univ, Dept Stat, Zhuhai 519000, Peoples R China
[2] Florida State Univ, Dept Stat, 17 N Woodward Ave, Tallahassee, FL 32312 USA
基金
美国国家科学基金会;
关键词
EM algorithm; finite mixture model; group lasso; latent variable model; EM ALGORITHM; GAUSSIAN MIXTURES;
D O I
10.1093/biomtc/ujae103
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
We address the challenge of estimating regression coefficients and selecting relevant predictors in the context of mixed linear regression in high dimensions, where the number of predictors greatly exceeds the sample size. Recent advancements in this field have centered on incorporating sparsity-inducing penalties into the expectation-maximization (EM) algorithm, which seeks to maximize the conditional likelihood of the response given the predictors. However, existing procedures often treat predictors as fixed or overlook their inherent variability. In this paper, we leverage the independence between the predictor and the latent indicator variable of mixtures to facilitate efficient computation and also achieve synergistic variable selection across all mixture components. We establish the non-asymptotic convergence rate of the proposed fast group-penalized EM estimator to the true regression parameters. The effectiveness of our method is demonstrated through extensive simulations and an application to the Cancer Cell Line Encyclopedia dataset for the prediction of anticancer drug sensitivity.
引用
收藏
页数:15
相关论文
共 50 条