Leveraging independence in high-dimensional mixed linear regression

Citations: 0

Authors:

Wang, Ning [1]

Deng, Kai [2]

Mai, Qing [2]

Zhang, Xin [2]

Affiliations:

[1] Beijing Normal Univ, Dept Stat, Zhuhai 519000, Peoples R China

[2] Florida State Univ, Dept Stat, 17 N Woodward Ave, Tallahassee, FL 32312 USA

Source:

BIOMETRICS | 2024 / Volume 80 / Issue 03

Funding:

National Science Foundation (USA);

Keywords:

EM algorithm; finite mixture model; group lasso; latent variable model; EM ALGORITHM; GAUSSIAN MIXTURES;

DOI:

10.1093/biomtc/ujae103

Chinese Library Classification:

Q [Biological Sciences];

Subject classification codes:

07 ; 0710 ; 09 ;

Abstract:

We address the challenge of estimating regression coefficients and selecting relevant predictors in the context of mixed linear regression in high dimensions, where the number of predictors greatly exceeds the sample size. Recent advancements in this field have centered on incorporating sparsity-inducing penalties into the expectation-maximization (EM) algorithm, which seeks to maximize the conditional likelihood of the response given the predictors. However, existing procedures often treat predictors as fixed or overlook their inherent variability. In this paper, we leverage the independence between the predictor and the latent indicator variable of mixtures to facilitate efficient computation and also achieve synergistic variable selection across all mixture components. We establish the non-asymptotic convergence rate of the proposed fast group-penalized EM estimator to the true regression parameters. The effectiveness of our method is demonstrated through extensive simulations and an application to the Cancer Cell Line Encyclopedia dataset for the prediction of anticancer drug sensitivity.
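For orientation, the setting described in the abstract can be written down in a generic textbook form. The sketch below is an assumption-laden illustration, not the paper's own notation or estimator: the symbols $K$ (number of mixture components), $\pi_k$, $\beta_k$, $\sigma^2$, $\lambda$, and $\gamma_{ik}$ are introduced here for illustration, a common noise variance is assumed, and the penalty shown is the standard group lasso penalty that ties each predictor's coefficients together across components. How the authors exploit the independence of $X$ and $Z$ to speed up computation is not stated in the abstract and is not reproduced here.

```latex
% Mixed linear regression with a latent component label Z (assumed notation)
\begin{align*}
  &\Pr(Z_i = k) = \pi_k, \qquad Z_i \perp X_i, \qquad k = 1, \dots, K, \\
  &Y_i \mid (X_i, Z_i = k) \sim N\!\left(X_i^{\top}\beta_k,\ \sigma^2\right),
    \qquad X_i \in \mathbb{R}^p,\ p \gg n .
\end{align*}

% A generic group-penalized conditional log-likelihood:
% predictor j is selected or dropped jointly in all K components
\begin{equation*}
  \max_{\pi,\,\beta,\,\sigma^2}\;
  \frac{1}{n}\sum_{i=1}^{n}
  \log\!\left\{ \sum_{k=1}^{K} \pi_k\,
    \phi\!\left(Y_i;\ X_i^{\top}\beta_k,\ \sigma^2\right) \right\}
  \;-\; \lambda \sum_{j=1}^{p}
    \sqrt{\sum_{k=1}^{K} \beta_{kj}^{2}} .
\end{equation*}

% E-step responsibilities of a standard EM iteration for this objective
\begin{equation*}
  \gamma_{ik} \;=\;
  \frac{\pi_k\,\phi\!\left(Y_i;\ X_i^{\top}\beta_k,\ \sigma^2\right)}
       {\sum_{l=1}^{K} \pi_l\,\phi\!\left(Y_i;\ X_i^{\top}\beta_l,\ \sigma^2\right)} .
\end{equation*}
```

Here $\phi(\cdot;\ \mu, \sigma^2)$ denotes the normal density. Because the penalty groups $(\beta_{1j}, \dots, \beta_{Kj})$ for each predictor $j$, a predictor is either kept in every component or removed from all of them, which is one way to read the "synergistic variable selection across all mixture components" mentioned in the abstract.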

Pages: 15

50 items in total

[41] Consistent Risk Estimation in Moderately High-Dimensional Linear Regression
Xu, Ji
Maleki, Arian
Rad, Kamiar Rahnama
Hsu, Daniel
IEEE TRANSACTIONS ON INFORMATION THEORY, 2021, 67 (09) : 5997 - 6030
[42] Empirical priors for prediction in sparse high-dimensional linear regression
Martin, Ryan
Tang, Yiqi
Journal of Machine Learning Research, 2020, 21
[43] Fixed Effects Testing in High-Dimensional Linear Mixed Models
Bradic, Jelena
Claeskens, Gerda
Gueuning, Thomas
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2020, 115 (532) : 1835 - 1850
[44] Variational Bayesian Inference in High-Dimensional Linear Mixed Models
Yi, Jieyi
Tang, Niansheng
MATHEMATICS, 2022, 10 (03)
[45] High-dimensional linear mixed model selection by partial correlation
Alabiso, Audry
Shang, Junfeng
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2023, 52 (18) : 6355 - 6380
[46] Regression on High-dimensional Inputs
Kuleshov, Alexander
Bernstein, Alexander
2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2016, : 732 - 739
[47] Scalable Algorithms for Learning High-Dimensional Linear Mixed Models
Tan, Zilong
Roche, Kimberly
Zhou, Xiang
Mukherjee, Sayan
UNCERTAINTY IN ARTIFICIAL INTELLIGENCE, 2018, : 259 - 268
[48] On inference in high-dimensional regression
Battey, Heather S.
Reid, Nancy
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2023, 85 (01) : 149 - 175
[49] An Improved Forward Regression Variable Selection Algorithm for High-Dimensional Linear Regression Models
Xie, Yanxi
Li, Yuewen
Xia, Zhijie
Yan, Ruixia
IEEE ACCESS, 2020, 8 (08) : 129032 - 129042
[50] Reduced rank regression with matrix projections for high-dimensional multivariate linear regression model
Guo, Wenxing
Balakrishnan, Narayanaswamy
Bian, Mengjie
ELECTRONIC JOURNAL OF STATISTICS, 2021, 15 (02) : 4167 - 4191
