Leveraging independence in high-dimensional mixed linear regression

被引：0

作者：

Wang, Ning ^{[1
]}

Deng, Kai ^{[2
]}

Mai, Qing ^{[2
]}

Zhang, Xin ^{[2
]}

机构：

[1] Beijing Normal Univ, Dept Stat, Zhuhai 519000, Peoples R China

[2] Florida State Univ, Dept Stat, 17 N Woodward Ave, Tallahassee, FL 32312 USA

来源：

BIOMETRICS | 2024年 / 80卷 / 03期

基金：

美国国家科学基金会;

关键词：

EM algorithm; finite mixture model; group lasso; latent variable model; EM ALGORITHM; GAUSSIAN MIXTURES;

D O I：

10.1093/biomtc/ujae103

中图分类号：

Q [生物科学];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

We address the challenge of estimating regression coefficients and selecting relevant predictors in the context of mixed linear regression in high dimensions, where the number of predictors greatly exceeds the sample size. Recent advancements in this field have centered on incorporating sparsity-inducing penalties into the expectation-maximization (EM) algorithm, which seeks to maximize the conditional likelihood of the response given the predictors. However, existing procedures often treat predictors as fixed or overlook their inherent variability. In this paper, we leverage the independence between the predictor and the latent indicator variable of mixtures to facilitate efficient computation and also achieve synergistic variable selection across all mixture components. We establish the non-asymptotic convergence rate of the proposed fast group-penalized EM estimator to the true regression parameters. The effectiveness of our method is demonstrated through extensive simulations and an application to the Cancer Cell Line Encyclopedia dataset for the prediction of anticancer drug sensitivity.

引用

页数：15

共 50 条

[31] MODEL SELECTION FOR HIGH-DIMENSIONAL LINEAR REGRESSION WITH DEPENDENT OBSERVATIONS
Ing, Ching-Kang
ANNALS OF STATISTICS, 2020, 48 (04): : 1959 - 1980
[32] HIGH-DIMENSIONAL LINEAR REGRESSION FOR DEPENDENT DATA WITH APPLICATIONS TO NOWCASTING
Han, Yuefeng
Tsay, Ruey S.
STATISTICA SINICA, 2020, 30 (04) : 1797 - 1827
[33] Variational Bayes for High-Dimensional Linear Regression With Sparse Priors
Ray, Kolyan
Szabo, Botond
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2022, 117 (539) : 1270 - 1281
[34] Shrinkage Ridge Regression Estimators in High-Dimensional Linear Models
Yuzbasi, Bahadir
Ahmed, S. Ejaz
PROCEEDINGS OF THE NINTH INTERNATIONAL CONFERENCE ON MANAGEMENT SCIENCE AND ENGINEERING MANAGEMENT, 2015, 362 : 793 - 807
[35] Sparsity Oriented Importance Learning for High-Dimensional Linear Regression
Ye, Chenglong
Yang, Yi
Yang, Yuhong
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2018, 113 (524) : 1797 - 1812
[36] Distributed Continual Learning With CoCoA in High-Dimensional Linear Regression
Hellkvist, Martin
Ozcelikkale, Ayca
Ahlen, Anders
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 2024, 72 : 1015 - 1031
[37] The sparsity and bias of the lasso selection in high-dimensional linear regression
Zhang, Cun-Hui
Huang, Jian
ANNALS OF STATISTICS, 2008, 36 (04): : 1567 - 1594
[38] Empirical Priors for Prediction in Sparse High-dimensional Linear Regression
Martin, Ryan
Tang, Yiqi
JOURNAL OF MACHINE LEARNING RESEARCH, 2020, 21
[39] The likelihood ratio test for high-dimensional linear regression model
Xie, Junshan
Xiao, Nannan
COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2017, 46 (17) : 8479 - 8492
[40] A comparison study of Bayesian high-dimensional linear regression models
Shin, Ju-Won
Lee, Kyoungjae
KOREAN JOURNAL OF APPLIED STATISTICS, 2021, 34 (03) : 491 - 505

← 1 2 3 4 5 →