High-dimensional linear mixed model selection by partial correlation

被引:3
|
作者
Alabiso, Audry [1 ]
Shang, Junfeng [2 ]
机构
[1] Progress Casualty Insurance, Private Passenger Auto, Mayfield, OH USA
[2] Bowling Green State Univ, Dept Math & Stat, Bowling Green, OH 43403 USA
关键词
Mixed model selection; partial correlation; linear mixed models; VARIABLE SELECTION;
D O I
10.1080/03610926.2022.2028838
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We wish to perform variable selection in high-dimensional linear mixed models where the number of the potential covariates is much larger than the sample size and where the random effects are utilized to describe correlated observations. We propose a variable selection procedure based on the Thresholded Partial Correlation (TPC) algorithm (Li, Liu, and Lou 2017) to conduct variable selection using the partial correlation between the covariates and the response variable conditional on the random effects, and this procedure is called the conditional Thresholded Partial Correlation, denoted by TPCc. This TPCc approach is able to select the fixed effects in high-dimensional data when the covariates are highly correlated. We investigate the performance of the proposed method (TPCc) in a variety of simulated high-dimensional data sets. The simulation results show that the TPCc outperforms the TPC in selecting the most appropriate model among the candidate pool in the mixed modeling setting. We also apply the proposed method to a real high-dimensional data set in the production of riboflavin.
引用
收藏
页码:6355 / 6380
页数:26
相关论文
共 50 条