High-dimensional linear mixed model selection by partial correlation

被引:3
|
作者
Alabiso, Audry [1 ]
Shang, Junfeng [2 ]
机构
[1] Progress Casualty Insurance, Private Passenger Auto, Mayfield, OH USA
[2] Bowling Green State Univ, Dept Math & Stat, Bowling Green, OH 43403 USA
关键词
Mixed model selection; partial correlation; linear mixed models; VARIABLE SELECTION;
D O I
10.1080/03610926.2022.2028838
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We wish to perform variable selection in high-dimensional linear mixed models where the number of the potential covariates is much larger than the sample size and where the random effects are utilized to describe correlated observations. We propose a variable selection procedure based on the Thresholded Partial Correlation (TPC) algorithm (Li, Liu, and Lou 2017) to conduct variable selection using the partial correlation between the covariates and the response variable conditional on the random effects, and this procedure is called the conditional Thresholded Partial Correlation, denoted by TPCc. This TPCc approach is able to select the fixed effects in high-dimensional data when the covariates are highly correlated. We investigate the performance of the proposed method (TPCc) in a variety of simulated high-dimensional data sets. The simulation results show that the TPCc outperforms the TPC in selecting the most appropriate model among the candidate pool in the mixed modeling setting. We also apply the proposed method to a real high-dimensional data set in the production of riboflavin.
引用
收藏
页码:6355 / 6380
页数:26
相关论文
共 50 条
  • [41] Bayesian adaptive lasso with variational Bayes for variable selection in high-dimensional generalized linear mixed models
    Dao Thanh Tung
    Minh-Ngoc Tran
    Tran Manh Cuong
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2019, 48 (02) : 530 - 543
  • [42] Classically high-dimensional correlation: simulation of high-dimensional entanglement
    Li, PengYun
    Zhang, Shihao
    Zhang, Xiangdong
    OPTICS EXPRESS, 2018, 26 (24): : 31413 - 31429
  • [43] A systematic review on model selection in high-dimensional regression
    Lee, Eun Ryung
    Cho, Jinwoo
    Yu, Kyusang
    JOURNAL OF THE KOREAN STATISTICAL SOCIETY, 2019, 48 (01) : 1 - 12
  • [44] Simultaneous Feature and Model Selection for High-Dimensional Data
    Perolini, Alessandro
    Guerif, Sebastien
    2011 23RD IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2011), 2011, : 47 - 50
  • [45] A systematic review on model selection in high-dimensional regression
    Eun Ryung Lee
    Jinwoo Cho
    Kyusang Yu
    Journal of the Korean Statistical Society, 2019, 48 : 1 - 12
  • [46] Automatic model selection for high-dimensional survival analysis
    Lang, M.
    Kotthaus, H.
    Marwedel, P.
    Weihs, C.
    Rahnenfuehrer, J.
    Bischl, B.
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2015, 85 (01) : 62 - 76
  • [47] High-dimensional Gaussian model selection on a Gaussian design
    Verzelen, Nicolas
    ANNALES DE L INSTITUT HENRI POINCARE-PROBABILITES ET STATISTIQUES, 2010, 46 (02): : 480 - 524
  • [48] Scalar correlation functions for model structure selection in high-dimensional time-series modelling
    Kathari, Sudhakar
    Tangirala, Arun K.
    ISA TRANSACTIONS, 2020, 100 : 275 - 288
  • [49] High-dimensional partial correlation coefficients: A survey study of estimation Methods
    Yang, Jingying
    Bai, Guishu
    Qin, Xu
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2025, 54 (06) : 1637 - 1660
  • [50] High-dimensional inference for linear model with correlated errors
    Yuan, Panxu
    Guo, Xiao
    METRIKA, 2022, 85 (01) : 21 - 52