Variable selection for multiply-imputed data with penalized generalized estimating equations

被引:6
|
作者
Geronimi, J. [1 ,2 ]
Saporta, G. [2 ]
机构
[1] IRIS, 50 Rue Carnot, F-92284 Suresnes, France
[2] CNAM, Cedric, 292 Rue St Martin, F-75141 Paris, France
关键词
Generalized estimating equations; LASSO; Longitudinal data; Missing data; Multiple imputation; Variable selection; LONGITUDINAL DATA; MISSING DATA; IMPUTATION; REGRESSION; KNEE;
D O I
10.1016/j.csda.2017.01.001
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Generalized estimating equations (GEE) are useful tools for marginal regression analysis for longitudinal data. Having a high number of variables along with the presence of missing data presents complex issues when working in a longitudinal context. In variable selection for instance, penalized generalized estimating equations have not been systematically developed to integrate missing data. The MI-PGEE: multiple imputation penalized generalized estimating equations, an extension of the multiple imputation least absolute shrinkage and selection operator (MI-LASSO) is presented. MI-PGEE allows integration of missing data and within-subject correlation in variable selection procedures. Missing data are dealt with using multiple imputation, and variable selection is performed using a group LASSO penalty. Estimated coefficients for the same variable across multiply imputed datasets are considered as a group while applying penalized generalized estimating equations, leading to a unique model across multiply-imputed datasets. In order to select the tuning parameter, a new BIC-like criterion is proposed. In a simulation study, the advantage of using MI-PGEE compared to simple imputation PGEE is shown. The usefulness of the new method is illustrated by an application to a subgroup of the placebo arm of the strontium ranelate efficacy in knee osteoarthritis trial study. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:103 / 114
页数:12
相关论文
共 50 条
  • [21] Penalized Generalized Estimating Equations for High-Dimensional Longitudinal Data Analysis
    Wang, Lan
    Zhou, Jianhui
    Qu, Annie
    BIOMETRICS, 2012, 68 (02) : 353 - 360
  • [22] Variable selection for recurrent event data via nonconcave penalized estimating function
    Xingwei Tong
    Liang Zhu
    Jianguo Sun
    Lifetime Data Analysis, 2009, 15 : 197 - 215
  • [23] Variable selection for recurrent event data via nonconcave penalized estimating function
    Tong, Xingwei
    Zhu, Liang
    Sun, Jianguo
    LIFETIME DATA ANALYSIS, 2009, 15 (02) : 197 - 215
  • [24] Penalized Generalized Quasi-Likelihood Based Variable Selection for Longitudinal Data
    Nadarajah, Tharshanna
    Variyath, Asokan Mulayath
    Loredo-Osti, J. Concepcion
    ADVANCES AND CHALLENGES IN PARAMETRIC AND SEMI-PARAMETRIC ANALYSIS FOR CORRELATED DATA, 2016, 218 : 233 - 250
  • [25] Tobit analysis to investigate determinants of the level of assets in couples' pension accounts using multiply-imputed data and techniques
    Yuh, Y
    DeVaney, SA
    Hanna, S
    CONSUMER INTERESTS ANNUAL, VOL 43: 43RD ANNUAL CONFERENCE OF THE AMERICAN COUNCIL ON CONSUMER INTERESTS, 1997, : 169 - 169
  • [26] PENALIZED GENERALIZED EMPIRICAL LIKELIHOOD WITH A DIVERGING NUMBER OF GENERAL ESTIMATING EQUATIONS FOR CENSORED DATA
    Tang, Niansheng
    Yan, Xiaodong
    Zhao, Xingqiu
    ANNALS OF STATISTICS, 2020, 48 (01): : 607 - 627
  • [27] A comparison of model selection methods for prediction in the presence of multiply imputed data
    Le Thi Phuong Thao
    Geskus, Ronald
    BIOMETRICAL JOURNAL, 2019, 61 (02) : 343 - 356
  • [28] Data mining for longitudinal data under multicollinearity and time dependence using penalized generalized estimating equations
    Blommaert, A.
    Hens, N.
    Beutels, Ph
    COMPUTATIONAL STATISTICS & DATA ANALYSIS, 2014, 71 : 667 - 680
  • [29] A simple pooling method for variable selection in multiply imputed datasets outperformed complex methods
    Panken, A. M.
    Heymans, M. W.
    BMC MEDICAL RESEARCH METHODOLOGY, 2022, 22 (01)
  • [30] PENALIZED ESTIMATING EQUATIONS FOR GENERALIZED LINEAR MODELS WITH MULTIPLE IMPUTATION
    Li, Yang
    Yang, Haoyu
    Yu, Haochen
    Huang, Hanwen
    Shen, Ye
    ANNALS OF APPLIED STATISTICS, 2023, 17 (03): : 2345 - 2363