Variable selection for distribution-free models for longitudinal zero-inflated count responses

被引:15
|
作者
Chen, Tian [1 ]
Wu, Pan [2 ]
Tang, Wan [3 ]
Zhang, Hui [4 ]
Feng, Changyong [5 ]
Kowalski, Jeanne [6 ]
Tu, Xin M. [5 ,7 ]
机构
[1] Univ Toledo, Dept Math & Stat, Univ Hall 2010F, Toledo, OH 43606 USA
[2] Christiana Care Hlth Syst, Value Inst, John H Ammon Med Educ Ctr, Newark, DE 19718 USA
[3] Tulane Univ, Sch Publ Hlth & Trop Med, Dept Biostat & Bioinformat, New Orleans, LA 70112 USA
[4] St Jude Childrens Res Hosp, Dept Biostat, 332 N Lauderdale St, Memphis, TN 38105 USA
[5] Univ Rochester, Dept Biostat & Computat Biol, 601 Elmwood Ave, Rochester, NY 14642 USA
[6] Emory Univ, Dept Biostat & Bioinformat, Atlanta, GA 30322 USA
[7] Univ Rochester, Dept Psychiat, 601 Elmwood Ave, Rochester, NY 14642 USA
基金
美国国家卫生研究院;
关键词
functional response models; one-step SCAD; zero-inflated poisson; zero-inflated negative binomial; population mixtures; NONCONCAVE PENALIZED LIKELIHOOD; MIXED-EFFECTS MODELS; ESTIMATING EQUATIONS; MAXIMUM-LIKELIHOOD; CAUSAL INFERENCE; REGRESSION; POISSON; OUTCOMES; MIXTURE;
D O I
10.1002/sim.6892
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Zero-inflated count outcomes arise quite often in research and practice. Parametric models such as the zero-inflated Poisson and zero-inflated negative binomial are widely used to model such responses. Like most parametric models, they are quite sensitive to departures from assumed distributions. Recently, new approaches have been proposed to provide distribution-free, or semi-parametric, alternatives. These methods extend the generalized estimating equations to provide robust inference for population mixtures defined by zero-inflated count outcomes. In this paper, we propose methods to extend smoothly clipped absolute deviation (SCAD)-based variable selection methods to these new models. Variable selection has been gaining popularity in modern clinical research studies, as determining differential treatment effects of interventions for different subgroups has become the norm, rather the exception, in the era of patent-centered outcome research. Such moderation analysis in general creates many explanatory variables in regression analysis, and the advantages of SCAD-based methods over their traditional counterparts render them a great choice for addressing this important and timely issues in clinical research. We illustrate the proposed approach with both simulated and real study data. Copyright (c) 2016 John Wiley & Sons, Ltd.
引用
收藏
页码:2770 / 2785
页数:16
相关论文
共 50 条
  • [21] On Baseline Conditions for Zero-Inflated Longitudinal Count Data
    Maruotti, Antonello
    Raponi, Valentina
    COMMUNICATIONS IN STATISTICS-SIMULATION AND COMPUTATION, 2014, 43 (04) : 743 - 760
  • [22] Joint Models for Longitudinal Zero-Inflated Overdispersed Binomial and Normal Responses
    Seyede Sedighe Azimi
    Ehsan Bahrami Samani
    Sankhya B, 2023, 85 : 251 - 271
  • [23] Joint Models for Longitudinal Zero-Inflated Overdispersed Binomial and Normal Responses
    Azimi, Seyede Sedighe
    Samani, Ehsan Bahrami
    SANKHYA-SERIES B-APPLIED AND INTERDISCIPLINARY STATISTICS, 2023, 85 (02): : 251 - 271
  • [24] Zero-inflated models with application to spatial count data
    Agarwal, DK
    Gelfand, AE
    Citron-Pousty, S
    ENVIRONMENTAL AND ECOLOGICAL STATISTICS, 2002, 9 (04) : 341 - 355
  • [25] Forecasting Civil Conflict with Zero-Inflated Count Models
    Bagozzi, Benjamin E.
    CIVIL WARS, 2015, 17 (01) : 1 - 24
  • [26] Marginal zero-inflated regression models for count data
    Martin, Jacob
    Hall, Daniel B.
    JOURNAL OF APPLIED STATISTICS, 2017, 44 (10) : 1807 - 1826
  • [27] Marginal Mean Models for Zero-Inflated Count Data
    Todem, David
    Kim, KyungMann
    Hsu, Wei-Wen
    BIOMETRICS, 2016, 72 (03) : 986 - 994
  • [28] Zero-inflated Bell regression models for count data
    Lemonte, Artur J.
    Moreno-Arenas, German
    Castellares, Fredy
    JOURNAL OF APPLIED STATISTICS, 2020, 47 (02) : 265 - 286
  • [29] Zero-inflated models with application to spatial count data
    Deepak K. Agarwal
    Alan E. Gelfand
    Steven Citron-Pousty
    Environmental and Ecological Statistics, 2002, 9 : 341 - 355
  • [30] Statistical models for longitudinal zero-inflated count data with applications to the substance abuse field
    Buu, Anne
    Li, Runze
    Tan, Xianming
    Zucker, Robert A.
    STATISTICS IN MEDICINE, 2012, 31 (29) : 4074 - 4086