共 50 条
Variable selection for distribution-free models for longitudinal zero-inflated count responses
被引:15
|作者:
Chen, Tian
[1
]
Wu, Pan
[2
]
Tang, Wan
[3
]
Zhang, Hui
[4
]
Feng, Changyong
[5
]
Kowalski, Jeanne
[6
]
Tu, Xin M.
[5
,7
]
机构:
[1] Univ Toledo, Dept Math & Stat, Univ Hall 2010F, Toledo, OH 43606 USA
[2] Christiana Care Hlth Syst, Value Inst, John H Ammon Med Educ Ctr, Newark, DE 19718 USA
[3] Tulane Univ, Sch Publ Hlth & Trop Med, Dept Biostat & Bioinformat, New Orleans, LA 70112 USA
[4] St Jude Childrens Res Hosp, Dept Biostat, 332 N Lauderdale St, Memphis, TN 38105 USA
[5] Univ Rochester, Dept Biostat & Computat Biol, 601 Elmwood Ave, Rochester, NY 14642 USA
[6] Emory Univ, Dept Biostat & Bioinformat, Atlanta, GA 30322 USA
[7] Univ Rochester, Dept Psychiat, 601 Elmwood Ave, Rochester, NY 14642 USA
基金:
美国国家卫生研究院;
关键词:
functional response models;
one-step SCAD;
zero-inflated poisson;
zero-inflated negative binomial;
population mixtures;
NONCONCAVE PENALIZED LIKELIHOOD;
MIXED-EFFECTS MODELS;
ESTIMATING EQUATIONS;
MAXIMUM-LIKELIHOOD;
CAUSAL INFERENCE;
REGRESSION;
POISSON;
OUTCOMES;
MIXTURE;
D O I:
10.1002/sim.6892
中图分类号:
Q [生物科学];
学科分类号:
07 ;
0710 ;
09 ;
摘要:
Zero-inflated count outcomes arise quite often in research and practice. Parametric models such as the zero-inflated Poisson and zero-inflated negative binomial are widely used to model such responses. Like most parametric models, they are quite sensitive to departures from assumed distributions. Recently, new approaches have been proposed to provide distribution-free, or semi-parametric, alternatives. These methods extend the generalized estimating equations to provide robust inference for population mixtures defined by zero-inflated count outcomes. In this paper, we propose methods to extend smoothly clipped absolute deviation (SCAD)-based variable selection methods to these new models. Variable selection has been gaining popularity in modern clinical research studies, as determining differential treatment effects of interventions for different subgroups has become the norm, rather the exception, in the era of patent-centered outcome research. Such moderation analysis in general creates many explanatory variables in regression analysis, and the advantages of SCAD-based methods over their traditional counterparts render them a great choice for addressing this important and timely issues in clinical research. We illustrate the proposed approach with both simulated and real study data. Copyright (c) 2016 John Wiley & Sons, Ltd.
引用
收藏
页码:2770 / 2785
页数:16
相关论文