generalized additive models for location, scale and shape;
model-based boosting;
multivariate Gaussian distribution;
multivariate logit model;
multivariate Poisson distribution;
semiparametric regression;
VARIABLE SELECTION;
POISSON REGRESSION;
R PACKAGE;
BIVARIATE;
REGULARIZATION;
ALGORITHMS;
DOI:
10.1002/sim.9699
Chinese Library Classification:
Q [Biological Sciences];
Discipline classification codes:
07 ;
0710 ;
09 ;
Abstract:
We develop a model-based boosting approach for multivariate distributional regression within the framework of generalized additive models for location, scale, and shape. Our approach enables the simultaneous modeling of all distribution parameters of an arbitrary parametric distribution of a multivariate response conditional on explanatory variables, while remaining applicable to potentially high-dimensional data. Moreover, the boosting algorithm incorporates data-driven variable selection, taking various types of effects into account. A particular merit of our approach is that it allows the association between multiple continuous or discrete outcomes to be modeled through the relevant covariates. After a detailed simulation study investigating estimation and prediction performance, we demonstrate the full flexibility of our approach in three diverse biomedical applications. The first is based on high-dimensional genomic cohort data from the UK Biobank and considers a bivariate binary response (chronic ischemic heart disease and high cholesterol). Here, we are able to identify genetic variants that are informative for the association between cholesterol and heart disease. The second application considers the demand for health care in Australia, with the number of consultations and the number of prescribed medications as a bivariate count response. The third application analyses two dimensions of childhood undernutrition in Nigeria as a bivariate response; we find that the correlation between the two undernutrition scores differs considerably depending on the child's age and the region the child lives in.
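To illustrate the idea of boosting with built-in variable selection mentioned in the abstract, the following is a minimal sketch of component-wise gradient boosting for a single univariate response with squared-error loss. It is not the authors' multivariate algorithm: the function name `componentwise_l2_boost`, the parameters `n_iter` and `step`, and the simulated data are all assumptions made for illustration. In each iteration only the single best-fitting covariate is updated, so stopping early leaves the coefficients of uninformative covariates at zero.

```python
import numpy as np

def componentwise_l2_boost(X, y, n_iter=200, step=0.1):
    """Component-wise gradient boosting with squared-error loss (illustrative).

    In each iteration, every column of X is fitted separately to the
    current residuals (the negative gradient of the squared-error loss),
    and only the best-fitting column's coefficient is updated.
    Assumes the columns of X are centred.
    """
    n, p = X.shape
    intercept = y.mean()
    coef = np.zeros(p)
    fitted = np.full(n, intercept)
    col_ss = (X ** 2).sum(axis=0)  # per-column sum of squares
    for _ in range(n_iter):
        resid = y - fitted
        beta = X.T @ resid / col_ss  # least-squares slope for each column
        # residual sum of squares if only column j were used
        rss = ((resid[:, None] - X * beta) ** 2).sum(axis=0)
        j = int(np.argmin(rss))
        coef[j] += step * beta[j]            # small step on the winner only
        fitted += step * beta[j] * X[:, j]
    return intercept, coef

# Simulated data (hypothetical): only columns 0 and 3 carry signal.
rng = np.random.default_rng(0)
X = rng.standard_normal((200, 10))
X -= X.mean(axis=0)
y = 2.0 * X[:, 0] - 1.5 * X[:, 3] + 0.1 * rng.standard_normal(200)

intercept, coef = componentwise_l2_boost(X, y)
selected = np.flatnonzero(coef)  # indices of covariates updated at least once
print(selected, np.round(coef, 2))
```

The number of iterations acts as the main tuning parameter in this sketch: with few iterations only the strongest covariates enter the model, while many iterations let weakly fitting covariates pick up small coefficients. A distributional version would, under the same logic, run such updates on the gradient of a likelihood with respect to each distribution parameter rather than on plain residuals.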