regression;
principal components;
multiple models;
combining estimates;
D O I:
暂无
中图分类号:
学科分类号:
摘要:
The goal of combining the predictions of multiple learned models is to form an improved estimator. A combining strategy must be able to robustly handle the inherent correlation, or multicollinearity, of the learned models while identifying the unique contributions of each. A progression of existing approaches and their limitations with respect to these two issues are discussed. A new approach, PCR*, based on principal components regression is proposed to address these limitations. An evaluation of the new approach on a collection of domains reveals that (1) PCR* was the most robust combining method, (2) correlation could be handled without eliminating any of the learned models, and (3) the principal components of the learned models provided a continuum of “regularized” weights from which PCR* could choose.
机构:
TU Wien, Fac Math & Geoinformat, Inst Stat & Math Methods Econ, Appl Stat Res Unit, Vienna, Austria
Univ Gothenburg, Dept Econ, Gothenburg, Sweden
Univ Gothenburg, Ctr Finance, Gothenburg, Sweden
Karolinska Inst, Inst Environm Med, Div Biostat, Nobels Vag 13, S-17177 Stockholm, SwedenTU Wien, Fac Math & Geoinformat, Inst Stat & Math Methods Econ, Appl Stat Res Unit, Vienna, Austria