Forward selection of explanatory variables

被引:1731
|
作者
Blanchet, F. Guillaume [1 ]
Legendre, Pierre [1 ]
Borcard, Daniel [1 ]
机构
[1] Univ Montreal, Dept Sci Biol, Montreal, PQ H3C 3J7, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
forward selection; Moran's eigenvector maps (MEM); non-orthogonal explanatory variables; orthogonal explanatory variables; principal coordinates of neighbor matrices (PCNM); simulation study; Type I error;
D O I
10.1890/07-0986.1
中图分类号
Q14 [生态学(生物生态学)];
学科分类号
071012 ; 0713 ;
摘要
This paper proposes a new way of using forward selection of explanatory variables in regression or canonical redundancy analysis. The classical forward selection method presents two problems: a highly inflated Type I error and an overestimation of the amount of explained variance. Correcting these problems will greatly improve the performance of this very useful method in ecological modeling. To prevent the first problem, we propose a two-step procedure. First, a global test using all explanatory variables is carried out. If, and only if, the global test is significant, one can proceed with forward selection. To prevent overestimation of the explained variance, the forward selection has to be carried out with two stopping criteria: (1) the usual alpha significance level and (2) the adjusted coefficient of multiple determination (R-a(2)) calculated using all explanatory variables. When forward selection identifies a variable that brings one or the other criterion over the fixed threshold, that variable is rejected, and the procedure is stopped. This improved method is validated by simulations involving univariate and multivariate response data. An ecological example is presented using data from the Bryce Canyon National Park, Utah, USA.
引用
收藏
页码:2623 / 2632
页数:10
相关论文
共 50 条