共 50 条
A note on marginal correlation based screening
被引:2
|作者:
Wang, Run
[1
]
Dutta, Somak
[1
]
Roy, Vivekananda
[1
]
机构:
[1] Iowa State Univ, Dept Stat, 3415 Snedecor Hall, Ames, IA 50011 USA
基金:
美国农业部;
美国食品与农业研究所;
关键词:
correlation;
feature selection;
screening;
sure independence screening;
two-sample t-test;
VARIABLE SELECTION;
REGRESSION;
D O I:
10.1002/sam.11491
中图分类号:
TP18 [人工智能理论];
学科分类号:
081104 ;
0812 ;
0835 ;
1405 ;
摘要:
Independence screening methods such as the two-sample t-test and the marginal correlation based ranking are among the most widely used techniques for variable selection in ultrahigh-dimensional data sets. In this short note, simple examples are used to demonstrate potential problems with the independence screening methods in the presence of correlated predictors. Also, an example is considered where all important variables are independent among themselves and all but one important variables are independent with the unimportant variables. Furthermore, a real data example from a genome-wide association study is used to illustrate inferior performance of marginal correlation screening compared to another screening method.
引用
收藏
页码:88 / 92
页数:5
相关论文