An information theoretical algorithm for analyzing supersaturated designs for a binary response

被引:0
|
作者
N. Balakrishnan
C. Koukouvinos
C. Parpoula
机构
[1] McMaster University,Department of Mathematics and Statistics
[2] National Technical University of Athens,Department of Mathematics
来源
Metrika | 2013年 / 76卷
关键词
Entropy; Error rates; Factor screening; Generalized linear models; Information gain; ROC; Symmetrical uncertainty;
D O I
暂无
中图分类号
学科分类号
摘要
A supersaturated design is a factorial design in which the number of effects to be estimated is greater than the number of runs. It is used in many experiments, for screening purpose, i.e., for studying a large number of factors and identifying the active ones. In this paper, we propose a method for screening out the important factors from a large set of potentially active variables through the symmetrical uncertainty measure combined with the information gain measure. We develop an information theoretical analysis method by using Shannon and some other entropy measures such as Rényi entropy, Havrda–Charvát entropy, and Tsallis entropy, on data and assuming generalized linear models for a Bernoulli response. This method is quite advantageous as it enables us to use supersaturated designs for analyzing data on generalized linear models. Empirical study demonstrates that this method performs well giving low Type I and Type II error rates for any entropy measure we use. Moreover, the proposed method is more efficient when compared to the existing ROC methodology of identifying the significant factors for a dichotomous response in terms of error rates.
引用
收藏
页码:1 / 18
页数:17
相关论文
共 50 条