Can Nonrandomized Experiments Yield Accurate Answers? A Randomized Experiment Comparing Random and Nonrandom Assignments

Cited by: 330
Authors
Shadish, William R. [1]
Clark, M. H. [2]
Steiner, Peter M. [3]
Affiliations
[1] Univ Calif, Sch Social Sci Human & Arts, Psychol Sci Sect, Merced, CA 95344 USA
[2] So Illinois Univ, Dept Psychol, Carbondale, IL 62901 USA
[3] Inst Adv Studies, A-1060 Vienna, Austria
Keywords
Nonrandomized experiment; Propensity score; Randomized experiment; Selection bias;
DOI
10.1198/016214508000000733
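Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics];
Discipline classification codes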
020208 ; 070103 ; 0714 ;
Abstract
A key justification for using nonrandomized experiments is that, with proper adjustment, their results can well approximate results from randomized experiments. This hypothesis has not been consistently supported by empirical studies; however, previous methods used to study this hypothesis have confounded assignment method with other study features. To avoid these confounding factors, this study randomly assigned participants to be in a randomized experiment or a nonrandomized experiment. In the randomized experiment, participants were randomly assigned to mathematics or vocabulary training; in the nonrandomized experiment, participants chose their training. The study held all other features of the experiment constant: it carefully measured pretest variables that might predict the condition that participants chose, and all participants were measured on vocabulary and mathematics outcomes. Ordinary linear regression reduced bias in the nonrandomized experiment by 84-94% using covariate-adjusted randomized results as the benchmark. Propensity score stratification, weighting, and covariance adjustment reduced bias by about 58-96%, depending on the outcome measure and adjustment method. Propensity score adjustment performed poorly when the scores were constructed from predictors of convenience (sex, age, marital status, and ethnicity) rather than from a broader set of predictors that might include these.
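The bias-reduction percentages quoted in the abstract compare an adjusted nonrandomized estimate against a randomized benchmark. The sketch below shows one common way such a percentage can be computed; the abstract does not state the exact formula the authors used, so the definition here is an assumption, and the function name `percent_bias_reduction` and all numbers are hypothetical illustrations rather than values from the study.

```python
def percent_bias_reduction(unadjusted, adjusted, benchmark):
    """Share of the initial bias removed by adjustment, in percent.

    Bias is taken as the absolute gap between a nonrandomized effect
    estimate and the randomized benchmark (an assumed definition).
    """
    initial_bias = abs(unadjusted - benchmark)
    remaining_bias = abs(adjusted - benchmark)
    if initial_bias == 0:
        raise ValueError("unadjusted estimate already equals the benchmark")
    return 100.0 * (1.0 - remaining_bias / initial_bias)


# Hypothetical effect estimates, not taken from the study.
reduction = percent_bias_reduction(unadjusted=5.0, adjusted=4.2, benchmark=4.0)
print(round(reduction, 1))
```

Under this definition, a value near 100 means the adjusted estimate almost matches the benchmark, and a negative value means the adjustment moved the estimate further away from it.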
Pages: 1334-1343
Page count: 10