Multiple imputation in the presence of non-normal data

被引:52
|
作者
Lee, Katherine J. [1 ,2 ]
Carlin, John B. [1 ,2 ]
机构
[1] Murdoch Childrens Res Inst, Clin Epidemiol & Biostat Unit, Flemington Rd, Melbourne, Vic, Australia
[2] Univ Melbourne, Dept Paediat, Melbourne, Vic, Australia
基金
英国医学研究理事会;
关键词
multiple imputation; missing data; non-normal data; transformation; predictive mean matching;
D O I
10.1002/sim.7173
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Multiple imputation (MI) is becoming increasingly popular for handling missing data. Standard approaches for MI assume normality for continuous variables (conditionally on the other variables in the imputation model). However, it is unclear how to impute non-normally distributed continuous variables. Using simulation and a case study, we compared various transformations applied prior to imputation, including a novel non-parametric transformation, to imputation on the raw scale and using predictive mean matching (PMM) when imputing non-normal data. We generated data from a range of non-normal distributions, and set 50% to missing completely at random or missing at random. We then imputed missing values on the raw scale, following a zero-skewness log, Box-Cox or non-parametric transformation and using PMM with both type 1 and 2 matching. We compared inferences regarding the marginal mean of the incomplete variable and the association with a fully observed outcome. We also compared results from these approaches in the analysis of depression and anxiety symptoms in parents of very preterm compared with term-born infants. The results provide novel empirical evidence that the decision regarding how to impute a non-normal variable should be based on the nature of the relationship between the variables of interest. If the relationship is linear in the untransformed scale, transformation can introduce bias irrespective of the transformation used. However, if the relationship is non-linear, it may be important to transform the variable to accurately capture this relationship. A useful alternative is to impute the variable using PMM with type 1 matching. Copyright (C) 2016 John Wiley & Sons, Ltd.
引用
收藏
页码:606 / 617
页数:12
相关论文
共 50 条
  • [21] SOME RESULTS ON THE BEHAVIOR OF ALTERNATE COVARIANCE STRUCTURE ESTIMATION PROCEDURES IN THE PRESENCE OF NON-NORMAL DATA
    SHARMA, S
    DURVASULA, S
    DILLON, WR
    JOURNAL OF MARKETING RESEARCH, 1989, 26 (02) : 214 - 221
  • [22] Multiple imputation of unordered categorical missing data: A comparison of the multivariate normal imputation and multiple imputation by chained equations
    Karangwa, Innocent
    Kotze, Danelle
    Blignaut, Renette
    BRAZILIAN JOURNAL OF PROBABILITY AND STATISTICS, 2016, 30 (04) : 521 - 539
  • [23] Multiple imputation in the presence of high-dimensional data
    Zhao, Yize
    Long, Qi
    STATISTICAL METHODS IN MEDICAL RESEARCH, 2016, 25 (05) : 2021 - 2035
  • [24] Asymptotically optimal shrinkage estimates for non-normal data
    Withers, Christopher S.
    Nadarajah, Saralees
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2011, 81 (12) : 2021 - 2037
  • [25] Process capability for a non-normal quality characteristics data
    Ahmad, S.
    Abdollahian, M.
    Zeephongsekul, P.
    INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, PROCEEDINGS, 2007, : 420 - +
  • [26] An algorithm for fitting Johnson transformations to non-normal data
    Polansky, AM
    Chou, YM
    Mason, RL
    JOURNAL OF QUALITY TECHNOLOGY, 1999, 31 (03) : 345 - 350
  • [27] Modelinb non-normal data using statistical software
    Minitab, Inc.
    Read R D Community, 2007, 8 (26-27):
  • [28] Two ways of modelling overdispersion in non-normal data
    Lee, Y
    Nelder, JA
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES C-APPLIED STATISTICS, 2000, 49 : 591 - 598
  • [29] Non-normal data: Is ANOVA still a valid option?
    Blanca, Maria J.
    Alarcon, Rafael
    Arnau, Jaume
    Bono, Roser
    Bendayan, Rebecca
    PSICOTHEMA, 2017, 29 (04) : 552 - 557
  • [30] Physical fitness indices for data with non-normal distribution
    Hung, Chang-Hung
    Hsu, Chang-Hsien
    JOURNAL OF INFORMATION & OPTIMIZATION SCIENCES, 2011, 32 (01): : 245 - 254