Fully conditional specification in multivariate imputation

被引:810
|
作者
Van Buuren, S.
Brand, J. P. L.
Groothuis-Oudshoorn, C. G. M.
Rubin, D. B.
机构
[1] TNO Qual Life, Dept Stat, NL-2301 CE Leiden, Netherlands
[2] Univ Utrecht, NL-3508 TC Utrecht, Netherlands
[3] Pennington Biomed Res Ctr, Baton Rouge, LA 70808 USA
[4] Roessingh Res & Dev, Enschede, Netherlands
[5] Harvard Univ, Cambridge, MA 02138 USA
关键词
multivariate missing data; multiple imputation; distributional compatibility; Gibbs sampling; simulation; proper imputation;
D O I
10.1080/10629360600810434
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
The use of the Gibbs sampler with fully conditionally specified models, where the distribution of each variable given the other variables is the starting point, has become a popular method to create imputations in incomplete multivariate data. The theoretical weakness of this approach is that the specified conditional densities can be incompatible, and therefore the stationary distribution to which the Gibbs sampler attempts to converge may not exist. This study investigates practical consequences of this problem by means of simulation. Missing data are created under four different missing data mechanisms. Attention is given to the statistical behavior under compatible and incompatible models. The results indicate that multiple imputation produces essentially unbiased estimates with appropriate coverage in the simple cases investigated, even for the incompatible models. Of particular interest is that these results were produced using only five Gibbs iterations starting from a simple draw from observed marginal distributions. It thus appears that, despite the theoretical weaknesses, the actual performance of conditional model specification for multivariate imputation can be quite good, and therefore deserves further study.
引用
收藏
页码:1049 / 1064
页数:16
相关论文
共 50 条
  • [31] Characterizations involving conditional specification
    Arnold, BC
    JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 1997, 63 (02) : 117 - 131
  • [32] CONDITIONAL MULTIVARIATE CALIBRATION
    FOX, DR
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 1989, 18 (06) : 2311 - 2330
  • [33] Calibrated imputation for multivariate categorical data
    de Waal, Ton
    Daalmans, Jacco
    ASTA-ADVANCES IN STATISTICAL ANALYSIS, 2024, 108 (03) : 545 - 576
  • [34] Multiple imputation in multivariate problems when the imputation and analysis models differ
    Schafer, JL
    STATISTICA NEERLANDICA, 2003, 57 (01) : 19 - 35
  • [35] Multivariate Time Series Imputation With Transformers
    Yildiz, A. Yarkin
    Koc, Emirhan
    Koc, Aykut
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2517 - 2521
  • [36] Multivariate Time Series Imputation With Transformers
    Yildiz, A. Yarkin
    Koc, Emirhan
    Koc, Aykut
    IEEE SIGNAL PROCESSING LETTERS, 2022, 29 : 2517 - 2521
  • [37] Conditional generative learning for medical image imputation
    Ragheb Raad
    Deep Ray
    Bino Varghese
    Darryl Hwang
    Inderbir Gill
    Vinay Duddalwar
    Assad A. Oberai
    Scientific Reports, 14
  • [38] Conditional generative learning for medical image imputation
    Raad, Ragheb
    Ray, Deep
    Varghese, Bino
    Hwang, Darryl
    Gill, Inderbir
    Duddalwar, Vinay
    Oberai, Assad A.
    SCIENTIFIC REPORTS, 2024, 14 (01)
  • [39] Bounded Conditional Mean Imputation with an Approximate Posterior
    Remes, Ulpu
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 3006 - 3010
  • [40] INFORMATION SPECIFICATION IN ATTAINMENT OF CONDITIONAL CONCEPTS
    LAUGHLIN, PR
    JOURNAL OF EXPERIMENTAL PSYCHOLOGY, 1969, 79 (2P1): : 370 - &