Principled missing data methods for researchers

被引:1399
|
作者
Dong, Yiran [1 ]
Peng, Chao-Ying Joanne [1 ]
机构
[1] Indiana Univ, Bloomington, IN 47405 USA
来源
SPRINGERPLUS | 2013年 / 2卷
关键词
Missing data; Listwise deletion; MI; Gamma IML; EM; MAR; MCAR; MNAR; MULTIPLE IMPUTATION; MAXIMUM-LIKELIHOOD; CHAINED EQUATIONS; PERFORMANCE; SOFTWARE; UPDATE; VALUES; STATE;
D O I
10.1186/2193-1801-2-222
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The impact of missing data on quantitative research can be serious, leading to biased estimates of parameters, loss of information, decreased statistical power, increased standard errors, and weakened generalizability of findings. In this paper, we discussed and demonstrated three principled missing data methods: multiple imputation, full information maximum likelihood, and expectation-maximization algorithm, applied to a real-world data set. Results were contrasted with those obtained from the complete data set and from the listwise deletion method. The relative merits of each method are noted, along with common features they share. The paper concludes with an emphasis on the importance of statistical assumptions, and recommendations for researchers. Quality of research will be enhanced if (a) researchers explicitly acknowledge missing data problems and the conditions under which they occurred, (b) principled methods are employed to handle missing data, and (c) the appropriate treatment of missing data is incorporated into review standards of manuscripts submitted for publication.
引用
收藏
页码:1 / 17
页数:17
相关论文
共 50 条
  • [31] Methods for Handling Missing Secondary Respondent Data
    Young, Rebekah
    Johnson, David
    JOURNAL OF MARRIAGE AND FAMILY, 2013, 75 (01) : 221 - 234
  • [32] Comparison of Methods for Handling Missing Covariate Data
    Åsa M. Johansson
    Mats O. Karlsson
    The AAPS Journal, 2013, 15 : 1232 - 1241
  • [33] Handling Missing Data Problems with Sampling Methods
    Houari, Rima
    Bounceur, Ahcene
    Tari, A-Kamel
    Kechadi, M-Tahar
    2014 INTERNATIONAL CONFERENCE ON ADVANCED NETWORKING DISTRIBUTED SYSTEMS AND APPLICATIONS (INDS 2014), 2014, : 99 - 104
  • [34] Effect of missing data on multitask prediction methods
    de Leon, Antonio de la Vega
    Chen, Beining
    Gillet, Valerie J.
    JOURNAL OF CHEMINFORMATICS, 2018, 10
  • [35] Missing data treatment methods and NBI model
    Liu, Peng
    Lei, Lei
    ISDA 2006: SIXTH INTERNATIONAL CONFERENCE ON INTELLIGENT SYSTEMS DESIGN AND APPLICATIONS, VOL 1, 2006, : 633 - 638
  • [36] Imputation of missing longitudinal data: a comparison of methods
    Engels, JM
    Diehr, P
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 2003, 56 (10) : 968 - 976
  • [37] Taxonomy of Missing Data along with their handling Methods
    Tripathi, Ashok Kumar
    Rathee, Geetanjali
    Saini, Hemraj
    2019 FIFTH INTERNATIONAL CONFERENCE ON IMAGE INFORMATION PROCESSING (ICIIP 2019), 2019, : 463 - 468
  • [38] Effect of missing data on multitask prediction methods
    Antonio de la Vega de León
    Beining Chen
    Valerie J. Gillet
    Journal of Cheminformatics, 10
  • [39] Imputation methods for missing data for polygenic models
    Brooke Fridley
    Kari Rabe
    Mariza de Andrade
    BMC Genetics, 4
  • [40] Analyzing Coarsened and Missing Data by Imputation Methods
    van Der Burg, Lars L. J.
    Bohringer, Stefan
    Bartlett, Jonathan W.
    Bosse, Tjalling
    Horeweg, Nanda
    de Wreede, Liesbeth C.
    Putter, Hein
    STATISTICS IN MEDICINE, 2025, 44 (06)