Principled missing data methods for researchers

被引:1399
|
作者
Dong, Yiran [1 ]
Peng, Chao-Ying Joanne [1 ]
机构
[1] Indiana Univ, Bloomington, IN 47405 USA
来源
SPRINGERPLUS | 2013年 / 2卷
关键词
Missing data; Listwise deletion; MI; Gamma IML; EM; MAR; MCAR; MNAR; MULTIPLE IMPUTATION; MAXIMUM-LIKELIHOOD; CHAINED EQUATIONS; PERFORMANCE; SOFTWARE; UPDATE; VALUES; STATE;
D O I
10.1186/2193-1801-2-222
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The impact of missing data on quantitative research can be serious, leading to biased estimates of parameters, loss of information, decreased statistical power, increased standard errors, and weakened generalizability of findings. In this paper, we discussed and demonstrated three principled missing data methods: multiple imputation, full information maximum likelihood, and expectation-maximization algorithm, applied to a real-world data set. Results were contrasted with those obtained from the complete data set and from the listwise deletion method. The relative merits of each method are noted, along with common features they share. The paper concludes with an emphasis on the importance of statistical assumptions, and recommendations for researchers. Quality of research will be enhanced if (a) researchers explicitly acknowledge missing data problems and the conditions under which they occurred, (b) principled methods are employed to handle missing data, and (c) the appropriate treatment of missing data is incorporated into review standards of manuscripts submitted for publication.
引用
收藏
页码:1 / 17
页数:17
相关论文
共 50 条
  • [41] Reducing Missing Data in Surveys: An Overview of Methods
    Edith D. de Leeuw
    Quality and Quantity, 2001, 35 : 147 - 160
  • [42] Methods to impute missing genotypes for population data
    Zhaoxia Yu
    Daniel J. Schaid
    Human Genetics, 2007, 122 : 495 - 504
  • [43] Missing traffic data: comparison of imputation methods
    Li, Yuebiao
    Li, Zhiheng
    Li, Li
    IET INTELLIGENT TRANSPORT SYSTEMS, 2014, 8 (01) : 51 - 57
  • [44] Imputation methods for missing data for polygenic models
    Fridley, B
    Rabe, K
    de Andrade, M
    BMC GENETICS, 2003, 4 (Suppl 1)
  • [45] Methods for interpolating missing data in aerobiological databases
    Picornell, A.
    Oteros, J.
    Ruiz-Mata, R.
    Recio, M.
    Trigo, M. M.
    Martinez-Bracero, M.
    Lara, B.
    Serrano-Garcia, A.
    Galan, C.
    Garcia-Mozo, H.
    Alcazar, P.
    Perez-Badia, R.
    Cabezudo, B.
    Romero-Morte, J.
    Rojo, J.
    ENVIRONMENTAL RESEARCH, 2021, 200
  • [46] The Effects of Model Based Missing Data Methods on Guessing Parameter in Case of Ignorable Missing Data
    Kocak, Duygu
    PEGEM EGITIM VE OGRETIM DERGISI, 2018, 8 (01): : 155 - 171
  • [47] Comparison of Methods for Handling Missing Covariate Data
    Johansson, Asa M.
    Karlsson, Mats O.
    AAPS JOURNAL, 2013, 15 (04): : 1232 - 1241
  • [48] Missing data in cross-sectional networks - An extensive comparison of missing data treatment methods
    Krause, Robert W.
    Huisman, Mark
    Steglich, Christian
    Snijders, Tom
    SOCIAL NETWORKS, 2020, 62 : 99 - 112
  • [49] Comparison of missing data imputation methods using weather data
    Nida, Hafiza
    Kashif, Muhammad
    Khan, Muhammad Imran
    Ghamkhar, Madiha
    PAKISTAN JOURNAL OF AGRICULTURAL SCIENCES, 2023, 60 (02): : 327 - 336
  • [50] Data Collecting Methods and Experiences: A Guide for Social Researchers
    Damle, Jasmine
    INDIAN JOURNAL OF SOCIAL WORK, 2006, 67 (04): : 434 - 437