A Comparison of Full Information Maximum Likelihood and Machine Learning Missing Data Analytical Methods in Growth Curve Modeling

Cited by: 2
Authors
Tang, Dandan [1]
Tong, Xin [1]
Affiliations
[1] Univ Virginia, Dept Psychol, Gilmer Hall, Charlottesville, VA 22903 USA
Source
Keywords
PERFORMANCE; IMPUTATION; CART
DOI
10.1007/978-3-031-55548-0_10
Chinese Library Classification
O1 [Mathematics]
Discipline classification codes
0701; 070101
Abstract
Missing data are inevitable in longitudinal studies. Traditional methods, such as full information maximum likelihood (FIML), are commonly used to handle ignorable missing data. However, they may yield biased model estimates when data are missing not at random, which often occurs in longitudinal studies. Recently, machine learning methods, such as random forest (RF) and K-nearest neighbors (KNN) imputation, have been proposed to cope with missing values. Although machine learning imputation methods have been gaining popularity, few studies have investigated their tenability and utility in longitudinal research. Through Monte Carlo simulations, this chapter evaluates and compares the performance of traditional and machine learning approaches (FIML, RF, and KNN) in growth curve modeling. The effects of sample size, rate of missingness, and missing data mechanism on model estimation are investigated. Results indicate that FIML is a better choice than the two machine learning imputation methods in terms of model estimation accuracy and efficiency.
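To make the kind of simulation described in the abstract concrete, the following is a minimal single-replication sketch, not the chapter's actual design. Everything in it is an assumption for illustration: the population values (mean intercept 5, mean slope 1), the four measurement waves, the missing-completely-at-random deletion rule, the hand-rolled KNN imputation (no library is used), and the helper names `simulate_growth`, `make_mcar`, `knn_impute`, and `mean_ols_slope`. FIML and RF are omitted, and the mean of per-person least-squares slopes stands in for a fitted growth curve model.

```python
import math
import random


def simulate_growth(n, times, seed=0):
    """Simulate linear growth: y_it = intercept_i + slope_i * t + noise."""
    rng = random.Random(seed)
    data = []
    for _ in range(n):
        b0 = rng.gauss(5.0, 1.0)  # assumed latent intercept distribution
        b1 = rng.gauss(1.0, 0.3)  # assumed latent slope distribution
        data.append([b0 + b1 * t + rng.gauss(0.0, 0.5) for t in times])
    return data


def make_mcar(data, rate, seed=1):
    """Delete each value after the first wave with probability `rate`."""
    rng = random.Random(seed)
    return [[y if (j == 0 or rng.random() >= rate) else None
             for j, y in enumerate(row)] for row in data]


def knn_impute(data, k=5):
    """Fill each missing cell with the mean of that wave over the k nearest
    cases, using root mean squared difference on jointly observed waves."""
    def dist(a, b):
        shared = [(x - y) ** 2 for x, y in zip(a, b)
                  if x is not None and y is not None]
        return math.sqrt(sum(shared) / len(shared)) if shared else math.inf

    completed = [row[:] for row in data]
    for i, row in enumerate(data):
        for j, y in enumerate(row):
            if y is not None:
                continue
            # Candidate donors: other cases with wave j observed.
            donors = sorted((dist(row, other), other[j])
                            for m, other in enumerate(data)
                            if m != i and other[j] is not None)
            nearest = [v for d, v in donors[:k] if d < math.inf]
            if nearest:
                completed[i][j] = sum(nearest) / len(nearest)
    return completed


def mean_ols_slope(data, times):
    """Average of per-person least-squares slopes over complete rows."""
    tbar = sum(times) / len(times)
    sxx = sum((t - tbar) ** 2 for t in times)
    slopes = [sum((t - tbar) * y for t, y in zip(times, row)) / sxx
              for row in data]
    return sum(slopes) / len(slopes)


times = [0, 1, 2, 3]
full = simulate_growth(200, times)
holed = make_mcar(full, rate=0.3)
filled = knn_impute(holed, k=5)
print("mean slope, complete data:", round(mean_ols_slope(full, times), 3))
print("mean slope, KNN-imputed:  ", round(mean_ols_slope(filled, times), 3))
```

A full Monte Carlo study along the lines of the abstract would repeat this over many replications while varying sample size, missingness rate, and missingness mechanism, and would summarize bias and efficiency for each method.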
Pages: 99-107
Number of pages: 9