Missing data incremental imputation through tree based methods

Authors
Conversano, C [1]
Cappelli, C [1]
Affiliations
[1] Univ Naples Federico II, Dept Math & Stat, Naples, Italy
Keywords
missing data; data mining; lexicographic order; nonparametric; imputation; tree-based models
DOI
Not available
Chinese Library Classification (CLC) number
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Subject classification codes
020208; 070103; 0714
Abstract
Conditional mean imputation is a common way to deal with missing data. Although very simple to implement, the method can suffer from model misspecification and gives unsatisfactory results for nonlinear data. We propose the iterative use of tree-based models for missing data imputation in large databases. The proposed procedure uses lexicographic order to rank missing values that occur in different variables and deals with them incrementally, i.e., by augmenting the data with the previously filled-in records according to the defined order.
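The incremental idea in the abstract can be illustrated with a minimal sketch. It is not the authors' procedure: the abstract does not spell out the lexicographic ordering or the tree-growing rules, so the sketch assumes that variables are ranked by ascending number of missing values (ties broken by column index) and swaps in a depth-1 regression tree (a stump) for a full tree. The names `fit_stump` and `incremental_impute` are hypothetical, and every column is assumed to have at least one observed value.

```python
from statistics import mean


def fit_stump(rows, targets):
    """Fit a depth-1 regression tree (a stump) by minimising squared error."""
    best = None  # (sse, feature index, threshold, left mean, right mean)
    for j in range(len(rows[0])):
        values = sorted(set(r[j] for r in rows))
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2  # candidate split halfway between neighbours
            left = [y for r, y in zip(rows, targets) if r[j] <= t]
            right = [y for r, y in zip(rows, targets) if r[j] > t]
            ml, mr = mean(left), mean(right)
            sse = (sum((y - ml) ** 2 for y in left)
                   + sum((y - mr) ** 2 for y in right))
            if best is None or sse < best[0]:
                best = (sse, j, t, ml, mr)
    if best is None:  # no usable split: fall back to the overall mean
        m = mean(targets)
        return lambda r: m
    _, j, t, ml, mr = best
    return lambda r: ml if r[j] <= t else mr


def incremental_impute(data):
    """Impute None entries column by column; returns a filled copy.

    Assumption (not from the abstract): columns are processed in ascending
    order of their number of missing values.
    """
    data = [list(r) for r in data]  # work on a copy
    n_cols = len(data[0])
    n_missing = [sum(r[j] is None for r in data) for j in range(n_cols)]
    complete = [j for j in range(n_cols) if n_missing[j] == 0]
    todo = sorted((j for j in range(n_cols) if n_missing[j] > 0),
                  key=lambda j: (n_missing[j], j))
    for j in todo:
        # Train on rows where column j is observed, using only columns that
        # are already complete (fully observed or previously imputed).
        train = [r for r in data if r[j] is not None]
        predict = fit_stump([[r[k] for k in complete] for r in train],
                            [r[j] for r in train])
        for r in data:
            if r[j] is None:
                r[j] = predict([r[k] for k in complete])
        complete.append(j)  # the filled-in column augments later fits
    return data


data = [
    [1.0, 10.0, 100.0],
    [2.0, 12.0, None],
    [3.0, None, 300.0],
    [8.0, 30.0, None],
    [9.0, 32.0, 900.0],
    [10.0, None, None],
]
filled = incremental_impute(data)
```

In this toy data the second column has fewer gaps than the third, so it is imputed first from the fully observed first column; the third column is then imputed from both, which is the "augmenting the data" step. A practical version would likely use a full regression or classification tree in place of the stump.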
Pages: 455-460
Page count: 6