Missing data incremental imputation through tree based methods

Cited by: 0
Authors
Conversano, C [1]
Cappelli, C [1]
Affiliation
[1] Univ Naples Federico II, Dept Math & Stat, Naples, Italy
Keywords
missing data; data mining; lexicographic order; nonparametric; imputation; tree-based models;
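DOI
Not available
Chinese Library Classification
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics];
Discipline classification codes
020208; 070103; 0714;
Abstract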
Conditional mean imputation is a common way to deal with missing data. Although very simple to implement, the method can suffer from model misspecification, and it gives unsatisfactory results for nonlinear data. We propose the iterative use of tree-based models for missing data imputation in large databases. The proposed procedure uses lexicographic order to rank missing values that occur in different variables and deals with them incrementally, i.e., by augmenting the data with the previously filled-in records according to the defined order.
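The abstract does not give the algorithm's details, so the following is only a minimal sketch of the general idea, not the authors' procedure. It assumes numeric variables, missing values coded as `None`, at least one observed value per variable, and a ranking of variables by the tuple (number of missing values, column index) compared lexicographically. The function names `fit_tree`, `predict`, and `incremental_impute`, the tree depth, and the minimum leaf size are all hypothetical choices made for illustration.

```python
from statistics import mean


def fit_tree(X, y, depth=2, min_leaf=2):
    """Fit a small regression tree; returns a nested dict or a leaf mean."""
    if depth == 0 or len(y) < 2 * min_leaf:
        return mean(y)
    best = None  # (sum of squared errors, column, threshold)
    for j in range(len(X[0])):
        # Candidate thresholds: every distinct value except the largest.
        for t in sorted(set(row[j] for row in X))[:-1]:
            left = [yi for row, yi in zip(X, y) if row[j] <= t]
            right = [yi for row, yi in zip(X, y) if row[j] > t]
            if len(left) < min_leaf or len(right) < min_leaf:
                continue
            ml, mr = mean(left), mean(right)
            sse = sum((v - ml) ** 2 for v in left) + sum((v - mr) ** 2 for v in right)
            if best is None or sse < best[0]:
                best = (sse, j, t)
    if best is None:  # no admissible split (or no predictors at all)
        return mean(y)
    _, j, t = best
    lo = [(row, yi) for row, yi in zip(X, y) if row[j] <= t]
    hi = [(row, yi) for row, yi in zip(X, y) if row[j] > t]
    return {
        "j": j,
        "t": t,
        "left": fit_tree([r for r, _ in lo], [v for _, v in lo], depth - 1, min_leaf),
        "right": fit_tree([r for r, _ in hi], [v for _, v in hi], depth - 1, min_leaf),
    }


def predict(tree, row):
    """Walk down the tree until a leaf (a plain number) is reached."""
    while isinstance(tree, dict):
        tree = tree["left"] if row[tree["j"]] <= tree["t"] else tree["right"]
    return tree


def incremental_impute(data):
    """Fill None entries one variable at a time; returns a filled copy."""
    filled = [list(r) for r in data]
    p = len(filled[0])
    n_missing = [sum(r[j] is None for r in filled) for j in range(p)]
    # Assumed ranking: tuples (missing count, column index), compared lexicographically.
    order = sorted(range(p), key=lambda j: (n_missing[j], j))
    complete = [j for j in order if n_missing[j] == 0]
    for j in order:
        if n_missing[j] == 0:
            continue
        # Train on rows where variable j is observed, using only columns
        # that are already complete (observed or previously imputed).
        train = [r for r in filled if r[j] is not None]
        tree = fit_tree([[r[k] for k in complete] for r in train],
                        [r[j] for r in train])
        for r in filled:
            if r[j] is None:
                r[j] = predict(tree, [r[k] for k in complete])
        # The filled-in variable now augments the data for later variables.
        complete.append(j)
    return filled


example = [
    [1.0, 10.0, 100.0],
    [2.0, 20.0, None],
    [3.0, None, 300.0],
    [4.0, 40.0, 400.0],
    [5.0, 50.0, None],
    [6.0, 60.0, 600.0],
]
result = incremental_impute(example)
```

In this sketch the second column (one missing value) is imputed first from the fully observed first column; the third column (two missing values) is then imputed using both, so the earlier fill-in takes part in the later fit. A real application would likely need categorical variables, pruning, and a missing-data pattern ordering over rows as well, none of which is attempted here.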
Pages: 455-460
Number of pages: 6