Missing data incremental imputation through tree based methods

被引:0
|
作者
Conversano, C [1 ]
Cappelli, C [1 ]
机构
[1] Univ Naples Federico II, Dept Math & Stat, Naples, Italy
关键词
missing data; data mining; lexicographic order; nonparametric; imputation; tree-based models;
D O I
暂无
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Conditional mean imputation is a common way to deal with missing data. Although very simple to implement, the method might suffer from model misspecification and it results unsatisfactory for non linear data. We propose the iterative use of tree based models for missing data imputation in large data bases. The proposed procedure uses lexicographic order to rank missing values that occur in different variables and deals with these incrementally, i.e, augmenting the data by the previously filled in records according to the defined order.
引用
收藏
页码:455 / 460
页数:6
相关论文
共 50 条
  • [1] Boosted incremental tree-based imputation of missing data
    Siciliano, Roberta
    Aria, Massimo
    D'Ambrosio, Antonio
    DATA ANALYSIS, CLASSIFICATION AND THE FORWARD SEARCH, 2006, : 271 - +
  • [2] Incremental Tree-Based Missing Data Imputation with Lexicographic Ordering
    Conversano, Claudio
    Siciliano, Roberta
    JOURNAL OF CLASSIFICATION, 2009, 26 (03) : 361 - 379
  • [3] Incremental Tree-Based Missing Data Imputation with Lexicographic Ordering
    Claudio Conversano
    Roberta Siciliano
    Journal of Classification, 2009, 26 : 361 - 379
  • [4] Tree-based Approach to Missing Data Imputation
    Vateekul, Peerapon
    Sarinnapakorn, Kanoksri
    2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 70 - +
  • [5] Missing Data and Imputation Methods
    Schober, Patrick
    Vetter, Thomas R.
    ANESTHESIA AND ANALGESIA, 2020, 131 (05): : 1419 - 1420
  • [6] Robust tree-based incremental imputation method for data fusion
    D'Ambrosio, Antonio
    Aria, Massimo
    Siciliano, Roberta
    ADVANCES IN INTELLIGENT DATA ANALYSIS VII, PROCEEDINGS, 2007, 4723 : 174 - +
  • [7] Indirect methods of imputation of missing data based on available units
    Rueda, MM
    González, S
    Arcos, A
    APPLIED MATHEMATICS AND COMPUTATION, 2005, 164 (01) : 249 - 261
  • [8] MIDA: a Web Tool for MIssing DAta Imputation based on a Boosted and Incremental Learning Algorithm
    Acampora, Giovanni
    Vitiello, Autilia
    Siciliano, Roberta
    2020 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2020,
  • [9] Missing data and imputation methods in partition of variables
    da Silva, AL
    Saporta, G
    Bacelar-Nicolau, H
    CLASSIFICATION, CLUSTERING, AND DATA MINING APPLICATIONS, 2004, : 631 - 637
  • [10] Imputation of missing longitudinal data: a comparison of methods
    Engels, JM
    Diehr, P
    JOURNAL OF CLINICAL EPIDEMIOLOGY, 2003, 56 (10) : 968 - 976