Missing data incremental imputation through tree based methods

被引：0

作者：

Conversano, C ^{[1
]}

Cappelli, C ^{[1
]}

机构：

[1] Univ Naples Federico II, Dept Math & Stat, Naples, Italy

来源：

COMPSTAT 2002: PROCEEDINGS IN COMPUTATIONAL STATISTICS | 2002年

关键词：

missing data; data mining; lexicographic order; nonparametric; imputation; tree-based models;

D O I：

暂无

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

Conditional mean imputation is a common way to deal with missing data. Although very simple to implement, the method might suffer from model misspecification and it results unsatisfactory for non linear data. We propose the iterative use of tree based models for missing data imputation in large data bases. The proposed procedure uses lexicographic order to rank missing values that occur in different variables and deals with these incrementally, i.e, augmenting the data by the previously filled in records according to the defined order.

引用

页码：455 / 460

页数：6

共 50 条

[41] Framework for regression-based missing data imputation methods in on-line MSPC
Arteaga, F
Ferrer, A
JOURNAL OF CHEMOMETRICS, 2005, 19 (08) : 439 - 447
[42] From predictive methods to missing data imputation: An optimization approach
2018, Microtome Publishing (18)
[43] Adaptive pairing of classifier and imputation methods based on the characteristics of missing values in data sets
Sim, Jaemun
Kwon, Ohbyung
Lee, Kun Chang
EXPERT SYSTEMS WITH APPLICATIONS, 2016, 46 : 485 - 493
[44] Comparison of imputation and imputation-free methods for statistical analysis of mass spectrometry data with missing data
Taylor, Sandra
Ponzini, Matthew
Wilson, Machelle
Kim, Kyoungmi
BRIEFINGS IN BIOINFORMATICS, 2022, 23 (01)
[45] Missing data imputation using fuzzy-rough methods
Amiri, Mehran
Jensen, Richard
NEUROCOMPUTING, 2016, 205 : 152 - 164
[46] Evaluation of missing data imputation methods for human osteometric measurements
Liu, Xiaoming
Pang, Jinyong
AMERICAN JOURNAL OF BIOLOGICAL ANTHROPOLOGY, 2024, 183 : 103 - 104
[47] Methods for imputation of missing values in air quality data sets
Junninen, H
Niska, H
Tuppurainen, K
Ruuskanen, J
Kolehmainen, M
ATMOSPHERIC ENVIRONMENT, 2004, 38 (18) : 2895 - 2907
[48] Imputation Methods for Handling Missing Dietary Supplement Dosage Data
Leung, June
Dwyer, Johanna
Hibberd, Patricia
Jacques, Paul
Rand, William
JOURNAL OF RENAL NUTRITION, 2010, 20 (05) : 342 - 347
[49] Optimization methods for the imputation of missing values in Educational Institutions Data
Aureli, D.
Bruni, R.
Daraio, C.
METHODSX, 2021, 8
[50] From Predictive Methods to Missing Data Imputation: An Optimization Approach
Bertsimas, Dimitris
Pawlowski, Colin
Zhuo, Ying Daisy
JOURNAL OF MACHINE LEARNING RESEARCH, 2018, 18

← 1 2 3 4 5 →