Modeling naive bayes imputation classification for missing data

被引：1

作者：

Khotimah, B. K. ^{[1
,3
]}

Miswanto ^{[1
,2
]}

Suprajitno, H. ^{[1
,2
]}

机构：

[1] Univ Airlangga, Fac Sci & Technol, Surabaya, Indonesia

[2] Univ Airlangga, Dept Math, Surabaya, Indonesia

[3] Univ Trunojoyo Madura, Dept Informat Engn, Bangkalan, Indonesia

来源：

FIRST INTERNATIONAL CONFERENCE ON ENVIRONMENTAL GEOGRAPHY AND GEOGRAPHY EDUCATION (ICEGE) | 2019年 / 243卷

关键词：

VALUES;

D O I：

10.1088/1755-1315/243/1/012111

中图分类号：

G40 [教育学];

学科分类号：

040101 ; 120403 ;

摘要：

Naive Bayes Imputation (NBI) is used to fill in missing values by replacing the attribute information according to the probability estimate. The NBI process divides the whole data into two sub-sets is the complete data and data containing missing data. Complete data is used for the imputation process at the lost value. The process is repeated for each missing attribute to generate complete data for classification. This research applies NBI for imputation and preprocessing as preparation of classification process. The trial of this study used NBI for imputation compared to using the mean and mode to predict the missing data. The data used for imputation is full train of complete data as a whole to predict the missing value so as to represent the entire data. The results of this study prove that imputation with NBI produces the right imputation with higher accuracy than other imputations. NBI with single imputation and multiple imputation results in better performance because of the right features. This study aims to calculate the effect of missing values on Naive Bayes Imputation Algorithm is based on a probalistic model using mixed data. Empirically shows that the interaction between several methods of imputation and supervised classification results in differences in the performance of classification for the same imputation method.

引用

页数：10

共 50 条

[41] Dynamic cost-sensitive naive bayes classification for uncertain data
Huang, Yuwen
International Journal of Database Theory and Application, 2015, 8 (01): : 271 - 280
[42] Fast Feature Selection for Naive Bayes Classification in Data Stream Mining
Lutu, Patricia E. N.
WORLD CONGRESS ON ENGINEERING - WCE 2013, VOL III, 2013, : 1549 - 1554
[43] Incorporating receiver operating characteristics into naive Bayes for unbalanced data classification
Taeheung Kim
Byung Do Chung
Jong-Seok Lee
Computing, 2017, 99 : 203 - 218
[44] Comparison of SVM and Naive Bayes for Sentiment Classification using BERT data
Rana, Shivani
Kanji, Rakesh
Jain, Shruti
2022 5TH INTERNATIONAL CONFERENCE ON MULTIMEDIA, SIGNAL PROCESSING AND COMMUNICATION TECHNOLOGIES (IMPACT), 2022,
[45] A sequential feature extraction approach for naive bayes classification of microarray data
Fan, Liwei
Poh, Kim-Leng
Zhou, Peng
EXPERT SYSTEMS WITH APPLICATIONS, 2009, 36 (06) : 9919 - 9923
[46] Missing Data: data replacement and imputation
Hutcheson, Graeme
Pampaka, Maria
JOURNAL OF MODELLING IN MANAGEMENT, 2012, 7 (02)
[47] Missing Data and Imputation Methods
Schober, Patrick
Vetter, Thomas R.
ANESTHESIA AND ANALGESIA, 2020, 131 (05): : 1419 - 1420
[48] Missing Data and Multiple Imputation
Cummings, Peter
JAMA PEDIATRICS, 2013, 167 (07) : 656 - 661
[49] Missing Data Imputation: A Survey
Kelkar, Bhagyashri Abhay
INTERNATIONAL JOURNAL OF DECISION SUPPORT SYSTEM TECHNOLOGY, 2022, 14 (01)
[50] An Improvement to Naive Bayes for Text Classification
Zhang, Wei
Gao, Feng
CEIS 2011, 2011, 15

← 1 2 3 4 5 →