Ordering attributes for missing values prediction and data classification

被引:0
|
作者
Hruschka, ER [1 ]
Ebecken, NFF [1 ]
机构
[1] Univ Fed Rio de Janeiro, COPPE, Rio De Janeiro, Brazil
来源
DATA MINING III | 2002年 / 6卷
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work shows the application of the bayesian K2 learning algorithm as a data classifier and preprocessor having an attribute order searcher to improve the results. One of the aspects that have influence on the K2 performance is the initial order of the attributes in the data set, however, in most cases, this algorithm is applied without giving special attention to this preorder, The present work performs an empirical method to select an appropriate attribute order, before applying the learning algorithm (K2). Afterwards, it does the data preparation and classification tasks. In order to analyze the results, in a first step, the data classification. is done without considering the initial order of the attributes. Thereafter it seeks for a good variable order, and having the sequence of the attributes, the classification is performed again. Once these results are obtained, the same algorithm is used to substitute missing values in the learning dataset in order to verify how the process works in this kind of task. The dataset used came from the standard classification problems databases from UCI Machine Learning Repository. The results are empirically compared taking into consideration the mean and standard deviation.
引用
收藏
页码:593 / 601
页数:9
相关论文
共 50 条
  • [41] Exploiting nearest neighbor data and fuzzy membership function to address missing values in classification
    Muludi, Kurnia
    Setianingsih, Revita
    Sholehurrohman, Ridho
    Junaidi, Akmal
    PEERJ COMPUTER SCIENCE, 2024, 10
  • [42] NONPARAMETRIC CLASSIFICATION WITH MISSING DATA
    Sell, Torben
    Berrett, Thomas b.
    Cannings, Timothy i.
    ANNALS OF STATISTICS, 2024, 52 (03): : 1178 - 1200
  • [43] Traffic congestion prediction and missing data: a classification approach using weather information
    Mystakidis, Aristeidis
    Tjortjis, Christos
    INTERNATIONAL JOURNAL OF DATA SCIENCE AND ANALYTICS, 2024,
  • [44] On classification with nonignorable missing data
    Mojirsheibani, Majid
    JOURNAL OF MULTIVARIATE ANALYSIS, 2021, 184
  • [45] Filling the missing values of pregnancy examination data to improve the prediction of gestational diabetes mellitus
    Lu, Xinxi
    Wang, Jikai
    Gao, Ye
    ASIA-PACIFIC JOURNAL OF CLINICAL ONCOLOGY, 2022, 18 : 35 - 35
  • [46] ANALYSIS OF DATA WITH MISSING VALUES - COMMENTARY
    LITTLE, RJA
    STATISTICS IN MEDICINE, 1988, 7 (1-2) : 347 - 355
  • [47] Missing values in monotone data sets
    Popova, Viara
    ISDA 2006: Sixth International Conference on Intelligent Systems Design and Applications, Vol 1, 2006, : 627 - 632
  • [48] SPECTRA FROM DATA WITH MISSING VALUES
    HARRIS, RW
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 1987, 1 (01) : 97 - 104
  • [49] Handling missing values in trait data
    Johnson, Thomas F.
    Isaac, Nick J. B.
    Paviolo, Agustin
    Gonzalez-Suarez, Manuela
    GLOBAL ECOLOGY AND BIOGEOGRAPHY, 2021, 30 (01): : 51 - 62
  • [50] Analyzing Longitudinal Data With Missing Values
    Enders, Craig K.
    REHABILITATION PSYCHOLOGY, 2011, 56 (04) : 267 - 288