Data decomposition and decision rule joining for classification of data with missing values

被引:0
|
作者
Latkowski, R
Mikolajczyk, M
机构
[1] Warsaw Univ, Inst Comp Sci, PL-02097 Warsaw, Poland
[2] Warsaw Univ, Inst Math, PL-02097 Warsaw, Poland
来源
TRANSACTIONS ON ROUGH SETS I | 2004年 / 3100卷
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper we present a new approach to handling incomplete information and classifier complexity reduction. We describe a method, called D(3)RJ, that performs data decomposition and decision rule joining to avoid the necessity of reasoning with missing attribute values. In the consequence more complex reasoning process is needed than in the case of known algorithms for induction of decision rules. The original incomplete data table is decomposed into sub-tables without missing values. Next, methods for induction of decision rules are applied to these sets. Finally, an algorithm for decision rule joining is used to obtain the final rule set from partial rule sets. Using D(3)RJ method it is possible to obtain smaller set of rules and next better classification accuracy than classic decision rule induction methods. We provide an empirical evaluation of the D(3)RJ method accuracy and model size on data with missing values of natural origin.
引用
收藏
页码:299 / 320
页数:22
相关论文
共 50 条
  • [1] Data decomposition and decision rule joining for classification of data with missing values
    Latkowski, R
    Mikolajczyk, M
    ROUGH SETS AND CURRENT TRENDS IN COMPUTING, 2004, 3066 : 254 - 263
  • [2] Decision-rule solutions for data mining with missing values
    Weiss, SM
    Indurkhya, N
    ADVANCES IN ARTIFICIAL INTELLIGENCE, 2000, 1952 : 1 - 10
  • [3] A Modified Algorithm for Missing Values in Data Stream Decision Tree Classification
    Hou, Xu-shan
    Lv, Pin
    Wang, Hao
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND SOFTWARE ENGINEERING (AISE 2014), 2014, : 307 - 313
  • [4] Ordering attributes for missing values prediction and data classification
    Hruschka, ER
    Ebecken, NFF
    DATA MINING III, 2002, 6 : 593 - 601
  • [5] Rough Set Analysis of Classification Data with Missing Values
    Szelag, Marcin
    Blaszczynski, Jerzy
    Slowinski, Roman
    ROUGH SETS, 2017, 10313 : 552 - 565
  • [6] Visualization of the critical patterns of missing values in classification data
    Wang, Hai
    Wang, Shouhong
    ADVANCES IN VISUAL INFORMATION SYSTEMS, 2007, 4781 : 267 - +
  • [7] Fast Imbalanced Classification of Healthcare Data with Missing Values
    Razzaghi, Talayeh
    Roderick, Oleg
    Safro, Ilya
    Marko, Nick
    2015 18TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION), 2015, : 774 - 781
  • [8] Flexible decision tree for data stream classification in the presence of concept change, noise and missing values
    Sattar Hashemi
    Ying Yang
    Data Mining and Knowledge Discovery, 2009, 19 : 95 - 131
  • [9] Flexible decision tree for data stream classification in the presence of concept change, noise and missing values
    Hashemi, Sattar
    Yang, Ying
    DATA MINING AND KNOWLEDGE DISCOVERY, 2009, 19 (01) : 95 - 131
  • [10] Incremental singular value decomposition of uncertain data with missing values
    Brand, M
    COMPUTER VISON - ECCV 2002, PT 1, 2002, 2350 : 707 - 720