Data decomposition and decision rule joining for classification of data with missing values

被引:0
|
作者
Latkowski, R
Mikolajczyk, M
机构
[1] Warsaw Univ, Inst Comp Sci, PL-02097 Warsaw, Poland
[2] Warsaw Univ, Inst Math, PL-02097 Warsaw, Poland
来源
TRANSACTIONS ON ROUGH SETS I | 2004年 / 3100卷
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In this paper we present a new approach to handling incomplete information and classifier complexity reduction. We describe a method, called D(3)RJ, that performs data decomposition and decision rule joining to avoid the necessity of reasoning with missing attribute values. In the consequence more complex reasoning process is needed than in the case of known algorithms for induction of decision rules. The original incomplete data table is decomposed into sub-tables without missing values. Next, methods for induction of decision rules are applied to these sets. Finally, an algorithm for decision rule joining is used to obtain the final rule set from partial rule sets. Using D(3)RJ method it is possible to obtain smaller set of rules and next better classification accuracy than classic decision rule induction methods. We provide an empirical evaluation of the D(3)RJ method accuracy and model size on data with missing values of natural origin.
引用
收藏
页码:299 / 320
页数:22
相关论文
共 50 条
  • [31] On rule acquisition methods for data classification in heterogeneous incomplete decision systems
    Meng, Zuqiang
    Shi, Zhongzhi
    KNOWLEDGE-BASED SYSTEMS, 2020, 193
  • [32] Accounting for missing data in monthly temperature series: Testing rule-of-thumb omission of months with missing values
    Anderson, Conor I.
    Gough, William A.
    INTERNATIONAL JOURNAL OF CLIMATOLOGY, 2018, 38 (13) : 4990 - 5002
  • [33] On-Line Classification of Data Streams with Missing Values Based on Reinforcement Learning
    Millan-Giraldo, Monica
    Javier Traver, Vicente
    Salvador Sanchez, J.
    PATTERN RECOGNITION AND IMAGE ANALYSIS: 5TH IBERIAN CONFERENCE, IBPRIA 2011, 2011, 6669 : 355 - 362
  • [34] Test-Cost Sensitive Classification on Data with Missing Values in the Limited Time
    Wan, Chang
    KNOWLEDGE-BASED AND INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT I, 2010, 6276 : 501 - 510
  • [35] Multilevel Weighted Support Vector Machine for Classification on Healthcare Data with Missing Values
    Razzaghi, Talayeh
    Roderick, Oleg
    Safro, Ilya
    Marko, Nicholas
    PLOS ONE, 2016, 11 (05):
  • [36] ANALYSIS OF DATA WITH MISSING VALUES - COMMENTARY
    LITTLE, RJA
    STATISTICS IN MEDICINE, 1988, 7 (1-2) : 347 - 355
  • [37] Missing values in monotone data sets
    Popova, Viara
    ISDA 2006: Sixth International Conference on Intelligent Systems Design and Applications, Vol 1, 2006, : 627 - 632
  • [38] SPECTRA FROM DATA WITH MISSING VALUES
    HARRIS, RW
    MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 1987, 1 (01) : 97 - 104
  • [39] Handling missing values in trait data
    Johnson, Thomas F.
    Isaac, Nick J. B.
    Paviolo, Agustin
    Gonzalez-Suarez, Manuela
    GLOBAL ECOLOGY AND BIOGEOGRAPHY, 2021, 30 (01): : 51 - 62
  • [40] Analyzing Longitudinal Data With Missing Values
    Enders, Craig K.
    REHABILITATION PSYCHOLOGY, 2011, 56 (04) : 267 - 288