Data decomposition and decision rule joining for classification of data with missing values

Cited by: 0
Authors
Latkowski, R
Mikolajczyk, M
Affiliations
[1] Warsaw Univ, Inst Comp Sci, PL-02097 Warsaw, Poland
[2] Warsaw Univ, Math Inst, PL-02097 Warsaw, Poland
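Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Discipline classification codes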
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
In this paper we present a new approach to handling incomplete information and reducing classifier complexity. We describe a method, called D(3)RJ, that performs data decomposition and decision rule joining to avoid the necessity of reasoning with missing attribute values. As a consequence, a more complex reasoning process is needed than with known algorithms for induction of decision rules. The original incomplete data table is decomposed into sub-tables without missing values. Next, methods for induction of decision rules are applied to these sub-tables. Finally, an algorithm for decision rule joining is used to obtain the final rule set from the partial rule sets. With the D(3)RJ method it is possible to obtain a smaller set of rules and, subsequently, better classification accuracy than with standard decision rule induction methods. We provide an empirical evaluation of the accuracy and model size of the D(3)RJ method on data with naturally occurring missing values.
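The decomposition step mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' algorithm: the abstract does not say how sub-tables are chosen, so the sketch simply assumes that rows are grouped by the exact set of attributes that are present. The names `MISSING`, `decompose`, and the toy `table` are hypothetical and exist only for this illustration.

```python
from collections import defaultdict

MISSING = None  # assumed marker for a missing attribute value


def decompose(rows):
    """Split an incomplete table into sub-tables without missing values.

    Assumption (not taken from the paper): rows are grouped by the exact
    set of attributes that are present, and each group is projected onto
    those attributes, so every resulting sub-table is complete.
    """
    subtables = defaultdict(list)
    for attrs, decision in rows:
        # The pattern is the sorted tuple of attribute names with known values.
        pattern = tuple(sorted(k for k, v in attrs.items() if v is not MISSING))
        complete = {k: attrs[k] for k in pattern}
        subtables[pattern].append((complete, decision))
    return dict(subtables)


# Toy table: each row is (attribute values, decision).
table = [
    ({"a": 1, "b": MISSING}, "yes"),
    ({"a": 2, "b": 3}, "no"),
    ({"a": 4, "b": MISSING}, "no"),
]
subtables = decompose(table)
```

Each sub-table could then be passed to any standard rule induction method. The rule joining step that merges the partial rule sets is not sketched here, because the abstract gives no detail about it.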
Pages: 254 - 263
Number of pages: 10
Related papers
50 in total
  • [21] Classification of multivariate data with missing values using expected discriminant scores
    Kossa, W
    BETWEEN DATA SCIENCE AND APPLIED DATA ANALYSIS, 2003, : 570 - 577
  • [22] MULTICLASS SUPPORT VECTOR MACHINES FOR CLASSIFICATION OF ECG DATA WITH MISSING VALUES
    Hejazi, Maryamsadat
    Al-Haddad, S. A. R.
    Singh, Yashwant Prasad
    Hashim, Shaiful Jahari
    Aziz, Ahmad Fazli Abdul
    APPLIED ARTIFICIAL INTELLIGENCE, 2015, 29 (07) : 660 - 674
  • [23] Effectiveness of Simple Data Imputation for Missing Feature Values in Binary Classification
    Chatterjee, A.
    Woodruff, H.
    Lobbes, M.
    van Wijk, Y.
    Beuque, M.
    Seuntjens, J.
    Lambin, P.
    MEDICAL PHYSICS, 2020, 47 (06) : E609 - E609
  • [24] The association rule algorithm with missing data in data mining
    Gerardo, BD
    Lee, J
    Lee, J
    Park, M
    Lee, M
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2004, PT 1, 2004, 3043 : 97 - 105
  • [25] Adapting Aerial Root Classifier Missing Data Processor in Data Stream Decision Tree Classification
    Lachiheb, Oussama
    Gouider, Mohamed Salah
    MODEL AND DATA ENGINEERING, MEDI 2014, 2014, 8748 : 92 - 99
  • [26] Generalized multiresolution decomposition frameworks for the analysis of industrial data with uncertainty and missing values
    Reis, Marco S.
    Saraiva, Pedro M.
    INDUSTRIAL & ENGINEERING CHEMISTRY RESEARCH, 2006, 45 (18) : 6330 - 6338
  • [27] A methodology for quantifying the effect of missing data on decision quality in classification problems
    Feldman, Michael
    Even, Adir
    Parmet, Yisrael
    COMMUNICATIONS IN STATISTICS-THEORY AND METHODS, 2018, 47 (11) : 2643 - 2663
  • [28] NONPARAMETRIC CLASSIFICATION WITH MISSING DATA
    Sell, Torben
    Berrett, Thomas B.
    Cannings, Timothy I.
    ANNALS OF STATISTICS, 2024, 52 (03) : 1178 - 1200
  • [29] On classification with nonignorable missing data
    Mojirsheibani, Majid
    JOURNAL OF MULTIVARIATE ANALYSIS, 2021, 184
  • [30] Chronic hepatitis and cirrhosis classification using SNP data, decision tree and decision rule
    Kim, Dong-Hoi
    Uhmn, Saangyong
    Ko, Young-Woong
    Cho, Sung Won
    Cheong, Jae Youn
    Kim, Jin
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2007, PT 3, PROCEEDINGS, 2007, 4707 : 585 - +