Data decomposition and decision rule joining for classification of data with missing values

Cited by: 0
Authors
Latkowski, R
Mikolajczyk, M
Affiliations
[1] Warsaw Univ, Inst Comp Sci, PL-02097 Warsaw, Poland
[2] Warsaw Univ, Inst Math, PL-02097 Warsaw, Poland
Source
TRANSACTIONS ON ROUGH SETS I | 2004 / Vol. 3100
Keywords
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202 ;
Abstract
In this paper we present a new approach to handling incomplete information and reducing classifier complexity. We describe a method, called D³RJ, that performs data decomposition and decision rule joining to avoid the necessity of reasoning with missing attribute values. As a consequence, a more complex reasoning process is needed than in the case of known algorithms for induction of decision rules. The original incomplete data table is decomposed into sub-tables without missing values. Next, methods for induction of decision rules are applied to these sub-tables. Finally, an algorithm for decision rule joining is used to obtain the final rule set from the partial rule sets. Using the D³RJ method, it is possible to obtain a smaller set of rules and, in turn, better classification accuracy than with classic decision rule induction methods. We provide an empirical evaluation of the accuracy and model size of the D³RJ method on data with missing values of natural origin.
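The three stages named in the abstract (decomposition, rule induction, rule joining) can be illustrated with a minimal Python sketch. It is not the authors' algorithm: the abstract does not specify how sub-tables are chosen, how rules are induced, or how they are joined. The sketch assumes, purely for illustration, that rows are grouped by their pattern of known attributes, that a toy majority-vote procedure stands in for rule induction, and that two rules are joined when they share a decision and differ on exactly one attribute. All function names and the `MISSING` marker are hypothetical.

```python
from collections import Counter, defaultdict

MISSING = None  # assumed marker for a missing attribute value


def decompose(rows, attributes):
    """Group rows by the tuple of attributes whose values are known.

    Each group, restricted to its known attributes, is a sub-table
    without missing values (an assumed decomposition strategy).
    """
    subtables = defaultdict(list)
    for row in rows:
        known = tuple(a for a in attributes if row[a] is not MISSING)
        subtables[known].append(row)
    return dict(subtables)


def induce_rules(subtable, known, decision):
    """Toy stand-in for rule induction: one rule per distinct pattern of
    condition values, predicting the majority decision for that pattern."""
    votes = defaultdict(Counter)
    for row in subtable:
        votes[tuple(row[a] for a in known)][row[decision]] += 1
    rules = []
    for pattern, counter in votes.items():
        conditions = {a: frozenset([v]) for a, v in zip(known, pattern)}
        rules.append((conditions, counter.most_common(1)[0][0]))
    return rules


def join_rules(rules):
    """Merge pairs of rules that have the same decision and the same
    attributes and differ on exactly one attribute (assumed criterion).
    The merged rule accepts the union of values on that attribute."""
    rules = list(rules)
    merged = True
    while merged:
        merged = False
        for i in range(len(rules)):
            for j in range(i + 1, len(rules)):
                (ci, di), (cj, dj) = rules[i], rules[j]
                if di != dj or ci.keys() != cj.keys():
                    continue
                diff = [a for a in ci if ci[a] != cj[a]]
                if len(diff) == 1:
                    joined = dict(ci)
                    joined[diff[0]] = ci[diff[0]] | cj[diff[0]]
                    rules[i] = (joined, di)
                    del rules[j]
                    merged = True
                    break
            if merged:
                break
    return rules


def d3rj_sketch(rows, attributes, decision):
    """Run the three stages in order and return the joined rule set."""
    partial = []
    for known, subtable in decompose(rows, attributes).items():
        partial.extend(induce_rules(subtable, known, decision))
    return join_rules(partial)
```

Calling `d3rj_sketch(rows, ["a", "b"], "d")` on a list of dictionaries returns rules as `(conditions, decision)` pairs, where each condition maps an attribute to the set of values it accepts. Representing conditions as value sets is what lets two rules collapse into one, which is how this sketch reduces the size of the rule set.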
Pages: 299 - 320
Page count: 22
Related papers
50 in total
  • [41] ANALYSIS OF DATA WITH MISSING VALUES - DISCUSSION
    HELMS, RW
    LAIRD, NM
    LEBOWITZ, MD
    MANTEL, N
    LOUIS, TA
    WU, M
    STATISTICS IN MEDICINE, 1988, 7 (1-2) : 357 - 360
  • [42] ANOVA FOR LONGITUDINAL DATA WITH MISSING VALUES
    Chen, Song Xi
    Zhong, Ping-Shou
    ANNALS OF STATISTICS, 2010, 38 (06) : 3630 - 3659
  • [43] Dealing with missing values in proteomics data
    Kong, Weijia
    Hui, Harvard Wai Hann
    Peng, Hui
    Bin Goh, Wilson Wen
    PROTEOMICS, 2022, 22 (23-24)
  • [44] Dealing with Missing Values in Microarray Data
    Mohammadi, Azadeh
    Saraee, Mohammad Hossein
    2008 INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES, PROCEEDINGS, 2008, : 258 - 263
  • [45] Variational Mode Decomposition with Missing Data
    Choi, Guebin
    Oh, Hee-Seok
    Lee, Youngjo
    Kim, Donghoh
    Yu, Kyungsang
    KOREAN JOURNAL OF APPLIED STATISTICS, 2015, 28 (02) : 159 - 174
  • [46] Using association rule for missing data imputation
    Wu, Jianhua
    Song, Qinbao
    Shen, Junyi
    Journal of Information and Computational Science, 2007, 4 (04) : 1155 - 1161
  • [47] Dealing with missing values in a probabilistic decision tree during classification
    Hawarah, Lamis
    Simonet, Ana
    Simonet, Michel
    ICDM 2006: SIXTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, WORKSHOPS, 2006, : 325 - +
  • [48] Replacing missing values using trustworthy data values from web data sources
    Jaya, M. Izham
    Sidi, Fatimah
    Yusof, Sharmila Mat
    Affendey, Lilly Suriani
    Ishak, Iskandar
    Jabar, Marzanah A.
    6TH INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND COMPUTATIONAL MATHEMATICS (ICCSCM 2017), 2017, 892
  • [49] Integrating MODIS and Landsat Data for Land Cover Classification by Multilevel Decision Rule
    Guan, Xudong
    Huang, Chong
    Zhang, Rui
    LAND, 2021, 10 (02) : 1 - 18
  • [50] Exploiting nearest neighbor data and fuzzy membership function to address missing values in classification
    Muludi, Kurnia
    Setianingsih, Revita
    Sholehurrohman, Ridho
    Junaidi, Akmal
    PEERJ COMPUTER SCIENCE, 2024, 10