Rough Set Methods for Attribute Clustering and Selection

被引:46
|
作者
Janusz, Andrzej [1 ]
Slezak, Dominik [1 ,2 ]
机构
[1] Univ Warsaw, Inst Math, PL-02097 Warsaw, Poland
[2] Infobright Inc, Warsaw, Poland
关键词
CLASSIFIERS; DIAGNOSIS;
D O I
10.1080/08839514.2014.883902
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study we investigate methods for attribute clustering and their possible applications to the task of computation of decision reducts from information systems. We focus on high-dimensional datasets, that is, microarray data. For this type of data, the traditional reduct construction techniques either can be extremely computationally intensive or can yield poor performance in terms of the size of the resulting reducts. We propose two reduct computation heuristics that combine the greedy search with a diverse selection of candidate attributes. Our experiments confirm that by proper grouping of similar-in some sense interchangeable-attributes, it is possible to significantly decrease computation time, as well as to increase a quality of the obtained reducts (i.e., to decrease their average size). We examine several criteria for attribute clustering, and we also identify so-called garbage clusters, which contain attributes that can be regarded as irrelevant.
引用
收藏
页码:220 / 242
页数:23
相关论文
共 50 条
  • [1] ROUGH SET THEORY FOR SELECTING CLUSTERING ATTRIBUTE
    Herawan, Tutut
    Dens, Mustafa Mat
    POWER CONTROL AND OPTIMIZATION, PROCEEDINGS, 2009, 1159 : 331 - 338
  • [2] A rough set approach for selecting clustering attribute
    Herawan, Tutut
    Deris, Mustafa Mat
    Abawajy, Jemal H.
    KNOWLEDGE-BASED SYSTEMS, 2010, 23 (03) : 220 - 231
  • [3] Attribute selection in marketing: A rough set approach
    Mahapatra, Sabita
    Sreekumar
    Mahapatra, S. S.
    IIMB MANAGEMENT REVIEW, 2010, 22 (1-2) : 16 - 24
  • [4] Sample Pair Selection for Attribute Reduction with Rough Set
    Chen, Degang
    Zhao, Suyun
    Zhang, Lei
    Yang, Yongping
    Zhang, Xiao
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (11) : 2080 - 2093
  • [5] A hierarchical clustering method for attribute dicretization in rough set theory
    Li, MX
    Wu, CD
    Han, ZH
    Yue, Y
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 3650 - 3654
  • [6] MAR: Maximum Attribute Relative of soft set for clustering attribute selection
    Mamat, Rabiei
    Herawan, Tutut
    Denis, Mustafa Mat
    KNOWLEDGE-BASED SYSTEMS, 2013, 52 : 11 - 20
  • [7] Attribute clustering using rough set theory for feature selection in fault severity classification of rotating machinery
    Pacheco, Fannia
    Cerrada, Mariela
    Sanchez, Rene-Vinicio
    Cabrera, Diego
    Li, Chuan
    de Oliveira, Jose Valente
    EXPERT SYSTEMS WITH APPLICATIONS, 2017, 71 : 69 - 86
  • [8] A Soft Set Approach for Fast Clustering Attribute Selection
    Hartama, Dedy
    Yanto, Iwm Tri Riyadi
    Zarlis, Muhammad
    2016 INTERNATIONAL CONFERENCE ON INFORMATICS AND COMPUTING (ICIC), 2016, : 12 - 15
  • [9] A Framework on Rough Set-Based Partitioning Attribute Selection
    Herawan, Tutut
    Deris, Mustafa Mat
    EMERGING INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS: WITH ASPECTS OF ARTIFICIAL INTELLIGENCE, 2009, 5755 : 91 - 100
  • [10] Reduction of rough set attribute based on immune clone selection
    Liang L.
    Xu G.-H.
    Frontiers of Mechanical Engineering in China, 2006, 1 (4): : 413 - 417