Rough Set Methods for Attribute Clustering and Selection

被引:46
|
作者
Janusz, Andrzej [1 ]
Slezak, Dominik [1 ,2 ]
机构
[1] Univ Warsaw, Inst Math, PL-02097 Warsaw, Poland
[2] Infobright Inc, Warsaw, Poland
关键词
CLASSIFIERS; DIAGNOSIS;
D O I
10.1080/08839514.2014.883902
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study we investigate methods for attribute clustering and their possible applications to the task of computation of decision reducts from information systems. We focus on high-dimensional datasets, that is, microarray data. For this type of data, the traditional reduct construction techniques either can be extremely computationally intensive or can yield poor performance in terms of the size of the resulting reducts. We propose two reduct computation heuristics that combine the greedy search with a diverse selection of candidate attributes. Our experiments confirm that by proper grouping of similar-in some sense interchangeable-attributes, it is possible to significantly decrease computation time, as well as to increase a quality of the obtained reducts (i.e., to decrease their average size). We examine several criteria for attribute clustering, and we also identify so-called garbage clusters, which contain attributes that can be regarded as irrelevant.
引用
收藏
页码:220 / 242
页数:23
相关论文
共 50 条
  • [21] Approach to Data Table Decomposition Based on Rough Set Attribute Selection Measure
    Jin, Wenbing
    SECOND INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING: WGEC 2008, PROCEEDINGS, 2008, : 543 - 548
  • [22] A Network Intrusion Detection Algorithm Based on Rough Set Attribute-weighted Clustering
    Wang Lifang
    ISTM/2009: 8TH INTERNATIONAL SYMPOSIUM ON TEST AND MEASUREMENT, VOLS 1-6, 2009, : 3551 - 3554
  • [23] An New Algorithm-based Rough Set for Selecting Clustering Attribute in Categorical Data
    Baroud, Muftah Mohamed Jomah
    Hashim, Siti Zaiton Mohd
    Zainal, Anazida
    Ahnad, Jamilah
    2020 6TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATION SYSTEMS (ICACCS), 2020, : 1358 - 1364
  • [24] Selection of representative embankments based on rough set - fuzzy clustering method
    Ou Bin
    Lin Zhi-xiang
    Fu Shu-yan
    Gao Sheng-song
    3RD INTERNATIONAL CONFERENCE ON ADVANCES IN ENERGY RESOURCES AND ENVIRONMENT ENGINEERING, 2018, 113
  • [25] Uncertainty mode selection in categorical clustering using the rough set theory
    Naouali, Sami
    Ben Salem, Semeh
    Chtourou, Zied
    EXPERT SYSTEMS WITH APPLICATIONS, 2020, 158
  • [26] Autonomous threshold selection based on rough set theory in clustering algorithm
    Song, Xiao-Yu
    Liu, Feng
    Sun, Huan-Liang
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2010, 32 (01): : 192 - 194
  • [27] Clustering algorithm using rough set theory for unsupervised feature selection
    Pacheco, Fannia
    Cerrada, Mariela
    Li, Chuan
    Sanchez, Rene Vinicio
    Cabrera, Diego
    de Oliveira, Jose Valente
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 3493 - 3499
  • [28] An Enhanced Feature Selection Method Comprising Rough Set and Clustering Techniques
    Murugan, A.
    Sridevi, T.
    2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (IEEE ICCIC), 2014, : 401 - 404
  • [29] Attribute reduction based on approximation set of rough set
    Zhang, Qinghua, 1600, Binary Information Press (10):
  • [30] Precision of Rough Set Clustering
    Lingras, Pawan
    Chen, Min
    Miao, Duoqian
    ROUGH SETS AND CURRENT TRENDS IN COMPUTING, PROCEEDINGS, 2008, 5306 : 369 - +