Rough Set Methods for Attribute Clustering and Selection

被引:46
|
作者
Janusz, Andrzej [1 ]
Slezak, Dominik [1 ,2 ]
机构
[1] Univ Warsaw, Inst Math, PL-02097 Warsaw, Poland
[2] Infobright Inc, Warsaw, Poland
关键词
CLASSIFIERS; DIAGNOSIS;
D O I
10.1080/08839514.2014.883902
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this study we investigate methods for attribute clustering and their possible applications to the task of computation of decision reducts from information systems. We focus on high-dimensional datasets, that is, microarray data. For this type of data, the traditional reduct construction techniques either can be extremely computationally intensive or can yield poor performance in terms of the size of the resulting reducts. We propose two reduct computation heuristics that combine the greedy search with a diverse selection of candidate attributes. Our experiments confirm that by proper grouping of similar-in some sense interchangeable-attributes, it is possible to significantly decrease computation time, as well as to increase a quality of the obtained reducts (i.e., to decrease their average size). We examine several criteria for attribute clustering, and we also identify so-called garbage clusters, which contain attributes that can be regarded as irrelevant.
引用
收藏
页码:220 / 242
页数:23
相关论文
共 50 条
  • [31] Rough set methods in feature selection via submodular function
    Zhu, Xiao-Zhong
    Zhu, William
    Fan, Xin-Nan
    SOFT COMPUTING, 2017, 21 (13) : 3699 - 3711
  • [32] A Soft Set Model on Information System and Its Application in Clustering Attribute Selection
    Qin, Hongwu
    Ma, Xiuqin
    Zain, Jasni Mohamad
    Sulaiman, Norrozila
    Herawan, Tutut
    SOFTWARE ENGINEERING AND COMPUTER SYSTEMS, PT 2, 2011, 180 : 16 - 27
  • [33] Rough set methods in feature selection via submodular function
    Xiao-Zhong Zhu
    William Zhu
    Xin-Nan Fan
    Soft Computing, 2017, 21 : 3699 - 3711
  • [34] Intuitionistic Fuzzy Rough Set-Based Granular Structures and Attribute Subset Selection
    Tan, Anhui
    Wu, Wei-Zhi
    Qian, Yuhua
    Liang, Jiye
    Chen, Jinkun
    Li, Jinjin
    IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2019, 27 (03) : 527 - 539
  • [35] Software effort estimation by analogy using attribute selection based on rough set analysis
    Li, Jingzhou
    Ruhe, Guenther
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2008, 18 (01) : 1 - 23
  • [36] River Network Dynamic Selection with Spatial Data and Attribute Data Based on Rough Set
    Qiu, Jia
    Li, Wenjing
    2010 18TH INTERNATIONAL CONFERENCE ON GEOINFORMATICS, 2010,
  • [37] Feature Selection Method for Network Intrusion Based on Fast Attribute Reduction of Rough Set
    Geng, Guohua
    Li, Na
    Gong, Shangfu
    2012 INTERNATIONAL CONFERENCE ON INDUSTRIAL CONTROL AND ELECTRONICS ENGINEERING (ICICEE), 2012, : 530 - 534
  • [38] Attribute selection based on rough set theory for electromagnetic interference (EMI) fault diagnosis
    Department of Information Management, National Kaohsiung First University of Science and Technology, Kaohsiung, Taiwan
    不详
    Qual Eng, 2006, 2 (161-171):
  • [39] A fuzzy similarity-based rough set approach for attribute selection in set-valued information systems
    Shivani Singh
    Shivam Shreevastava
    Tanmoy Som
    Gaurav Somani
    Soft Computing, 2020, 24 : 4675 - 4691
  • [40] A fuzzy similarity-based rough set approach for attribute selection in set-valued information systems
    Singh, Shivani
    Shreevastava, Shivam
    Som, Tanmoy
    Somani, Gaurav
    SOFT COMPUTING, 2020, 24 (06) : 4675 - 4691