Attribute reduction algorithm based on combined distance in clustering

被引:0
|
作者
Liang, Baohua [1 ,2 ,3 ]
Lu, Zhengyu [1 ]
机构
[1] Guangxi Normal Univ, Key Lab Educ Blockchain & Intelligent Technol, Minist Educ, Guilin, Peoples R China
[2] Guangxi Normal Univ, Guangxi Key Lab Multisource Informat Min & Secur, Guilin, Peoples R China
[3] Chaohu Univ, Inst Comp & Artif Intelligence, Hefei, Peoples R China
关键词
Rough sets; attribute reduction; clustering; combined distance; ROUGH SET-THEORY; DISCERNIBILITY; MATRIX; ENTROPY;
D O I
10.3233/JIFS-222666
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Attribute reduction is a widely used technique in data preprocessing, aiming to remove redundant and irrelevant attributes. However, most attribute reduction models only consider the importance of attributes as an important basis for reduction, without considering the relationship between attributes and the impact on classification results. In order to overcome this shortcoming, this article firstly defines the distance between samples based on the number of combinations formed by comparing the samples in the same sub-division. Secondly, from the point of view of clustering, according to the principle that the distance between each point in the cluster should be as small as possible, and the sample distance between different clusters should be as large as possible, the combined distance is used to define the importance of attributes. Finally, according to the importance of attributes, a new attribute reduction mechanism is proposed. Furthermore, plenty of experiments are done to verify the performance of the proposed reduction algorithm. The results show that the data sets reduced by our algorithm has a prominent advantage in classification accuracy, which can effectively reduce the dimensionality of high-dimensional data, and at the same time provide new methods for the study of attribute reduction models.
引用
收藏
页码:1481 / 1496
页数:16
相关论文
共 50 条
  • [41] A hierarchy distance computing based clustering algorithm
    School of Electronics Engineering and Computer Science, Peking University, Beijing 100871, China
    不详
    不详
    Jisuanji Xuebao, 2007, 5 (786-795): : 786 - 795
  • [42] Page clustering using a distance based algorithm
    Mojica, JA
    Rojas, DA
    Gómez, J
    González, F
    THIRD LATIN AMERICAN WEB CONGRESS, PROCEEDINGS, 2005, : 223 - 229
  • [43] An attribute reduction algorithm based on genetic algorithm and discernibility matrix
    Zhengjiang, Wu
    Jingmin, Zhang
    Yan, Gao
    Journal of Software, 2012, 7 (11) : 2640 - 2648
  • [44] An Incremental Clustering Algorithm Based on Mahalanobis Distance
    Aik, Lim Eng
    Choon, Tan Wee
    INTERNATIONAL CONFERENCE ON QUANTITATIVE SCIENCES AND ITS APPLICATIONS (ICOQSIA 2014), 2014, 1635 : 788 - 793
  • [45] A non-distance based clustering algorithm
    Zhu, SG
    Li, T
    PROCEEDING OF THE 2002 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOLS 1-3, 2002, : 2357 - 2362
  • [46] A Global Discretization and Attribute Reduction Algorithm based on K-means Clustering and Rough Sets Theory
    Hu Min
    2009 SECOND INTERNATIONAL SYMPOSIUM ON KNOWLEDGE ACQUISITION AND MODELING: KAM 2009, VOL 2, 2009, : 92 - 95
  • [47] Spectral Clustering with Neighborhood Attribute Reduction Based on Information Entropy
    Jia, Hongjie
    Ding, Shifei
    Ma, Heng
    Xing, Wanqiu
    JOURNAL OF COMPUTERS, 2014, 9 (06) : 1316 - 1324
  • [48] Fuzzy Partition Distance Based Attribute Reduction in Decision Tables
    Van Thien Nguyen
    Long Giang Nguyen
    Nhu Son Nguyen
    ROUGH SETS, IJCRS 2018, 2018, 11103 : 614 - 627
  • [49] Attribute Weighted Fuzzy Clustering Algorithm Based on Mutual Information
    Cao, Yao Zhu
    Lin, He
    Liu, Biao
    2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2017,
  • [50] A Novel Fuzzy Clustering Algorithm Based on Similarity of Attribute Space
    Shi W.
    Zhuo J.
    Lan Y.
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2019, 41 (11): : 2722 - 2728