An Experimental Study on Unsupervised Clustering-based Feature Selection Methods

被引:1
|
作者
Covoes, Thiago F. [1 ]
Hruschka, Eduardo R. [1 ]
机构
[1] Univ Sao Paulo, Dept Comp Sci, Sao Carlos, SP, Brazil
关键词
unsupervised feature selection; feature clustering; clustering problems; GENE-EXPRESSION DATA; ALGORITHMS; CLASSIFICATION;
D O I
10.1109/ISDA.2009.79
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is an essential task in data mining because it makes it possible not only to reduce computational times and storage requirements, but also to favor model improvement and better data understanding. In this work, we analyze three methods for unsupervised feature selection that are based on the clustering of features for redundancy removal. We report experimental results obtained in ten datasets that illustrate practical scenarios of particular interest, in which one method may be preferred over another. In order to provide some reassurance about the validity and non-randomness of the obtained results, we also present the results of statistical tests.
引用
收藏
页码:993 / 1000
页数:8
相关论文
共 50 条
  • [31] A comprehensive comparative study of clustering-based unsupervised defect prediction models
    Xu, Zhou
    Li, Li
    Yan, Meng
    Liu, Jin
    Luo, Xiapu
    Grundy, John
    Zhang, Yifeng
    Zhang, Xiaohong
    Journal of Systems and Software, 2021, 172
  • [32] Clustering stability-based feature selection for unsupervised texture classification
    Klepaczko, Artur
    Materka, Andrzej
    Machine Graphics and Vision, 2009, 18 (02): : 125 - 141
  • [33] Unsupervised Bayesian feature selection based on k-means clustering
    Yan, Liu
    Yan, Peng
    IC-BNMT 2007: PROCEEDINGS OF 2007 INTERNATIONAL CONFERENCE ON BROADBAND NETWORK & MULTIMEDIA TECHNOLOGY, 2007, : 352 - 356
  • [34] Joint subspace learning and subspace clustering based unsupervised feature selection
    Xiao, Zijian
    Chen, Hongmei
    Mi, Yong
    Luo, Chuan
    Horng, Shi-Jinn
    Li, Tianrui
    NEUROCOMPUTING, 2025, 635
  • [35] A new unsupervised feature selection algorithm using similarity-based feature clustering
    Zhu, Xiaoyan
    Wang, Yu
    Li, Yingbin
    Tan, Yonghui
    Wang, Guangtao
    Song, Qinbao
    COMPUTATIONAL INTELLIGENCE, 2019, 35 (01) : 2 - 22
  • [36] Unsupervised feature selection based on adaptive similarity learning and subspace clustering
    Parsa, Mohsen Ghassemi
    Zare, Hadi
    Ghatee, Mehdi
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 95 (95)
  • [37] Unsupervised Feature Selection and Clustering Optimization Based on Improved Differential Evolution
    Li, Tao
    Dong, Hongbin
    IEEE ACCESS, 2019, 7 : 140438 - 140450
  • [38] An efficient unsupervised feature selection procedure through feature clustering
    Yan, Xuyang
    Nazmi, Shabnam
    Erol, Berat A.
    Homaifar, Abdollah
    Gebru, Biniam
    Tunstel, Edward
    PATTERN RECOGNITION LETTERS, 2020, 131 : 277 - 284
  • [39] Clustering-based feature subset selection with analysis on the redundancy-complementarity dimension
    Chen, Zhijun
    Chen, Qiushi
    Zhang, Yishi
    Zhou, Lei
    Jiang, Junfeng
    Wu, Chaozhong
    Huang, Zhen
    COMPUTER COMMUNICATIONS, 2021, 168 : 65 - 74
  • [40] Clustering-based feature selection for black-box weather temperature prediction
    Karevan, Zahra
    Suykens, Johan A. K.
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 2722 - 2729