An Experimental Study on Unsupervised Clustering-based Feature Selection Methods

被引:1
|
作者
Covoes, Thiago F. [1 ]
Hruschka, Eduardo R. [1 ]
机构
[1] Univ Sao Paulo, Dept Comp Sci, Sao Carlos, SP, Brazil
关键词
unsupervised feature selection; feature clustering; clustering problems; GENE-EXPRESSION DATA; ALGORITHMS; CLASSIFICATION;
D O I
10.1109/ISDA.2009.79
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is an essential task in data mining because it makes it possible not only to reduce computational times and storage requirements, but also to favor model improvement and better data understanding. In this work, we analyze three methods for unsupervised feature selection that are based on the clustering of features for redundancy removal. We report experimental results obtained in ten datasets that illustrate practical scenarios of particular interest, in which one method may be preferred over another. In order to provide some reassurance about the validity and non-randomness of the obtained results, we also present the results of statistical tests.
引用
收藏
页码:993 / 1000
页数:8
相关论文
共 50 条
  • [21] A clustering-based feature selection framework for handwritten Indic script classification
    Chatterjee, Iman
    Ghosh, Manosij
    Sing, Pawan Kumar
    Sarkar, Ram
    Nasipuri, Mita
    EXPERT SYSTEMS, 2019, 36 (06)
  • [22] Reactive search-MST optimized clustering-based feature selection
    Kaleemullah, A.
    Suresh, A.
    INTERNATIONAL JOURNAL OF FINANCIAL ENGINEERING, 2022, 09 (03)
  • [23] Spectral Clustering-based Local and Global Structure Preservation for Feature Selection
    Zhou, Sihang
    Liu, Xinwang
    Zhu, Chengzhang
    Liu, Qiang
    Yin, Jianping
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 550 - 557
  • [24] Fuzzy Clustering-based GMDH Model to Feature Selection in Customer Analysis
    Zhao, Hengjun
    He, Changzheng
    Ye, Zhen
    ISBIM: 2008 INTERNATIONAL SEMINAR ON BUSINESS AND INFORMATION MANAGEMENT, VOL 1, 2009, : 461 - 464
  • [25] A clustering-based feature selection method for automatically generated relational attributes
    Mostafa Rezaei
    Ivor Cribben
    Michele Samorani
    Annals of Operations Research, 2021, 303 : 233 - 263
  • [26] A clustering-based method for unsupervised intrusion detections
    Jiang, SY
    Song, XY
    Wang, H
    Han, JJ
    Li, QH
    PATTERN RECOGNITION LETTERS, 2006, 27 (07) : 802 - 810
  • [27] A unifying criterion for unsupervised clustering and feature selection
    Breaban, Mihaela
    Luchian, Henri
    PATTERN RECOGNITION, 2011, 44 (04) : 854 - 865
  • [28] Unsupervised Feature Selection with Joint Clustering Analysis
    An, Shuai
    Wang, Jun
    Wei, Jinmao
    Yang, Zhenglu
    CIKM'17: PROCEEDINGS OF THE 2017 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2017, : 1639 - 1648
  • [29] Subspace clustering guided unsupervised feature selection
    Zhu, Pengfei
    Zhu, Wencheng
    Hu, Qinghua
    Zhang, Changqing
    Zuo, Wangmeng
    PATTERN RECOGNITION, 2017, 66 : 364 - 374
  • [30] A comprehensive comparative study of clustering-based unsupervised defect prediction models
    Xu, Zhou
    Li, Li
    Yan, Meng
    Liu, Jin
    Luo, Xiapu
    Grundy, John
    Zhang, Yifeng
    Zhang, Xiaohong
    JOURNAL OF SYSTEMS AND SOFTWARE, 2021, 172