An Experimental Study on Unsupervised Clustering-based Feature Selection Methods

被引:1
|
作者
Covoes, Thiago F. [1 ]
Hruschka, Eduardo R. [1 ]
机构
[1] Univ Sao Paulo, Dept Comp Sci, Sao Carlos, SP, Brazil
关键词
unsupervised feature selection; feature clustering; clustering problems; GENE-EXPRESSION DATA; ALGORITHMS; CLASSIFICATION;
D O I
10.1109/ISDA.2009.79
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is an essential task in data mining because it makes it possible not only to reduce computational times and storage requirements, but also to favor model improvement and better data understanding. In this work, we analyze three methods for unsupervised feature selection that are based on the clustering of features for redundancy removal. We report experimental results obtained in ten datasets that illustrate practical scenarios of particular interest, in which one method may be preferred over another. In order to provide some reassurance about the validity and non-randomness of the obtained results, we also present the results of statistical tests.
引用
收藏
页码:993 / 1000
页数:8
相关论文
共 50 条
  • [41] A review of unsupervised feature selection methods
    Saúl Solorio-Fernández
    J. Ariel Carrasco-Ochoa
    José Fco. Martínez-Trinidad
    Artificial Intelligence Review, 2020, 53 : 907 - 948
  • [42] A review of unsupervised feature selection methods
    Solorio-Fernandez, Saul
    Carrasco-Ochoa, J. Ariel
    Martinez-Trinidad, Jose Fco.
    ARTIFICIAL INTELLIGENCE REVIEW, 2020, 53 (02) : 907 - 948
  • [43] A novel dissimilarity metric based on feature-to-feature scatter frequencies for clustering-based feature selection in biomedical data
    Sheikhi, Ghazaal
    Altincay, Hakan
    COMPUTATIONAL INTELLIGENCE, 2021, 37 (04) : 1865 - 1889
  • [44] Clustering-based Sequential Feature Selection Approach for High Dimensional Data Classification
    Alimoussa, M.
    Porebski, A.
    Vandenbroucke, N.
    Thami, R. Oulad Haj
    El Fkihi, S.
    VISAPP: PROCEEDINGS OF THE 16TH INTERNATIONAL JOINT CONFERENCE ON COMPUTER VISION, IMAGING AND COMPUTER GRAPHICS THEORY AND APPLICATIONS - VOL. 4: VISAPP, 2021, : 122 - 132
  • [45] Clustering-based hybrid feature selection approach for high dimensional microarray data
    Babu, Samson Anosh P.
    Annavarapu, Chandra Sekhara Rao
    Dara, Suresh
    CHEMOMETRICS AND INTELLIGENT LABORATORY SYSTEMS, 2021, 213
  • [46] Research on Feature Selection Methods Based on Feature Clustering and Information Theory
    Wang, Wenhui
    Zhou, Changyin
    ADVANCED INTELLIGENT COMPUTING TECHNOLOGY AND APPLICATIONS, PT XIII, ICIC 2024, 2024, 14874 : 71 - 82
  • [48] An improved unsupervised clustering-based intrusion detection method
    Hai, YJ
    Wu, Y
    Wang, GY
    Data Mining, Intrusion Detection, Information Assurance, and Data Networks Security 2005, 2005, 5812 : 52 - 60
  • [49] A Mixed Unsupervised Clustering-based Intrusion Detection Model
    Zhang, Cuixiao
    Zhang, Guobing
    Sun, Shanshan
    THIRD INTERNATIONAL CONFERENCE ON GENETIC AND EVOLUTIONARY COMPUTING, 2009, : 426 - 428
  • [50] A Hybrid Unsupervised Clustering-Based Anomaly Detection Method
    Guo Pu
    Lijuan Wang
    Jun Shen
    Fang Dong
    TsinghuaScienceandTechnology, 2021, 26 (02) : 146 - 153