An Experimental Study on Unsupervised Clustering-based Feature Selection Methods

Cited by: 1
Authors
Covoes, Thiago F. [1]
Hruschka, Eduardo R. [1]
Affiliations
[1] Univ Sao Paulo, Dept Comp Sci, Sao Carlos, SP, Brazil
Keywords
unsupervised feature selection; feature clustering; clustering problems; GENE-EXPRESSION DATA; ALGORITHMS; CLASSIFICATION
DOI
10.1109/ISDA.2009.79
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Feature selection is an essential task in data mining because it makes it possible not only to reduce computational times and storage requirements, but also to favor model improvement and better data understanding. In this work, we analyze three methods for unsupervised feature selection that are based on the clustering of features for redundancy removal. We report experimental results obtained in ten datasets that illustrate practical scenarios of particular interest, in which one method may be preferred over another. In order to provide some reassurance about the validity and non-randomness of the obtained results, we also present the results of statistical tests.
Pages: 993-1000
Number of pages: 8
Related papers
50 in total
  • [1] Clustering-based feature selection
    School of Informatics, Guangdong University of Foreign Studies, Guangzhou 510006, China
    Tien Tzu Hsueh Pao, 2008, SUPPL. (157-160):
  • [2] A comparative study on unsupervised feature selection methods for text clustering
    Liu, LY
    Kang, JC
    Yu, J
    Wang, ZL
    PROCEEDINGS OF THE 2005 IEEE INTERNATIONAL CONFERENCE ON NATURAL LANGUAGE PROCESSING AND KNOWLEDGE ENGINEERING (IEEE NLP-KE'05), 2005, : 597 - 601
  • [3] A clustering-based feature selection via feature separability
    Jiang, Shengyi
    Wang, Lianxi
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2016, 31 (02) : 927 - 937
  • [4] PSO with surrogate models for feature selection: static and dynamic clustering-based methods
    Hoai Bach Nguyen
    Xue, Bing
    Andreae, Peter
    MEMETIC COMPUTING, 2018, 10 (03) : 291 - 300
  • [5] PSO with surrogate models for feature selection: static and dynamic clustering-based methods
    Hoai Bach Nguyen
    Bing Xue
    Peter Andreae
    Memetic Computing, 2018, 10 : 291 - 300
  • [6] Unsupervised Feature Selection with Feature Clustering
    Cheung, Yiu-ming
    Jia, Hong
    2012 IEEE/WIC/ACM INTERNATIONAL CONFERENCE ON WEB INTELLIGENCE AND INTELLIGENT AGENT TECHNOLOGY (WI-IAT 2012), VOL 1, 2012, : 9 - 15
  • [7] Clustering-based feature selection for verb sense disambiguation
    Chen, JY
    Palmer, M
    Proceedings of the 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering (IEEE NLP-KE'05), 2005, : 36 - 41
  • [8] Clustering-based Feature Selection for Internet Attack Defense
    Seo, Jungtaek
    Kim, Jungtae
    Moon, Jongsub
    Kang, Boo Jung
    Im, Eul Gyu
    INTERNATIONAL JOURNAL OF FUTURE GENERATION COMMUNICATION AND NETWORKING, 2008, 1 (01): : 91 - 98
  • [9] Feature selection in unsupervised context: Clustering based approach
    Klepaczko, A
    Materka, A
    Computer Recognition Systems, Proceedings, 2005, : 219 - 226
  • [10] Spectral Clustering Based Unsupervised Feature Selection Algorithms
    Xie J.-Y.
    Ding L.-J.
    Wang M.-Z.
    Ruan Jian Xue Bao/Journal of Software, 2020, 31 (04): : 1009 - 1024