A GA-based Feature Selection for High-dimensional Data Clustering

被引:5
|
作者
Sun, Mei [1 ]
Xiong, Langhuan [1 ]
Sun, Haojun [1 ]
Jiang, Dazhi [1 ]
机构
[1] Shantou Univ, Dept Comp Sci & Technol, Shantou 515063, Peoples R China
关键词
feature selection; clustering; genetic algorithms; high-dimensional data;
D O I
10.1109/WGEC.2009.140
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
High-dimensional data clustering is an open problem in modern data mining. This paper proposed a new genetic algorithm-based feature selection for high-dimensional data clustering, called GA-FSFclustering. This approach searches effective feature subsets for clustering in all features by genetic algorithm. The candidate features and cluster centers are real number encoded. A new criterion for evaluating feature subsets is employed as the fitness function. The experimental results indicate the feasibility and efficiency of the GA-FSFclustering algorithm.
引用
收藏
页码:769 / 772
页数:4
相关论文
共 50 条
  • [41] Evaluating Feature Selection Robustness on High-Dimensional Data
    Pes, Barbara
    HYBRID ARTIFICIAL INTELLIGENT SYSTEMS (HAIS 2018), 2018, 10870 : 235 - 247
  • [42] Feature selection for classifying high-dimensional numerical data
    Wu, YM
    Zhang, AD
    PROCEEDINGS OF THE 2004 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, VOL 2, 2004, : 251 - 258
  • [43] Incomplete high-dimensional data imputation algorithm using feature selection and clustering analysis on cloud
    Fanyu Bu
    Zhikui Chen
    Qingchen Zhang
    Laurence T. Yang
    The Journal of Supercomputing, 2016, 72 : 2977 - 2990
  • [44] A Hybrid Feature Extraction Selection Approach for High-Dimensional Non-Gaussian Data Clustering
    Boutemedjet, Sabri
    Bouguila, Nizar
    Ziou, Djemel
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2009, 31 (08) : 1429 - 1443
  • [45] A novel minorization–maximization framework for simultaneous feature selection and clustering of high-dimensional count data
    Nuha Zamzami
    Nizar Bouguila
    Pattern Analysis and Applications, 2023, 26 : 91 - 106
  • [46] A Clustering Algorithm for High-Dimensional Nonlinear Feature Data with Applications
    Jiang H.
    Wang G.
    Gao J.
    Gao Z.
    Gao R.
    Guo Q.
    Hsi-An Chiao Tung Ta Hsueh/Journal of Xi'an Jiaotong University, 2017, 51 (12): : 49 - 55and90
  • [47] Joint image clustering and feature selection with auto-adjoined learning for high-dimensional data
    Wang, Xiaodong
    Wu, Pengtao
    Xu, Qinghua
    Zeng, Zhiqiang
    Xie, Yong
    KNOWLEDGE-BASED SYSTEMS, 2021, 232
  • [48] Incomplete high-dimensional data imputation algorithm using feature selection and clustering analysis on cloud
    Bu, Fanyu
    Chen, Zhikui
    Zhang, Qingchen
    Yang, Laurence T.
    JOURNAL OF SUPERCOMPUTING, 2016, 72 (08): : 2977 - 2990
  • [49] Latent Feature Group Learning for High-Dimensional Data Clustering
    Wang, Wenting
    He, Yulin
    Ma, Liheng
    Huang, Joshua Zhexue
    INFORMATION, 2019, 10 (06)
  • [50] A Clustering-Guided Integer Brain Storm Optimizer for Feature Selection in High-Dimensional Data
    Jia Yun-Tao
    Zhang Wan-Qiu
    He Chun-Lin
    DISCRETE DYNAMICS IN NATURE AND SOCIETY, 2021, 2021