An Unsupervised Feature Selection Algorithm: Laplacian Score Combined with Distance-based Entropy Measure

被引:27
|
作者
Liu, Rongye [1 ]
Yang, Ning [2 ]
Ding, Xiangqian [2 ]
Ma, Lintao [2 ]
机构
[1] Ocean Univ China, Dept Informat Sci & Engn, Qingdao, Peoples R China
[2] Ocean Univ China, Ctr Informat Engn, Qingdao, Peoples R China
关键词
Unsupervised Feature Selction; Laplacian Score(LS); Distance-based Entropy Measure; LSE;
D O I
10.1109/IITA.2009.390
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In unsupervised learning paradigm, we are not given class labels, which features should we keep? Unsupervised feature selection method well solves this problem and has got a good effect in features selection with unlabeled data. Laplacian Score (LS) is a newly proposed unsupervised feature selection algorithm. However it uses k-means clustering method to select the top k features, therefore, the disadvantages of k-means clustering method greatly affect the result and increases the complexity of LS. In this paper, we introduce a novel algorithm called LSE (Laplacian Score combined with distance-based entropy measure) for automatically selecting subset of features. LSE uses distance-based entropy to replace the k-means clustering method in LS, which intrinsically solves the drawbacks of LS and contribute to the stability and efficiency of LSE. We compare LSE with LS on six UCI data sets. Experimental results demonstrate LSE can outperform LS on stability and efficiency, especially when processing high dimension datasets.
引用
收藏
页码:65 / +
页数:2
相关论文
共 50 条
  • [31] Distance-Based Tournament Selection
    Oesch, Christian
    APPLICATIONS OF EVOLUTIONARY COMPUTATION, EVOAPPLICATIONS 2017, PT I, 2017, 10199 : 705 - 714
  • [32] A Note on Distance-Based Entropy of Dendrimers
    Ghorbani, Modjtaba
    Dehmer, Matthias
    Zangi, Samaneh
    Mowshowitz, Abbe
    Emmert-Streib, Frank
    AXIOMS, 2019, 8 (03)
  • [33] Unsupervised Feature Selection Using Correlation Score
    Pattanshetti, Tanuja
    Attar, Vahida
    COMPUTING, COMMUNICATION AND SIGNAL PROCESSING, ICCASP 2018, 2019, 810 : 355 - 362
  • [34] Unsupervised bidirectional feature selection based on contribution entropy for medical databases
    Devakumari, D.
    Thangavel, K.
    Sarojini, K.
    INTERNATIONAL JOURNAL OF HEALTHCARE TECHNOLOGY AND MANAGEMENT, 2011, 12 (5-6) : 364 - 378
  • [35] An Unsupervised Attribute Clustering Algorithm for Unsupervised Feature Selection
    Zhou, Pei-Yuan
    Chan, Keith C. C.
    PROCEEDINGS OF THE 2015 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (IEEE DSAA 2015), 2015, : 710 - 716
  • [36] Feature evaluation and selection based on an entropy measure with data clustering
    Chi, ZR
    Yan, H
    OPTICAL ENGINEERING, 1995, 34 (12) : 3514 - 3519
  • [37] Unsupervised feature selection based on variance-covariance subspace distance
    Karami, Saeed
    Saberi-Movahed, Farid
    Tiwari, Prayag
    Marttinen, Pekka
    Vahdati, Sahar
    NEURAL NETWORKS, 2023, 166 : 188 - 203
  • [38] Vertical distance-based clonal selection mechanism for the multiobjective immune algorithm
    Li, Lingjie
    Lin, Qiuzhen
    Li, Ke
    Ming, Zhong
    SWARM AND EVOLUTIONARY COMPUTATION, 2021, 63
  • [39] An unsupervised feature selection algorithm based on ant colony optimization
    Tabakhi, Sina
    Moradi, Parham
    Akhlaghian, Fardin
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2014, 32 : 112 - 123
  • [40] Classification of Cancer Data Based on Support Vectors Machines with Feature Selection Using Genetic Algorithm and Laplacian Score
    Rustam, Z.
    Primasari, I.
    Widya, D.
    PROCEEDINGS OF THE 3RD INTERNATIONAL SYMPOSIUM ON CURRENT PROGRESS IN MATHEMATICS AND SCIENCES 2017 (ISCPMS2017), 2018, 2023