An Unsupervised Feature Selection Algorithm: Laplacian Score Combined with Distance-based Entropy Measure

Cited by: 27
Authors
Liu, Rongye [1]
Yang, Ning [2]
Ding, Xiangqian [2]
Ma, Lintao [2]
Affiliations
[1] Ocean Univ China, Dept Informat Sci & Engn, Qingdao, Peoples R China
[2] Ocean Univ China, Ctr Informat Engn, Qingdao, Peoples R China
Keywords
Unsupervised Feature Selection; Laplacian Score (LS); Distance-based Entropy Measure; LSE
DOI
10.1109/IITA.2009.390
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In the unsupervised learning paradigm, no class labels are given, so which features should we keep? Unsupervised feature selection addresses this problem and has proven effective for selecting features from unlabeled data. Laplacian Score (LS) is a recently proposed unsupervised feature selection algorithm. However, it uses the k-means clustering method to select the top k features; the disadvantages of k-means therefore strongly affect the result and increase the complexity of LS. In this paper, we introduce a novel algorithm called LSE (Laplacian Score combined with distance-based entropy measure) for automatically selecting a subset of features. LSE replaces the k-means clustering method in LS with a distance-based entropy measure, which intrinsically resolves these drawbacks of LS and contributes to the stability and efficiency of LSE. We compare LSE with LS on six UCI data sets. Experimental results demonstrate that LSE can outperform LS in stability and efficiency, especially when processing high-dimensional data sets.
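The two ingredients named in the abstract can be sketched in NumPy as follows. This is a minimal illustration under stated assumptions, not the authors' LSE procedure: the abstract does not say how the two measures are combined, so they are shown separately. The function names, the symmetric k-nearest-neighbour graph, the heat-kernel width `t`, and the choice of `alpha` from the mean pairwise distance are all assumptions taken from commonly used textbook formulations.

```python
import numpy as np


def pairwise_sq_dists(X):
    """Squared Euclidean distances between all rows of X (n x d)."""
    sq = np.sum(X ** 2, axis=1)
    d2 = sq[:, None] + sq[None, :] - 2.0 * (X @ X.T)
    return np.maximum(d2, 0.0)  # clip tiny negatives from round-off


def laplacian_score(X, k=5, t=1.0):
    """Laplacian Score per feature (lower = better locality preservation).

    Assumed formulation: symmetric kNN graph with heat-kernel weights
    exp(-||xi - xj||^2 / t); requires k < number of samples.
    """
    n, d = X.shape
    d2 = pairwise_sq_dists(X)

    # Find each sample's k nearest neighbours, excluding itself.
    d2_no_self = d2.copy()
    np.fill_diagonal(d2_no_self, np.inf)
    nn = np.argsort(d2_no_self, axis=1)[:, :k]

    # Symmetric adjacency: i~j if either is a neighbour of the other.
    A = np.zeros((n, n), dtype=bool)
    A[np.repeat(np.arange(n), k), nn.ravel()] = True
    A = A | A.T

    S = np.where(A, np.exp(-d2 / t), 0.0)  # edge weights
    deg = S.sum(axis=1)                    # diagonal of the degree matrix D
    L = np.diag(deg) - S                   # graph Laplacian

    scores = np.empty(d)
    for r in range(d):
        f = X[:, r]
        # Remove the degree-weighted mean of the feature.
        f_tilde = f - (f @ deg) / deg.sum()
        num = f_tilde @ L @ f_tilde
        den = f_tilde @ (deg * f_tilde)
        scores[r] = num / den if den > 0 else np.inf
    return scores


def distance_entropy(X):
    """Distance-based entropy of a data set (assumed formulation).

    Similarity s_ij = exp(-alpha * dist_ij), with alpha = ln(2) / mean
    distance, so a pair at the mean distance has similarity 0.5.
    """
    d = np.sqrt(pairwise_sq_dists(X))
    dist = d[np.triu_indices(X.shape[0], k=1)]  # each pair once
    mean = dist.mean()
    if mean == 0:
        return 0.0  # all points coincide
    alpha = np.log(2.0) / mean
    s = np.clip(np.exp(-alpha * dist), 1e-12, 1.0 - 1e-12)
    return float(-np.sum(s * np.log(s) + (1.0 - s) * np.log(1.0 - s)))
```

Calling `laplacian_score(X)` on an `n x d` array returns one score per feature, and `distance_entropy(X[:, subset])` evaluates a candidate feature subset. One plausible use, again an assumption rather than the published method, is to rank features by Laplacian Score and then pick the subset size by comparing entropies of the top-ranked prefixes.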
Pages: 65 / +
Page count: 2