Utility metric for unsupervised feature selection

被引:3
|
作者
Villa, Amalia [1 ,2 ]
Narayanan, Abhijith Mundanad [1 ,2 ]
Van Huffel, Sabine [1 ,2 ]
Bertrand, Alexander [1 ,2 ]
Varon, Carolina [1 ,3 ,4 ]
机构
[1] Katholieke Univ Leuven, Dept Elect Engn ESAT, STADIUS Ctr Dynam Syst Signal Proc & Data Analyt, Leuven, Belgium
[2] KU Leuven Inst AI, Leuven AI, Leuven, Belgium
[3] Delft Univ Technol, Circuits & Syst CAS Grp, Delft, Netherlands
[4] Katholieke Univ Leuven, E Media Res Lab, Campus GroepT, Leuven, Belgium
基金
欧盟地平线“2020”; 欧洲研究理事会;
关键词
Unsupervised feature selection; Dimensionality reduction; Manifold learning; Kernel methods; SUBSET-SELECTION; ALGORITHM;
D O I
10.7717/peerj-cs.477
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection techniques are very useful approaches for dimensionality reduction in data analysis. They provide interpretable results by reducing the dimensions of the data to a subset of the original set of features. When the data lack annotations, unsupervised feature selectors are required for their analysis. Several algorithms for this aim exist in the literature, but despite their large applicability, they can be very inaccessible or cumbersome to use, mainly due to the need for tuning non-intuitive parameters and the high computational demands. In this work, a publicly available ready-to-use unsupervised feature selector is proposed, with comparable results to the state-of-the-art at a much lower computational cost. The suggested approach belongs to the methods known as spectral feature selectors. These methods generally consist of two stages: manifold learning and subset selection. In the first stage, the underlying structures in the high-dimensional data are extracted, while in the second stage a subset of the features is selected to replicate these structures. This paper suggests two contributions to this field, related to each of the stages involved. In the manifold learning stage, the effect of non-linearities in the data is explored, making use of a radial basis function (RBF) kernel, for which an alternative solution for the estimation of the kernel parameter is presented for cases with high-dimensional data. Additionally, the use of a backwards greedy approach based on the least-squares utility metric for the subset selection stage is proposed. The combination of these new ingredients results in the utility metric for unsupervised feature selection U2FS algorithm. The proposed U2FS algorithm succeeds in selecting the correct features in a simulation environment. In addition, the performance of the method on benchmark datasets is comparable to the state-of-the-art, while requiring less computational time. Moreover, unlike the state-of-the-art, U2FS does not require any tuning of parameters.
引用
收藏
页码:1 / 26
页数:26
相关论文
共 50 条
  • [41] Discriminative embedded unsupervised feature selection
    Zhu, Qi-Hai
    Yang, Yu-Bin
    PATTERN RECOGNITION LETTERS, 2018, 112 : 219 - 225
  • [42] Feature Selection with Unsupervised Consensus Guidance
    Liu, Hongfu
    Shao, Ming
    Fu, Yun
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2019, 31 (12) : 2319 - 2331
  • [43] Unsupervised Feature Selection for Noisy Data
    Mahdavi, Kaveh
    Labarta, Jesus
    Gimenez, Judit
    ADVANCED DATA MINING AND APPLICATIONS, ADMA 2019, 2019, 11888 : 79 - 94
  • [44] Structure preserving unsupervised feature selection
    Lu, Quanmao
    Li, Xuelong
    Dong, Yongsheng
    NEUROCOMPUTING, 2018, 301 : 36 - 45
  • [45] Unsupervised Robust Bayesian Feature Selection
    Sun, Jianyong
    Zhou, Aimin
    PROCEEDINGS OF THE 2014 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2014, : 558 - 564
  • [46] A Dynamic Evaluation Metric for Feature Selection
    Rajabinasab, Muhammad
    Lautrup, Anton D.
    Hyrup, Tobias
    Zimek, Arthur
    SIMILARITY SEARCH AND APPLICATIONS, SISAP 2024, 2025, 15268 : 65 - 72
  • [47] An efficient unsupervised feature selection procedure through feature clustering
    Yan, Xuyang
    Nazmi, Shabnam
    Erol, Berat A.
    Homaifar, Abdollah
    Gebru, Biniam
    Tunstel, Edward
    PATTERN RECOGNITION LETTERS, 2020, 131 : 277 - 284
  • [48] Unsupervised feature selection guided by orthogonal representation of feature space
    Jahani, Mahsa Samareh
    Aghamollaei, Gholamreza
    Eftekhari, Mahdi
    Saberi-Movahed, Farid
    NEUROCOMPUTING, 2023, 516 : 61 - 76
  • [49] Robust graph regularized unsupervised feature selection
    Tang, Chang
    Zhu, Xinzhong
    Chen, Jiajia
    Wang, Pichao
    Liu, Xinwang
    Tian, Jie
    EXPERT SYSTEMS WITH APPLICATIONS, 2018, 96 : 64 - 76
  • [50] Unsupervised Feature Selection in Cardiac Arrhythmias Analysis
    Rodriguez-Sotelo, J. L.
    Cuesta-Frau, D.
    Peluffo-Ordonez, D.
    Castellanos-Dominguez, G.
    2009 ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-20, 2009, : 2571 - 2574