Exploring Feature Selection With Limited Labels: A Comprehensive Survey of Semi-Supervised and Unsupervised Approaches

被引:10
|
作者
Li, Guojie [1 ]
Yu, Zhiwen [1 ,2 ]
Yang, Kaixiang [1 ,3 ]
Lin, Mianfen [1 ]
Chen, C. L. Philip [1 ,3 ]
机构
[1] South China Univ Technol, Sch Comp Sci & Engn, Guangzhou 510650, Peoples R China
[2] Pengcheng Lab, Shenzhen 518066, Peoples R China
[3] Pazhou Lab, Guangzhou 510335, Peoples R China
基金
中国国家自然科学基金;
关键词
Feature extraction; Surveys; Taxonomy; Reviews; Data models; Semisupervised learning; Unsupervised learning; Dimensionality reduction; feature selection; semi-supervised learning; unsupervised learning; research review; DIMENSIONALITY REDUCTION; CONSTRAINT SCORE; LAPLACIAN SCORE; INFORMATION; EFFICIENT; RELEVANCE; CLASSIFICATION; ALGORITHMS; REGRESSION; ENTROPY;
D O I
10.1109/TKDE.2024.3397878
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection is a highly regarded research area in the field of data mining, as it significantly enhances the efficiency and performance of high-dimensional data analysis by eliminating redundant and irrelevant features. Despite the ease of data acquisition, labeling data remains a laborious and expensive task. To leverage the abundance of unlabeled data, researchers have proposed various feature selection methods that operate with limited labels, including semi-supervised feature selection and unsupervised feature selection. However, a comprehensive review encompassing a thorough overview of feature selection algorithms with limited labels is lacking. To bridge this gap, this paper conducts a comprehensive exploration of feature selection methods specifically tailored to limited-label scenarios. These methods are systematically classified into two primary categories: semi-supervised and unsupervised feature selection. Additionally, by introducing a novel taxonomy and discussing future challenges, this survey aims to provide researchers with a comprehensive and in-depth understanding of feature selection in limited-label scenarios. Moreover, it aims to offer valuable insights that can guide further research and development in this domain.
引用
收藏
页码:6124 / 6144
页数:21
相关论文
共 50 条
  • [1] Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection
    Ang, Jun Chin
    Mirzal, Andri
    Haron, Habibollah
    Hamed, Haza Nuzly Abdull
    IEEE-ACM TRANSACTIONS ON COMPUTATIONAL BIOLOGY AND BIOINFORMATICS, 2016, 13 (05) : 971 - 989
  • [2] A Survey on Supervised, Unsupervised, and Semi-Supervised Approaches in Crowd Counting
    Wang, Jianyong
    Gao, Mingliang
    Li, Qilei
    Kim, Hyunbum
    Jeon, Gwanggil
    CMC-COMPUTERS MATERIALS & CONTINUA, 2024, 81 (03): : 3561 - 3582
  • [3] A Survey on semi-supervised feature selection methods
    Sheikhpour, Razieh
    Sarram, Mehdi Agha
    Gharaghani, Sajjad
    Chahooki, Mohammad Ali Zare
    PATTERN RECOGNITION, 2017, 64 : 141 - 158
  • [4] Forward semi-supervised feature selection
    Ren, Jiangtao
    Qiu, Zhengyuan
    Fan, Wei
    Cheng, Hong
    Yu, Philip S.
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2008, 5012 : 970 - +
  • [5] Neurodynamics-driven holistic approaches to semi-supervised feature selection
    Wang, Yadi
    Wang, Jun
    NEURAL NETWORKS, 2023, 157 : 377 - 386
  • [6] Evolving Ensembles of Feature Subsets towards Optimal Feature Selection for Unsupervised and Semi-supervised Clustering
    Breaban, Mihaela Elena
    TRENDS IN APPLIED INTELLIGENT SYSTEMS, PT II, PROCEEDINGS, 2010, 6097 : 67 - 76
  • [7] Semi-supervised relevance index for feature selection
    Frederico Coelho
    Cristiano Castro
    Antônio P. Braga
    Michel Verleysen
    Neural Computing and Applications, 2019, 31 : 989 - 997
  • [8] Simple strategies for semi-supervised feature selection
    Konstantinos Sechidis
    Gavin Brown
    Machine Learning, 2018, 107 : 357 - 395
  • [9] Semi-supervised Feature Selection for Gender Classification
    Wu, Jing
    Smith, William A. P.
    Hancock, Edwin R.
    COMPUTER VISION - ACCV 2009, PT II, 2010, 5995 : 23 - 33
  • [10] Semi-supervised relevance index for feature selection
    Coelho, Frederico
    Castro, Cristiano
    Braga, Antonio P.
    Verleysen, Michel
    NEURAL COMPUTING & APPLICATIONS, 2019, 31 (Suppl 2): : 989 - 997