Feature subset selection for data and feature streams: a review

被引:0
|
作者
Carlos Villa-Blanco
Concha Bielza
Pedro Larrañaga
机构
[1] Universidad Politécnica de Madrid,Computational Intelligence Group, Departamento de Inteligencia Artificial
来源
关键词
Data streams; Feature streams; Dynamic environments; Feature subset selection; Supervised classification; Clustering;
D O I
暂无
中图分类号
学科分类号
摘要
Real-world problems are commonly characterized by a high feature dimensionality, which hinders the modelling and descriptive analysis of the data. However, some of these data may be irrelevant or redundant for the learning process. Different approaches can be used to reduce this information, improving not only the speed of building models but also their performance and interpretability. In this review, we focus on feature subset selection (FSS) techniques, which select a subset of the original feature set without making any transformation on the attributes. Traditional batch FSS algorithms may not be adequate to efficiently handle large volumes of data, either because memory problems arise or data are received in a sequential manner. Thus, this article aims to survey the state of the art of incremental FSS algorithms, which can perform more efficiently under these circumstances. Different strategies are described, such as incrementally updating feature weights, applying information theory or using rough set-based FSS, as well as multiple supervised and unsupervised learning tasks where the application of FSS is interesting.
引用
收藏
页码:1011 / 1062
页数:51
相关论文
共 50 条
  • [1] Feature subset selection for data and feature streams: a review
    Villa-Blanco, Carlos
    Bielza, Concha
    Larranaga, Pedro
    ARTIFICIAL INTELLIGENCE REVIEW, 2023, 56 (SUPPL 1) : 1011 - 1062
  • [2] Iterative Subset Selection for Feature Drifting Data Streams
    Yuan, Lanqin
    Pfahringer, Bernhard
    Barddal, Jean Paul
    33RD ANNUAL ACM SYMPOSIUM ON APPLIED COMPUTING, 2018, : 510 - 517
  • [3] Addressing Feature Drift in Data Streams Using Iterative Subset Selection
    Yuan, Lanqin
    Pfahringer, Bernhard
    Barddal, Jean Paul
    APPLIED COMPUTING REVIEW, 2019, 19 (01): : 20 - 33
  • [4] Online feature subset selection for mining feature streams in big data via incremental learning and evolutionary computation
    Vivek, Yelleti
    Ravi, Vadlamani
    Krishna, P. Radha
    SWARM AND EVOLUTIONARY COMPUTATION, 2025, 94
  • [5] Feature subset selection with applications to hyperspectral data
    Chen, H
    Varshney, PK
    2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 249 - 252
  • [6] Optimal Feature Selection using Fuzzy Combination of Feature Subset for Transcriptome Data
    Singh, Vikas
    Vardhan, Harsh
    Verma, Nishchal K.
    Cui, Yan
    2018 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2018,
  • [7] A conservative feature subset selection algorithm with missing data
    Aussem, Alex
    de Morais, Sergio Rodrigues
    NEUROCOMPUTING, 2010, 73 (4-6) : 585 - 590
  • [8] A Conservative Feature Subset Selection Algorithm with Missing Data
    Aussem, Alex
    de Morais, Sergio Rodrigues
    ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, : 725 - 730
  • [9] Survey on Feature Subset Selection for High Dimensional Data
    Shahana, A. H.
    Preeja, V
    PROCEEDINGS OF IEEE INTERNATIONAL CONFERENCE ON CIRCUIT, POWER AND COMPUTING TECHNOLOGIES (ICCPCT 2016), 2016,
  • [10] Feature subset selection and ranking for data dimensionality reduction
    Wei, Hua-Liang
    Billings, Stephen A.
    IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2007, 29 (01) : 162 - 166