Feature subset selection for data and feature streams: a review

被引:0
|
作者
Carlos Villa-Blanco
Concha Bielza
Pedro Larrañaga
机构
[1] Universidad Politécnica de Madrid,Computational Intelligence Group, Departamento de Inteligencia Artificial
来源
关键词
Data streams; Feature streams; Dynamic environments; Feature subset selection; Supervised classification; Clustering;
D O I
暂无
中图分类号
学科分类号
摘要
Real-world problems are commonly characterized by a high feature dimensionality, which hinders the modelling and descriptive analysis of the data. However, some of these data may be irrelevant or redundant for the learning process. Different approaches can be used to reduce this information, improving not only the speed of building models but also their performance and interpretability. In this review, we focus on feature subset selection (FSS) techniques, which select a subset of the original feature set without making any transformation on the attributes. Traditional batch FSS algorithms may not be adequate to efficiently handle large volumes of data, either because memory problems arise or data are received in a sequential manner. Thus, this article aims to survey the state of the art of incremental FSS algorithms, which can perform more efficiently under these circumstances. Different strategies are described, such as incrementally updating feature weights, applying information theory or using rough set-based FSS, as well as multiple supervised and unsupervised learning tasks where the application of FSS is interesting.
引用
收藏
页码:1011 / 1062
页数:51
相关论文
共 50 条
  • [31] The Minimum Feature Subset Selection Problem
    陈彬
    洪家荣
    王亚东
    JournalofComputerScienceandTechnology, 1997, (02) : 145 - 153
  • [32] Towards an optimal feature subset selection
    Shiba, OA
    Saeed, W
    Sulaiman, MN
    Ahmad, F
    Mamat, A
    SCORED 2003: STUDENT CONFERENCE ON RESEARCH AND DEVELOPMENT, PROCEEDINGS: NETWORKING THE FUTURE MIND IN CONVERGENCE TECHNOLOGY, 2003, : 376 - 380
  • [33] Algorithm for the optimal feature subset selection
    Zhu, Ming
    Wang, Junpu
    Cai, Qingsheng
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 35 (09): : 803 - 805
  • [34] Subset Feature Selection with Structural Variables
    Urrutia, Juan A.
    Estevez, Pablo A.
    Vergara, Jorge R.
    2021 IEEE LATIN AMERICAN CONFERENCE ON COMPUTATIONAL INTELLIGENCE (LA-CCI), 2021,
  • [35] Fizzy: feature subset selection for metagenomics
    Ditzler, Gregory
    Morrison, J. Calvin
    Lan, Yemin
    Rosen, Gail L.
    BMC BIOINFORMATICS, 2015, 16
  • [36] The minimum feature subset selection problem
    Bin Chen
    Jiarong Hong
    Yadong Wang
    Journal of Computer Science and Technology, 1997, 12 (2) : 145 - 153
  • [37] Feature Subset Selection by SVM Ensemble
    Ban, Tao
    Inoue, Daisuke
    PROCEEDINGS OF 2016 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (SSCI), 2016,
  • [38] Fizzy: feature subset selection for metagenomics
    Gregory Ditzler
    J. Calvin Morrison
    Yemin Lan
    Gail L. Rosen
    BMC Bioinformatics, 16
  • [39] Feature subset selection in an ICA space
    Bressan, M
    Vitrià, J
    TOPICS IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2002, 2504 : 196 - 206
  • [40] A new approach to feature subset selection
    Liu, DZ
    Feng, ZJ
    Wang, XZ
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1822 - 1825