Feature subset selection for data and feature streams: a review

Authors
Carlos Villa-Blanco
Concha Bielza
Pedro Larrañaga
Affiliation
[1] Universidad Politécnica de Madrid, Computational Intelligence Group, Departamento de Inteligencia Artificial
Keywords
Data streams; Feature streams; Dynamic environments; Feature subset selection; Supervised classification; Clustering
Abstract
Real-world problems are commonly characterized by a high feature dimensionality, which hinders the modelling and descriptive analysis of the data. However, some of these data may be irrelevant or redundant for the learning process. Different approaches can be used to reduce this information, improving not only the speed of building models but also their performance and interpretability. In this review, we focus on feature subset selection (FSS) techniques, which select a subset of the original feature set without making any transformation on the attributes. Traditional batch FSS algorithms may not be adequate to efficiently handle large volumes of data, either because memory problems arise or data are received in a sequential manner. Thus, this article aims to survey the state of the art of incremental FSS algorithms, which can perform more efficiently under these circumstances. Different strategies are described, such as incrementally updating feature weights, applying information theory or using rough set-based FSS, as well as multiple supervised and unsupervised learning tasks where the application of FSS is interesting.
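One of the strategies named in the abstract, incrementally updating feature weights, can be illustrated with a minimal sketch. It is not taken from the review: the class name `IncrementalFisherScore`, its methods, and the choice of a Fisher-style score (between-class over within-class scatter) are assumptions made here for illustration only. The sketch keeps running per-class means and variances for each feature, so each arriving instance is processed once and never stored.

```python
from collections import defaultdict


class IncrementalFisherScore:
    """Hypothetical example: per-feature weights updated one instance at a time."""

    def __init__(self, n_features):
        self.n_features = n_features
        # Per class label: instance count, running mean, and running sum of
        # squared deviations (Welford's online algorithm) for every feature.
        self.count = defaultdict(int)
        self.mean = defaultdict(lambda: [0.0] * n_features)
        self.m2 = defaultdict(lambda: [0.0] * n_features)

    def update(self, x, y):
        """Fold one labelled instance (x: sequence of floats, y: label) into the statistics."""
        self.count[y] += 1
        n = self.count[y]
        mean, m2 = self.mean[y], self.m2[y]
        for j, value in enumerate(x):
            delta = value - mean[j]
            mean[j] += delta / n
            m2[j] += delta * (value - mean[j])

    def scores(self):
        """Return one weight per feature: between-class scatter / within-class scatter."""
        total = sum(self.count.values())
        if total == 0:
            return [0.0] * self.n_features
        result = []
        for j in range(self.n_features):
            overall = sum(self.count[c] * self.mean[c][j] for c in self.count) / total
            between = sum(
                self.count[c] * (self.mean[c][j] - overall) ** 2 for c in self.count
            )
            within = sum(self.m2[c][j] for c in self.count)
            # Small constant avoids division by zero for constant features.
            result.append(between / (within + 1e-12))
        return result

    def top_k(self, k):
        """Indices of the k highest-weighted features at this point in the stream."""
        s = self.scores()
        return sorted(range(self.n_features), key=lambda j: s[j], reverse=True)[:k]


# Usage: feature 0 separates the classes, feature 1 does not.
selector = IncrementalFisherScore(n_features=2)
stream = [([0.0, 1.0], 0), ([1.0, 1.0], 1), ([0.1, 2.0], 0), ([1.1, 2.0], 1)]
for x, y in stream:
    selector.update(x, y)
selected = selector.top_k(1)
```

Because the statistics are additive, the selected subset can be re-read at any point in the stream. This sketch does not handle concept drift or newly arriving features, both of which fall within the scope the abstract describes.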
Pages: 1011-1062
Page count: 51
Related papers
50 in total
  • [21] A Novel Scalable and Data Efficient Feature Subset Selection Algorithm
    de Morais, Sergio Rodrigues
    Aussem, Alex
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PART II, PROCEEDINGS, 2008, 5212 : 298 - +
  • [22] Feature subset selection and feature ranking for multivariate time series
    Yoon, H
    Yang, KY
    Shahabi, C
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (09) : 1186 - 1198
  • [23] An Adaptive Multiple Feature Subset Method for Feature Ranking and Selection
    Chang, Fu
    Chen, Jen-Cheng
    INTERNATIONAL CONFERENCE ON TECHNOLOGIES AND APPLICATIONS OF ARTIFICIAL INTELLIGENCE (TAAI 2010), 2010, : 255 - 262
  • [24] Feature ranking based consensus clustering for feature subset selection
    Rani, D. Sandhya
    Rani, T. Sobha
    Bhavani, S. Durga
    Krishna, G. Bala
    APPLIED INTELLIGENCE, 2024, 54 (17-18) : 8154 - 8169
  • [25] Evaluation of feature subset selection, feature weighting, and prototype selection for biomedical applications
    Little, Suzanne
    Salvetti, Ovidio
    Perner, Petra
    ADVANCES IN CASE-BASED REASONING, PROCEEDINGS, 2008, 5239 : 312 - 324
  • [26] Improved Data Streams Classification with Fast Unsupervised Feature Selection
    Wang, Lulu
    Shen, Hong
    2016 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED COMPUTING, APPLICATIONS AND TECHNOLOGIES (PDCAT), 2016, : 221 - 226
  • [27] Boosting decision stumps for dynamic feature selection on data streams
    Barddal, Jean Paul
    Enembreck, Fabricio
    Gomes, Heitor Murilo
    Bifet, Albert
    Pfahringer, Bernhard
    INFORMATION SYSTEMS, 2019, 83 : 13 - 29
  • [28] On the utility of incremental feature selection for the classification of textual data streams
    Katakis, I
    Tsoumakas, G
    Vlahavas, I
    ADVANCES IN INFORMATICS, PROCEEDINGS, 2005, 3746 : 338 - 348
  • [29] Dynamic Feature Selection for Clustering High Dimensional Data Streams
    Fahy, Conor
    Yang, Shengxiang
    IEEE ACCESS, 2019, 7 : 127128 - 127140
  • [30] Feature Selection on High Dimensional Data using Wrapper Based Subset Selection
    Manikandan, G.
    Susi, E.
    Abirami, S.
    2017 SECOND INTERNATIONAL CONFERENCE ON RECENT TRENDS AND CHALLENGES IN COMPUTATIONAL MODELS (ICRTCCM), 2017, : 320 - 325