Iterative Subset Selection for Feature Drifting Data Streams

Times cited: 8
Authors
Yuan, Lanqin [1 ]
Pfahringer, Bernhard [2 ]
Barddal, Jean Paul [3 ]
Affiliations
[1] Univ Waikato, Hamilton, New Zealand
[2] Univ Auckland, Department Comp Sci, Auckland, New Zealand
[3] Pontificia Univ Catolica Parana, Programa Posgrad Informat, Curitiba, Parana, Brazil
Keywords
Data Stream Mining; Feature Selection; Concept Drift; Embedded Feature Selection; Iterative Subset Selection;
DOI
10.1145/3167132.3167188
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Feature selection has been studied and shown to improve classifier performance in standard batch data mining, but it remains mostly unexplored in data stream mining. Feature selection becomes even more important when the relevant subset of features changes over time as the underlying concept of a data stream drifts. This kind of drift is known as feature drift, and it requires dedicated techniques not only to determine which features are most important but also to take advantage of them. This paper presents Iterative Subset Selection (ISS), a novel feature subset selection method specialized for feature drift. ISS splits the feature selection process into two stages: it first ranks the features and then iteratively selects features from the ranking. Applying the method together with Naive Bayes or k-Nearest Neighbour as the classifier yields compelling accuracy improvements compared to prior work.
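The two-stage idea in the abstract (rank first, then select iteratively from the ranking) can be illustrated with a minimal sketch. This is not the authors' ISS algorithm: the ranking criterion (information gain), the 1-nearest-neighbour evaluator with Hamming distance, the stopping rule (stop as soon as the next-ranked feature does not improve validation accuracy), and all function names are assumptions made for illustration. On a stream, such a routine would presumably be re-run on each new window of instances so the subset can follow feature drift.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * log2(c / n) for c in Counter(labels).values())


def info_gain(column, labels):
    """Information gain of one discrete feature column w.r.t. the labels."""
    groups = {}
    for value, label in zip(column, labels):
        groups.setdefault(value, []).append(label)
    remainder = sum(len(g) / len(labels) * entropy(g) for g in groups.values())
    return entropy(labels) - remainder


def rank_features(X, y):
    """Stage 1 (assumed criterion): feature indices sorted by information gain."""
    gains = [info_gain([row[j] for row in X], y) for j in range(len(X[0]))]
    return sorted(range(len(gains)), key=lambda j: gains[j], reverse=True)


def nn_accuracy(train_X, train_y, val_X, val_y, subset):
    """Accuracy of 1-NN (Hamming distance) restricted to the given features."""
    correct = 0
    for x, label in zip(val_X, val_y):
        nearest = min(
            range(len(train_X)),
            key=lambda i: sum(train_X[i][j] != x[j] for j in subset),
        )
        correct += train_y[nearest] == label
    return correct / len(val_y)


def iterative_subset(ranking, evaluate):
    """Stage 2 (assumed rule): add ranked features while the score improves."""
    selected, best = [], float("-inf")
    for j in ranking:
        score = evaluate(selected + [j])
        if score <= best:
            break  # the next-ranked feature does not help, so stop growing
        selected.append(j)
        best = score
    return selected


# Toy window: the label equals feature 1; features 0 and 2 are uninformative.
X = [(0, 0, 0), (1, 0, 1), (0, 1, 1), (1, 1, 0),
     (0, 0, 1), (1, 1, 1), (1, 0, 0), (0, 1, 0)]
y = [0, 0, 1, 1, 0, 1, 0, 1]
train_X, train_y, val_X, val_y = X[:4], y[:4], X[4:], y[4:]

ranking = rank_features(X, y)
selected = iterative_subset(
    ranking, lambda s: nn_accuracy(train_X, train_y, val_X, val_y, s)
)
print(ranking, selected)
```

Separating the ranking from the subset search keeps the second stage cheap: it only walks the ranking in order instead of searching over all feature subsets.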
Pages: 510-517
Page count: 8