Streaming random forests

Authors
Abdulsalam, Hanady [1]
Skillicorn, David B. [1]
Martin, Patrick [1]
Affiliations
[1] Queens Univ, Sch Comp, Kingston, ON K7L 3N6, Canada
Keywords
data mining; classification; decision trees; data-stream classification; random forests
DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Many recent applications deal with data streams, conceptually endless sequences of data records, often arriving at high flow rates. Standard data-mining techniques typically assume that records can be accessed multiple times and so do not naturally extend to streaming data. Algorithms for mining streams must be able to extract all necessary information from records with only one, or perhaps a few, passes over the data. We present the Streaming Random Forests algorithm, an online and incremental stream classification algorithm that extends Breiman's Random Forests algorithm. The Streaming Random Forests algorithm grows multiple decision trees, and classifies unlabelled records based on the plurality of tree votes. We evaluate the classification accuracy of the Streaming Random Forests algorithm on several datasets, and show that its accuracy is comparable to the standard Random Forest algorithm.
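The plurality-vote step mentioned in the abstract can be sketched as follows. This is a minimal illustration only, not the authors' implementation: the `Tree` type, the `plurality_vote` function, and the stub trees are hypothetical names introduced here, and the tie-breaking rule (first label voted wins) is an assumption, since the abstract does not specify one.

```python
from collections import Counter
from typing import Callable, Hashable, Sequence

# Hypothetical stand-in for a trained tree: any callable that maps
# a record (a sequence of feature values) to a class label.
Tree = Callable[[Sequence[float]], Hashable]


def plurality_vote(trees: Sequence[Tree], record: Sequence[float]) -> Hashable:
    """Return the label predicted by the largest number of trees.

    Assumption: ties go to the label that received its first vote earliest.
    """
    if not trees:
        raise ValueError("need at least one tree")
    # Collect one vote per tree and count votes per label.
    votes = Counter(tree(record) for tree in trees)
    # most_common(1) yields the label with the highest count.
    return votes.most_common(1)[0][0]


# Three illustrative stub trees (not learned from data).
stub_trees = [
    lambda r: "pos" if r[0] > 0.5 else "neg",
    lambda r: "pos" if r[1] > 0.5 else "neg",
    lambda r: "pos" if r[0] + r[1] > 1.0 else "neg",
]

label = plurality_vote(stub_trees, [0.9, 0.2])
print(label)
```

Here two of the three stub trees vote for the same class, so that class is returned even though the vote is not unanimous.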
Pages: 225-232
Page count: 8