Random Forests

被引:231
|
作者
Leo Breiman
机构
[1] University of California,Statistics Department
来源
Machine Learning | 2001年 / 45卷
关键词
classification; regression; ensemble;
D O I
暂无
中图分类号
学科分类号
摘要
Random forests are a combination of tree predictors such that each tree depends on the values of a random vector sampled independently and with the same distribution for all trees in the forest. The generalization error for forests converges a.s. to a limit as the number of trees in the forest becomes large. The generalization error of a forest of tree classifiers depends on the strength of the individual trees in the forest and the correlation between them. Using a random selection of features to split each node yields error rates that compare favorably to Adaboost (Y. Freund & R. Schapire, Machine Learning: Proceedings of the Thirteenth International conference, ***, 148–156), but are more robust with respect to noise. Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the splitting. Internal estimates are also used to measure variable importance. These ideas are also applicable to regression.
引用
收藏
页码:5 / 32
页数:27
相关论文
共 50 条
  • [31] Uplift Random Forests
    Guelman, Leo
    Guillen, Montserrat
    Perez-Marin, Ana M.
    CYBERNETICS AND SYSTEMS, 2015, 46 (3-4) : 230 - 248
  • [32] Subtractive random forests
    Broutin, Nicolas
    Devroye, Luc
    Lugosi, Gabor
    Oliveira, Roberto Imbuzeiro
    ALEA-LATIN AMERICAN JOURNAL OF PROBABILITY AND MATHEMATICAL STATISTICS, 2024, 21 : 575 - 591
  • [33] Neural Random Forests
    Gérard Biau
    Erwan Scornet
    Johannes Welbl
    Sankhya A, 2019, 81 : 347 - 386
  • [34] Random subsequence forests
    He Z.
    Wang J.
    Jiang M.
    Hu L.
    Zou Q.
    Information Sciences, 2024, 667
  • [35] Compressing Random Forests
    Painsky, Amichai
    Rosset, Saharon
    2016 IEEE 16TH INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2016, : 1131 - 1136
  • [36] Random forests for microarrays
    Cutler, Adele
    Stevens, John R.
    DNA MICROARRAYS, PART B: DATABASES AND STATISTICS, 2006, 411 : 422 - +
  • [37] On Oblique Random Forests
    Menze, Bjoern H.
    Kelm, B. Michael
    Splitthoff, Daniel N.
    Koethe, Ullrich
    Hamprecht, Fred A.
    MACHINE LEARNING AND KNOWLEDGE DISCOVERY IN DATABASES, PT II, 2011, 6912 : 453 - 469
  • [38] On the profile of random forests
    Gittenberger, B
    MATHEMATICS AND COMPUTER SCIENCE II: ALGORITHMS, TREES, COMBINATORICS AND PROBABILITIES, 2002, : 279 - 293
  • [39] Random Forests with R
    Maindonald, John H.
    INTERNATIONAL STATISTICAL REVIEW, 2021,
  • [40] Mondrian Forests: Efficient Online Random Forests
    Lakshminarayanan, Balaji
    Roy, Daniel M.
    Teh, Yee Whye
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 27 (NIPS 2014), 2014, 27