Random Forests

被引:231
|
作者
Leo Breiman
机构
[1] University of California,Statistics Department
来源
Machine Learning | 2001年 / 45卷
关键词
classification; regression; ensemble;
D O I
暂无
中图分类号
学科分类号
摘要
Random forests are a combination of tree predictors such that each tree depends on the values of a random vector sampled independently and with the same distribution for all trees in the forest. The generalization error for forests converges a.s. to a limit as the number of trees in the forest becomes large. The generalization error of a forest of tree classifiers depends on the strength of the individual trees in the forest and the correlation between them. Using a random selection of features to split each node yields error rates that compare favorably to Adaboost (Y. Freund & R. Schapire, Machine Learning: Proceedings of the Thirteenth International conference, ***, 148–156), but are more robust with respect to noise. Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the splitting. Internal estimates are also used to measure variable importance. These ideas are also applicable to regression.
引用
收藏
页码:5 / 32
页数:27
相关论文
共 50 条
  • [41] Random Shapley Forests: Cooperative Game-Based Random Forests With Consistency
    Sun, Jianyuan
    Yu, Hui
    Zhong, Guoqiang
    Dong, Junyu
    Zhang, Shu
    Yu, Hongchuan
    IEEE TRANSACTIONS ON CYBERNETICS, 2022, 52 (01) : 205 - 214
  • [42] Banzhaf random forests: Cooperative game theory based random forests with consistency
    Sun, Jianyuan
    Zhong, Guoqiang
    Huang, Kaizhu
    Dong, Junyu
    NEURAL NETWORKS, 2018, 106 : 20 - 29
  • [43] Diversified random forests using random subspaces
    Fawagreh, Khaled
    Gaber, Mohamed Medhat
    Elyan, Eyad
    Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2014, 8669 : 85 - 92
  • [44] Random Musical Bands Playing in Random Forests
    Kursa, Miron B.
    Kubera, Elzbieta
    Rudnicki, Witold R.
    Wieczorkowska, Alicja A.
    ROUGH SETS AND CURRENT TRENDS IN COMPUTING, PROCEEDINGS, 2010, 6086 : 580 - +
  • [45] Diversified Random Forests Using Random Subspaces
    Fawagreh, Khaled
    Gaber, Mohamed Medhat
    Elyan, Eyad
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING - IDEAL 2014, 2014, 8669 : 85 - 92
  • [46] Tracking Cloud Forests With Cloud Technology and Random Forests
    Pascual, Pasky
    Pascual, Cam
    FRONTIERS IN ENVIRONMENTAL SCIENCE, 2021, 9
  • [47] Random Forests for Time Series
    Goehry, Benjamin
    Yan, Hui
    Goude, Yannig
    Massart, Pascal
    Poggi, Jean-Michel
    REVSTAT-STATISTICAL JOURNAL, 2023, 21 (02) : 283 - 302
  • [48] Oxides Classification with Random Forests
    Xiao, Kai
    Chen, Baitong
    Bao, Wenzheng
    Cheng, Honglin
    INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2022, PT II, 2022, 13394 : 680 - 686
  • [49] Random Forest for the Real Forests
    Agrawal, Sharan
    Rana, Shivam
    Ahmad, Tanvir
    PROCEEDINGS OF THE SECOND INTERNATIONAL CONFERENCE ON COMPUTER AND COMMUNICATION TECHNOLOGIES, IC3T 2015, VOL 3, 2016, 381 : 301 - 309
  • [50] Robustness of random forests for regression
    Roy, Marie-Helene
    Larocque, Denis
    JOURNAL OF NONPARAMETRIC STATISTICS, 2012, 24 (04) : 993 - 1006