GENERALIZED RANDOM FORESTS

被引:798
|
作者
Athey, Susan [1 ]
Tibshirani, Julie [2 ]
Wager, Stefan [1 ]
机构
[1] Stanford Univ, Stanford Grad Sch Business, 655 Knight Way, Stanford, CA 94305 USA
[2] Elasticsearch BV, 800 West El Camino Real,Suite 350, Mountain View, CA 94040 USA
来源
ANNALS OF STATISTICS | 2019年 / 47卷 / 02期
关键词
Asymptotic theory; causal inference; instrumental variable; INSTRUMENTAL VARIABLE ESTIMATION; PARAMETER INSTABILITY; STRUCTURAL-CHANGE; REGRESSION; TESTS; MODELS; CONSISTENCY; JACKKNIFE; CONSTANCY; VARIANCE;
D O I
10.1214/18-AOS1709
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We propose generalized random forests, a method for nonparametric statistical estimation based on random forests (Breiman [Mach. Learn. 45 (2001) 5-32]) that can be used to fit any quantity of interest identified as the solution to a set of local moment equations. Following the literature on local maximum likelihood estimation, our method considers a weighted set of nearby training examples; however, instead of using classical kernel weighting functions that are prone to a strong curse of dimensionality, we use an adaptive weighting function derived from a forest designed to express heterogeneity in the specified quantity of interest. We propose a flexible, computationally efficient algorithm for growing generalized random forests, develop a large sample theory for our method showing that our estimates are consistent and asymptotically Gaussian and provide an estimator for their asymptotic variance that enables valid confidence intervals. We use our approach to develop new methods for three statistical tasks: nonparametric quantile regression, conditional average partial effect estimation and heterogeneous treatment effect estimation via instrumental variables. A software implementation, grf for R and C++, is available from CRAN.
引用
收藏
页码:1148 / 1178
页数:31
相关论文
共 50 条
  • [1] Generalized random shapelet forests
    Karlsson, Isak
    Papapetrou, Panagiotis
    Bostrom, Henrik
    DATA MINING AND KNOWLEDGE DISCOVERY, 2016, 30 (05) : 1053 - 1085
  • [2] Generalized random shapelet forests
    Isak Karlsson
    Panagiotis Papapetrou
    Henrik Boström
    Data Mining and Knowledge Discovery, 2016, 30 : 1053 - 1085
  • [3] Rates of convergence for random forests via generalized U-statistics
    Peng, Wei
    Coleman, Tim
    Mentch, Lucas
    ELECTRONIC JOURNAL OF STATISTICS, 2022, 16 (01): : 232 - 292
  • [4] Imprecise Extensions of Random Forests and Random Survival Forests
    Utkin, Lev, V
    Kovalev, Maxim S.
    Meldo, Anna A.
    Coolen, Frank P. A.
    PROCEEDINGS OF THE ELEVENTH INTERNATIONAL SYMPOSIUM ON IMPRECISE PROBABILITIES: THEORIES AND APPLICATIONS (ISIPTA 2019), 2019, 103 : 404 - 413
  • [5] Using generalized additive models and random forests to model prosodic prominence in German
    Arnold, Denis
    Wagner, Petra
    Baayen, R. Harald
    14TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2013), VOLS 1-5, 2013, : 272 - 276
  • [6] Random forests
    Breiman, L
    MACHINE LEARNING, 2001, 45 (01) : 5 - 32
  • [7] Random Forests
    Leo Breiman
    Machine Learning, 2001, 45 : 5 - 32
  • [8] Random forests
    Pavlov, YL
    PROBABILISTIC METHODS IN DISCRETE MATHEMATICS, 1997, : 11 - 18
  • [9] Generalized height-diameter models with random effects for natural forests of central Mexico
    Rubio Camacho, Ernesto Alonso
    Rivas, Sacramento Corral
    Lopez Hernandez, Juan Antonio
    Chavez Duran, Alvaro Agustin
    Carmona, Jaqueline Xelhuantzi
    Nagel, Juergen
    CERNE, 2022, 28
  • [10] Evaluation of random forests and generalized regression neural networks for daily reference evapotranspiration modelling
    Feng, Yu
    Cui, Ningbo
    Gong, Daozhi
    Zhang, Qingwen
    Zhao, Lu
    AGRICULTURAL WATER MANAGEMENT, 2017, 193 : 163 - 173