Instance-Based Uncertainty Estimation for Gradient-Boosted Regression Trees

Cited by: 0
Authors
Brophy, Jonathan [1]
Lowd, Daniel [1]
Affiliations
[1] Univ Oregon, Eugene, OR 97403 USA
Keywords
MACHINE; PERFORMANCE; PREDICTION; TUTORIAL; FORESTS;
DOI
Not available
Chinese Library Classification
TP18 [Artificial Intelligence Theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Gradient-boosted regression trees (GBRTs) are hugely popular for solving tabular regression problems, but provide no estimate of uncertainty. We propose Instance-Based Uncertainty estimation for Gradient-boosted regression trees (IBUG), a simple method for extending any GBRT point predictor to produce probabilistic predictions. IBUG computes a non-parametric distribution around a prediction using the k-nearest training instances, where distance is measured with a tree-ensemble kernel. The runtime of IBUG depends on the number of training examples at each leaf in the ensemble, and can be improved by sampling trees or training instances. Empirically, we find that IBUG achieves similar or better performance than the previous state-of-the-art across 22 benchmark regression datasets. We also find that IBUG can achieve improved probabilistic performance by using different base GBRT models, and can more flexibly model the posterior distribution of a prediction than competing methods. We also find that previous methods suffer from poor probabilistic calibration on some datasets, which can be mitigated using a scalar factor tuned on the validation data. Source code is available at https://github.com/jjbrophy47/ibug.
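The abstract describes the method only at a high level, so the following is a minimal sketch rather than the authors' implementation (see the linked repository for that). It assumes the tree-ensemble kernel counts the trees in which a training instance and the test instance fall into the same leaf, and that the spread of the k highest-affinity training targets serves as the variance around the GBRT point prediction. The function names, the Gaussian negative log-likelihood, and the grid search over scale factors are all illustrative assumptions.

```python
import math
from statistics import pvariance


def ibug_sketch(train_leaves, train_targets, test_leaves, point_pred, k):
    """Return (mean, variance, neighbor_targets) for one test instance.

    train_leaves[i][t] is the leaf reached by training instance i in tree t;
    test_leaves[t] is the leaf reached by the test instance in tree t.
    These leaf indices would come from an already-trained GBRT model.
    """
    # Assumed kernel: number of trees where both instances share a leaf.
    affinity = [
        sum(1 for a, b in zip(leaves, test_leaves) if a == b)
        for leaves in train_leaves
    ]
    # Indices of the k training instances with the highest affinity.
    order = sorted(range(len(affinity)), key=lambda i: affinity[i], reverse=True)
    neighbor_targets = [train_targets[i] for i in order[:k]]
    # The point prediction stays as the location; the neighbors' target
    # spread (population variance) is used as the uncertainty estimate.
    return point_pred, pvariance(neighbor_targets), neighbor_targets


def gaussian_nll(y, mean, var):
    """Negative log-likelihood of y under a normal distribution."""
    return 0.5 * (math.log(2 * math.pi * var) + (y - mean) ** 2 / var)


def tune_variance_scale(val_targets, val_means, val_vars, candidates):
    """Pick the scalar factor that minimizes total Gaussian NLL on validation data."""
    def score(scale):
        return sum(
            gaussian_nll(y, m, scale * v)
            for y, m, v in zip(val_targets, val_means, val_vars)
        )
    return min(candidates, key=score)


# Toy data: 4 training instances, 3 trees (hypothetical leaf indices).
train_leaves = [[0, 1, 2], [0, 1, 3], [1, 1, 2], [4, 5, 6]]
train_targets = [1.0, 3.0, 2.0, 100.0]
mean, var, neighbors = ibug_sketch(
    train_leaves, train_targets, test_leaves=[0, 1, 2], point_pred=2.1, k=3
)

# Variances that are too small on validation data get scaled up.
best_scale = tune_variance_scale(
    val_targets=[0.0, 2.0], val_means=[1.0, 1.0], val_vars=[0.25, 0.25],
    candidates=[1.0, 2.0, 4.0, 8.0],
)
```

In the toy data, the fourth training instance shares no leaves with the test instance, so its outlying target (100.0) does not affect the variance estimate. The neighbor targets themselves are also returned, since a non-parametric distribution could be fit to them instead of summarizing them by a variance alone.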
Pages: 15
Related papers
  • [1] Adversarial Training of Gradient-Boosted Decision Trees
    Calzavara, Stefano
    Lucchese, Claudio
    Tolomei, Gabriele
    PROCEEDINGS OF THE 28TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT (CIKM '19), 2019, : 2429 - 2432
  • [2] Adapting and Evaluating Influence-Estimation Methods for Gradient-Boosted Decision Trees
    Brophy, Jonathan
    Hammoudeh, Zayd
    Lowd, Daniel
    JOURNAL OF MACHINE LEARNING RESEARCH, 2023, 24
  • [3] Improving the prediction of an atmospheric chemistry transport model using gradient-boosted regression trees
    Ivatt, Peter D.
    Evans, Mathew J.
    ATMOSPHERIC CHEMISTRY AND PHYSICS, 2020, 20 (13) : 8063 - 8082
  • [4] Gradient-Boosted Based Structured and Unstructured Learning
    Gavito, Andrea Trevino
    Klabjan, Diego
    Utke, Jean
    ARTIFICIAL NEURAL NETWORKS AND MACHINE LEARNING, ICANN 2023, PT III, 2023, 14256 : 439 - 451
  • [5] GBDT-MO: Gradient-Boosted Decision Trees for Multiple Outputs
    Zhang, Zhendong
    Jung, Cheolkon
    IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 32 (07) : 3156 - 3167
  • [6] Dynamic Streamflow Simulation via Online Gradient-Boosted Regression Tree
    Zhang, Heng
    Yang, Qinli
    Shao, Junming
    Wang, Guoqing
    JOURNAL OF HYDROLOGIC ENGINEERING, 2019, 24 (10)
  • [7] Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees
    Jolicoeur-Martineau, Alexia
    Fatras, Kilian
    Kachman, Tal
    INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND STATISTICS, VOL 238, 2024, 238
  • [8] Bagging Gradient-Boosted Trees for High Precision, Low Variance Ranking Models
    Ganjisaffar, Yasser
    Caruana, Rich
    Lopes, Cristina Videira
    PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 85 - 94
  • [9] Residual uncertainty estimation using instance-based learning with applications to hydrologic forecasting
    Wani, Omar
    Beckers, Joost V. L.
    Weerts, Albrecht H.
    Solomatine, Dimitri P.
    HYDROLOGY AND EARTH SYSTEM SCIENCES, 2017, 21 (08) : 4021 - 4036
  • [10] Mixed-Integer Convex Nonlinear Optimization with Gradient-Boosted Trees Embedded
    Mistry, Miten
    Letsios, Dimitrios
    Krennrich, Gerhard
    Lee, Robert M.
    Misener, Ruth
    INFORMS JOURNAL ON COMPUTING, 2021, 33 (03) : 1103 - 1119