Instance-Based Uncertainty Estimation for Gradient-Boosted Regression Trees

被引:0
|
作者
Brophy, Jonathan [1 ]
Lowd, Daniel [1 ]
机构
[1] Univ Oregon, Eugene, OR 97403 USA
关键词
MACHINE; PERFORMANCE; PREDICTION; TUTORIAL; FORESTS;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Gradient-boosted regression trees (GBRTs) are hugely popular for solving tabular regression problems, but provide no estimate of uncertainty. We propose Instance-Based Uncertainty estimation for Gradient-boosted regression trees (IBUG), a simple method for extending any GBRT point predictor to produce probabilistic predictions. IBUG computes a non-parametric distribution around a prediction using the k-nearest training instances, where distance is measured with a tree-ensemble kernel. The runtime of IBUG depends on the number of training examples at each leaf in the ensemble, and can be improved by sampling trees or training instances. Empirically, we find that IBUG achieves similar or better performance than the previous state-of-the-art across 22 benchmark regression datasets. We also find that IBUG can achieve improved probabilistic performance by using different base GBRT models, and can more flexibly model the posterior distribution of a prediction than competing methods. We also find that previous methods suffer from poor probabilistic calibration on some datasets, which can be mitigated using a scalar factor tuned on the validation data. Source code is available at https://github.com/jjbrophy47/ibug.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Wind Ramp Event Prediction with Parallelized Gradient Boosted Regression Trees
    Gupta, Saurav
    Shrivastava, Nitin Anand
    Khosravi, Abbas
    Panigrahi, Bijaya Ketan
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 5296 - 5301
  • [42] Forecasting PM2.5 Concentration Using Gradient-Boosted Regression Tree with CNN Learning Model
    A. Usha Ruby
    J. George Chellin Chandran
    Prasannavenkatesan Theerthagiri
    Renuka Patil
    B. N. Chaithanya
    T. J. Swasthika Jain
    Optical Memory and Neural Networks, 2024, 33 : 86 - 96
  • [43] Estimation of inorganic crystal densities using gradient boosted trees
    Zhao, Jesse
    FRONTIERS IN MATERIALS, 2022, 9
  • [44] Estimation of the masses in the local group by gradient boosted decision trees
    Carlesi, Edoardo
    Hoffman, Yehuda
    Libeskind, Noam, I
    MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY, 2022, 513 (02) : 2385 - 2393
  • [45] Offshore application of landslide susceptibility mapping using gradient-boosted decision trees: a Gulf of Mexico case study
    Dyer, Alec S.
    Mark-Moser, MacKenzie
    Duran, Rodrigo
    Bauer, Jennifer R.
    NATURAL HAZARDS, 2024, 120 (07) : 6223 - 6244
  • [46] Deep neural networks, gradient-boosted trees, random forests: Statistical arbitrage on the S&P 500
    Krauss, Christopher
    Xuan Anh Do
    Huck, Nicolas
    EUROPEAN JOURNAL OF OPERATIONAL RESEARCH, 2017, 259 (02) : 689 - 702
  • [47] Offshore application of landslide susceptibility mapping using gradient-boosted decision trees: a Gulf of Mexico case study
    Alec S. Dyer
    MacKenzie Mark-Moser
    Rodrigo Duran
    Jennifer R. Bauer
    Natural Hazards, 2024, 120 : 6223 - 6244
  • [48] Classification and Recognition of Building Appearance Based on Optimized Gradient-Boosted Decision Tree Algorithm
    Hu, Mengting
    Guo, Lingxiang
    Liu, Jing
    Song, Yuxuan
    SENSORS, 2023, 23 (11)
  • [49] An Instance-Based Method for Remaining Useful Life Estimation for Aircraft Engines
    Xue, Feng
    Bonissone, Piero
    Varma, Anil
    Yan, Weizhong
    Eklund, Neil
    Goebel, Kai
    JOURNAL OF FAILURE ANALYSIS AND PREVENTION, 2008, 8 (02) : 199 - 206
  • [50] PredRSA: a gradient boosted regression trees approach for predicting protein solvent accessibility
    Fan, Chao
    Liu, Diwei
    Huang, Rui
    Chen, Zhigang
    Deng, Lei
    BMC BIOINFORMATICS, 2016, 17