Accelerating difficulty estimation for conformal regression forests

被引:0
|
作者
Henrik Boström
Henrik Linusson
Tuve Löfström
Ulf Johansson
机构
[1] Stockholm University,Department of Computer and Systems Sciences
[2] University of Borås,Department of Information Technology
[3] Jönköping University,Department of Computer Science and Informatics
关键词
Conformal prediction; Nonconformity measures; Regression; Random forests; 62G08; 62G15; 62J02; 62M20;
D O I
暂无
中图分类号
学科分类号
摘要
The conformal prediction framework allows for specifying the probability of making incorrect predictions by a user-provided confidence level. In addition to a learning algorithm, the framework requires a real-valued function, called nonconformity measure, to be specified. The nonconformity measure does not affect the error rate, but the resulting efficiency, i.e., the size of output prediction regions, may vary substantially. A recent large-scale empirical evaluation of conformal regression approaches showed that using random forests as the learning algorithm together with a nonconformity measure based on out-of-bag errors normalized using a nearest-neighbor-based difficulty estimate, resulted in state-of-the-art performance with respect to efficiency. However, the nearest-neighbor procedure incurs a significant computational cost. In this study, a more straightforward nonconformity measure is investigated, where the difficulty estimate employed for normalization is based on the variance of the predictions made by the trees in a forest. A large-scale empirical evaluation is presented, showing that both the nearest-neighbor-based and the variance-based measures significantly outperform a standard (non-normalized) nonconformity measure, while no significant difference in efficiency between the two normalized approaches is observed. The evaluation moreover shows that the computational cost of the variance-based measure is several orders of magnitude lower than when employing the nearest-neighbor-based nonconformity measure. The use of out-of-bag instances for calibration does, however, result in nonconformity scores that are distributed differently from those obtained from test instances, questioning the validity of the approach. An adjustment of the variance-based measure is presented, which is shown to be valid and also to have a significant positive effect on the efficiency. For conformal regression forests, the variance-based nonconformity measure is hence a computationally efficient and theoretically well-founded alternative to the nearest-neighbor procedure.
引用
收藏
页码:125 / 144
页数:19
相关论文
共 50 条
  • [21] Soil Moisture Estimation Based on Polarimetric Decomposition and Quantile Regression Forests
    Zhang, Li
    Lv, Xiaolei
    Wang, Rui
    REMOTE SENSING, 2022, 14 (17)
  • [22] Direct Estimation of Cardiac Bi-ventricular Volumes with Regression Forests
    Zhen, Xiantong
    Wang, Zhijie
    Islam, Ali
    Bhaduri, Mousumi
    Chan, Ian
    Li, Shuo
    MEDICAL IMAGE COMPUTING AND COMPUTER-ASSISTED INTERVENTION - MICCAI 2014, PT II, 2014, 8674 : 586 - 593
  • [23] Growing Regression Tree Forests by Classification for Continuous Object Pose Estimation
    Kota Hara
    Rama Chellappa
    International Journal of Computer Vision, 2017, 122 : 292 - 312
  • [24] Estimation of suspended sediment concentration and yield using linear models, random forests and quantile regression forests
    Francke, T.
    Lopez-Tarazon, J. A.
    Schroeder, B.
    HYDROLOGICAL PROCESSES, 2008, 22 (25) : 4892 - 4904
  • [25] Conformal symmetries of FRW accelerating cosmologies
    Kehagias, A.
    Riotto, A.
    NUCLEAR PHYSICS B, 2014, 884 : 547 - 565
  • [26] Real-Time Head Pose Estimation Using Random Regression Forests
    Tang, Yunqi
    Sun, Zhenan
    Tan, Tieniu
    BIOMETRIC RECOGNITION: CCBR 2011, 2011, 7098 : 66 - 73
  • [27] Segmentation-Free Estimation of Kidney Volumes in CT with Dual Regression Forests
    Hussain, Mohammad Arafat
    Hamarneh, Ghassan
    O'Connell, Timothy W.
    Mohammed, Mohammed F.
    Abugharbieh, Rafeef
    MACHINE LEARNING IN MEDICAL IMAGING, MLMI 2016, 2016, 10019 : 156 - 163
  • [28] Dynamic random regression forests for real-time head pose estimation
    Ying, Ying
    Wang, Han
    MACHINE VISION AND APPLICATIONS, 2013, 24 (08) : 1705 - 1719
  • [29] Estimation of the Aboveground Biomass of Forests in Complex Mountainous Areas Using Regression Kriging
    Luo, Yining
    Yan, Lihui
    Zhou, Zhongfa
    Huang, Denghong
    Cai, Lu
    Du, Shuanglong
    Yang, Yue
    Huang, Youyan
    Li, Qianxia
    FORESTS, 2024, 15 (10):
  • [30] Dynamic random regression forests for real-time head pose estimation
    Ying Ying
    Han Wang
    Machine Vision and Applications, 2013, 24 : 1705 - 1719