Ensembles of Overfit and Overconfident Forecasts

Cited by: 49
Authors
Grushka-Cockayne, Yael [1]
Jose, Victor Richmond R. [2]
Lichtendahl, Kenneth C., Jr. [1]
Affiliations
[1] Univ Virginia, Darden Sch Business, Charlottesville, VA 22903 USA
[2] Georgetown Univ, McDonough Sch Business, Washington, DC 20057 USA
Keywords
wisdom of crowds; base-rate neglect; linear opinion pool; trimmed opinion pool; hit rate; calibration; random forest; data science; PROBABILITY; AGGREGATION; CALIBRATION; ACCURACY; OPINIONS; WISDOM;
DOI
10.1287/mnsc.2015.2389
Chinese Library Classification number
C93 [Management Science];
Discipline classification codes
12 ; 1201 ; 1202 ; 120202 ;
Abstract
Firms today average forecasts collected from multiple experts and models. Because of cognitive biases, strategic incentives, or the structure of machine-learning algorithms, these forecasts are often overfit to sample data and are overconfident. Little is known about the challenges associated with aggregating such forecasts. We introduce a theoretical model to examine the combined effect of overfitting and overconfidence on the average forecast. Their combined effect is that the mean and median probability forecasts are poorly calibrated with hit rates of their prediction intervals too high and too low, respectively. Consequently, we prescribe the use of a trimmed average, or trimmed opinion pool, to achieve better calibration. We identify the random forest, a leading machine-learning algorithm that pools hundreds of overfit and overconfident regression trees, as an ideal environment for trimming probabilities. Using several known data sets, we demonstrate that trimmed ensembles can significantly improve the random forest's predictive accuracy.
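To make the contrast in the abstract concrete, the following is a minimal sketch of a symmetric trimmed average of probability forecasts next to the plain average (linear opinion pool). It is an illustration only, not the authors' exact procedure: the function name `trimmed_pool`, the `trim_fraction` parameter, and the sample forecasts are all hypothetical, and the paper may define its trimming differently (for example, on cumulative probabilities rather than point probabilities).

```python
from statistics import mean


def trimmed_pool(probs, trim_fraction=0.2):
    """Symmetric trimmed mean of probability forecasts (illustrative sketch).

    Drops the same number of forecasts from the low and high tails,
    then averages what remains. `trim_fraction` is the share removed
    from EACH tail and is an assumed parameterization.
    """
    if not probs:
        raise ValueError("need at least one forecast")
    if not 0 <= trim_fraction < 0.5:
        raise ValueError("trim_fraction must be in [0, 0.5)")
    ordered = sorted(probs)
    k = int(len(ordered) * trim_fraction)  # forecasts dropped per tail
    kept = ordered[k:len(ordered) - k]     # non-empty because trim_fraction < 0.5
    return mean(kept)


# Hypothetical forecasts of the same event from five experts or trees.
forecasts = [0.05, 0.60, 0.65, 0.70, 0.99]
linear_pool = mean(forecasts)                         # plain average
trimmed = trimmed_pool(forecasts, trim_fraction=0.2)  # drops 0.05 and 0.99
```

With these made-up numbers the trimmed average discards the two most extreme forecasts before averaging, so it is less sensitive to a single overconfident contributor than the plain average is.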
Pages: 1110-1130
Number of pages: 21
Related papers
50 in total
  • [41] Overconfident Competing Newsvendors
    Li, Meng
    Petruzzi, Nicholas C.
    Zhang, Jun
    MANAGEMENT SCIENCE, 2017, 63 (08) : 2637 - 2646
  • [42] Gradient Methods Never Overfit On Separable Data
    Shamir, Ohad
    JOURNAL OF MACHINE LEARNING RESEARCH, 2021, 22
  • [43] Are overconfident executives alike? overconfident executives and compensation structure: Evidence from China
    Huang, Ying Sophie
    Li, Mengyu
    NORTH AMERICAN JOURNAL OF ECONOMICS AND FINANCE, 2019, 48 : 434 - 449
  • [44] Probability forecasts of ice accretion on wind turbines derived from multiphysics and neighbourhood ensembles
    Strauss, Lukas
    Serafin, Stefano
    Dorninger, Manfred
    QUARTERLY JOURNAL OF THE ROYAL METEOROLOGICAL SOCIETY, 2022, 148 (746) : 2446 - 2467
  • [45] Evaluation of the ENSEMBLES multi-model seasonal forecasts of Indian summer monsoon variability
    M. Rajeevan
    C. K. Unnikrishnan
    B. Preethi
    Climate Dynamics, 2012, 38 : 2257 - 2274
  • [46] Evaluation of quantitative precipitation forecasts by TIGGE ensembles for south China during the presummer rainy
    Huang, Ling
    Luo, Yali
    JOURNAL OF GEOPHYSICAL RESEARCH-ATMOSPHERES, 2017, 122 (16) : 8494 - 8516
  • [47] Improving Short-Term Urban Water Demand Forecasts with Reforecast Analog Ensembles
    Tian, Di
    Martinez, Christopher J.
    Asefa, Tirusew
    JOURNAL OF WATER RESOURCES PLANNING AND MANAGEMENT, 2016, 142 (06)
  • [48] Evaluation of the ENSEMBLES multi-model seasonal forecasts of Indian summer monsoon variability
    Rajeevan, M.
    Unnikrishnan, C. K.
    Preethi, B.
    CLIMATE DYNAMICS, 2012, 38 (11-12) : 2257 - 2274
  • [49] A comparison of probabilistic forecasts from bred, singular-vector, and perturbed observation ensembles
    Hamill, TM
    Snyder, C
    Morss, RE
    MONTHLY WEATHER REVIEW, 2000, 128 (06) : 1835 - 1851
  • [50] Uncertainty and scale interactions in ocean ensembles: From seasonal forecasts to multidecadal climate predictions
    Zanna, L.
    Brankart, J. M.
    Huber, M.
    Leroux, S.
    Penduff, T.
    Williams, P. D.
    QUARTERLY JOURNAL OF THE ROYAL METEOROLOGICAL SOCIETY, 2019, 145 (S1) : 160 - 175