Ensembles of Overfit and Overconfident Forecasts

Cited by: 49
Authors
Grushka-Cockayne, Yael [1]
Jose, Victor Richmond R. [2]
Lichtendahl, Kenneth C., Jr. [1]
Affiliations
[1] Univ Virginia, Darden Sch Business, Charlottesville, VA 22903 USA
[2] Georgetown Univ, McDonough Sch Business, Washington, DC 20057 USA
Keywords
wisdom of crowds; base-rate neglect; linear opinion pool; trimmed opinion pool; hit rate; calibration; random forest; data science; PROBABILITY; AGGREGATION; CALIBRATION; ACCURACY; OPINIONS; WISDOM;
DOI
10.1287/mnsc.2015.2389
Chinese Library Classification number
C93 [Management Science];
Discipline classification code
12 ; 1201 ; 1202 ; 120202 ;
Abstract
Firms today average forecasts collected from multiple experts and models. Because of cognitive biases, strategic incentives, or the structure of machine-learning algorithms, these forecasts are often overfit to sample data and are overconfident. Little is known about the challenges associated with aggregating such forecasts. We introduce a theoretical model to examine the combined effect of overfitting and overconfidence on the average forecast. Their combined effect is that the mean and median probability forecasts are poorly calibrated with hit rates of their prediction intervals too high and too low, respectively. Consequently, we prescribe the use of a trimmed average, or trimmed opinion pool, to achieve better calibration. We identify the random forest, a leading machine-learning algorithm that pools hundreds of overfit and overconfident regression trees, as an ideal environment for trimming probabilities. Using several known data sets, we demonstrate that trimmed ensembles can significantly improve the random forest's predictive accuracy.
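The trimmed opinion pool mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes each expert or tree reports a single probability for one event, and it trims an equal share of the lowest and highest forecasts before averaging. The function name `trimmed_opinion_pool`, the `trim` parameter, and the sample forecasts are hypothetical and chosen for illustration only.

```python
from statistics import mean


def trimmed_opinion_pool(forecasts, trim=0.2):
    """Average probability forecasts after dropping a share from each tail.

    Hypothetical helper: `trim` is the fraction of forecasts removed from
    the low end and again from the high end. trim=0.0 gives the plain
    average (the linear opinion pool).
    """
    if not forecasts:
        raise ValueError("need at least one forecast")
    if not 0 <= trim < 0.5:
        raise ValueError("trim must be in [0, 0.5)")
    ordered = sorted(forecasts)
    k = int(len(ordered) * trim)  # number of forecasts dropped per tail
    kept = ordered[k:len(ordered) - k]
    return mean(kept)


# Made-up forecasts from five sources, two of them extreme.
forecasts = [0.05, 0.60, 0.65, 0.70, 0.99]
linear_pool = trimmed_opinion_pool(forecasts, trim=0.0)
trimmed_pool = trimmed_opinion_pool(forecasts, trim=0.2)
print(f"linear pool:  {linear_pool:.3f}")
print(f"trimmed pool: {trimmed_pool:.3f}")
```

With five forecasts and `trim=0.2`, one forecast is dropped from each end, so the pooled value depends only on the three central forecasts. How much to trim, and whether to trim probabilities or whole distributions, is a modeling choice this sketch does not settle.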
Pages: 1110 - 1130
Number of pages: 21
Related papers
50 in total
  • [31] Neural network model ensembles for building-level electricity load forecasts
    Jetcheva, Jorjeta G.
    Majidpour, Mostafa
    Chen, Wei-Peng
    ENERGY AND BUILDINGS, 2014, 84 : 214 - 223
  • [32] Evaluation of Probabilistic Quality and Value of the ENSEMBLES Multimodel Seasonal Forecasts: Comparison with DEMETER
    Alessandri, Andrea
    Borrelli, Andrea
    Navarra, Antonio
    Arribas, Alberto
    Deque, Michel
    Rogel, Philippe
    Weisheimer, Antje
    MONTHLY WEATHER REVIEW, 2011, 139 (02) : 581 - 607
  • [33] Skill of ENSEMBLES seasonal re-forecasts for malaria prediction in West Africa
    Jones, A. E.
    Morse, A. P.
    GEOPHYSICAL RESEARCH LETTERS, 2012, 39
  • [34] LEARNING TO BE OVERCONFIDENT AND UNDERCONFIDENT
    Lu, Yuanzhu
    Hu, Jinming
    Gong, Yaxian
    SINGAPORE ECONOMIC REVIEW, 2023, 68 (05) : 1815 - 1827
  • [35] Machine Learning Students Overfit to Overfitting
    Valdenegro-Toro, Matias
    Sabatelli, Matthia
    THIRD TEACHING MACHINE LEARNING AND ARTIFICIAL INTELLIGENCE WORKSHOP, VOL 207, 2022, 207
  • [36] Overconfident Consumers in the Marketplace
    Grubb, Michael D.
    JOURNAL OF ECONOMIC PERSPECTIVES, 2015, 29 (04) : 9 - 36
  • [37] Are professional forecasters overconfident?
    Casey, Eddie
    INTERNATIONAL JOURNAL OF FORECASTING, 2021, 37 (02) : 716 - 732
  • [38] Selling to Overconfident Consumers
    Grubb, Michael D.
    AMERICAN ECONOMIC REVIEW, 2009, 99 (05) : 1770 - 1807
  • [39] Overconfident Distribution Channels
    Li, Meng
    PRODUCTION AND OPERATIONS MANAGEMENT, 2019, 28 (06) : 1347 - 1365
  • [40] The overconfident, or the more informed?
    Gui, Hefa
    Cai, Mingchao
    Wang, Yongxiang
    APPLIED ECONOMICS LETTERS, 2009, 16 (03) : 315 - 318