A better measure of relative prediction accuracy for model selection and model estimation

被引:321
|
作者
Tofallis, Chris [1 ]
机构
[1] Univ Hertfordshire, Hatfield AL10 9AB, Herts, England
关键词
prediction; forecasting; model selection; loss function; regression; time series; RELIABILITY; VALIDITY;
D O I
10.1057/jors.2014.103
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Surveys show that the mean absolute percentage error (MAPE) is the most widely used measure of prediction accuracy in businesses and organizations. It is, however, biased: when used to select among competing prediction methods it systematically selects those whose predictions are too low. This has not been widely discussed and so is not generally known among practitioners. We explain why this happens. We investigate an alternative relative accuracy measure which avoids this bias: the log of the accuracy ratio, that is, log (prediction/actual). Relative accuracy is particularly relevant if the scatter in the data grows as the value of the variable grows (heteroscedasticity). We demonstrate using simulations that for heteroscedastic data (modelled by a multiplicative error factor) the proposed metric is far superior to MAPE for model selection. Another use for accuracy measures is in fitting parameters to prediction models Minimum MAPE models do not predict a simple statistic and so theoretical analysis is limited. We prove that when the proposed metric is used instead, the resulting least squares regression model predicts the geometric mean. This important property allows its theoretical properties to be understood.
引用
收藏
页码:1352 / 1362
页数:11
相关论文
共 50 条
  • [31] Statistical estimation with model selection
    Birge, Lucien
    INDAGATIONES MATHEMATICAE-NEW SERIES, 2006, 17 (04): : 497 - 537
  • [32] Economics of genomic selection: the role of prediction accuracy and relative genotyping costs
    Predrag Rajsic
    Alfons Weersink
    Alireza Navabi
    K. Peter Pauls
    Euphytica, 2016, 210 : 259 - 276
  • [33] Economics of genomic selection: the role of prediction accuracy and relative genotyping costs
    Rajsic, Predrag
    Weersink, Alfons
    Navabi, Alireza
    Pauls, K. Peter
    EUPHYTICA, 2016, 210 (02) : 259 - 276
  • [34] Prediction and Model Selection in Experiments
    Breig, Zachary
    ECONOMIC RECORD, 2020, 96 (313) : 153 - 176
  • [35] Canopy coverage of wheat measured with high accuracy using the HSV colour model and relative depth estimation model, MiDaS
    Mizuta, Keisuke
    Sato, Yuichi
    Hayashi, Jun-Ichiro
    Toyota, Masanori
    Morokuma, Masahiro
    PLANT PRODUCTION SCIENCE, 2024, 27 (04) : 294 - 303
  • [36] Is random model better? On its accuracy and efficiency
    Fan, W
    Wang, HX
    Yu, PS
    Ma, S
    THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2003, : 51 - 58
  • [37] Measure Estimation in the Barycentric Coding Model
    Werenski, Matthew
    Jiang, Ruijie
    Tasissa, Abiy
    Aeron, Shuchin
    Murphy, James M.
    INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
  • [38] Developing cancer prediction model based on stepwise selection by AUC measure for proteomics data
    Kim, Yongkang
    Lee, Seungyeoun
    Kwon, Min-Seok
    Na, Ahrum
    Chop, Yonghwan
    Yp, Sung Gon
    Namkung, Junghyun
    Han, Sangjo
    Kang, Meejoo
    Kim, Sun Whe
    Jang, Jin-Young
    Kim, Yikwon
    Kim, Youngsoo
    Park, Taesung
    PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 1345 - 1350
  • [39] Benchmarks and the accuracy of GARCH model estimation
    Brooks, C
    Burke, SP
    Persand, G
    INTERNATIONAL JOURNAL OF FORECASTING, 2001, 17 (01) : 45 - 56
  • [40] Model Selection of Symbolic Regression to Improve the Accuracy of PM2.5 Concentration Prediction
    Yang, Guangfei
    Huang, Jian
    TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2015, 2015, 9441 : 189 - 197