A better measure of relative prediction accuracy for model selection and model estimation

被引：321

作者：

Tofallis, Chris ^{[1
]}

机构：

[1] Univ Hertfordshire, Hatfield AL10 9AB, Herts, England

来源：

JOURNAL OF THE OPERATIONAL RESEARCH SOCIETY | 2015年 / 66卷 / 08期

关键词：

prediction; forecasting; model selection; loss function; regression; time series; RELIABILITY; VALIDITY;

D O I：

10.1057/jors.2014.103

中图分类号：

C93 [管理学];

学科分类号：

12 ; 1201 ; 1202 ; 120202 ;

摘要：

Surveys show that the mean absolute percentage error (MAPE) is the most widely used measure of prediction accuracy in businesses and organizations. It is, however, biased: when used to select among competing prediction methods it systematically selects those whose predictions are too low. This has not been widely discussed and so is not generally known among practitioners. We explain why this happens. We investigate an alternative relative accuracy measure which avoids this bias: the log of the accuracy ratio, that is, log (prediction/actual). Relative accuracy is particularly relevant if the scatter in the data grows as the value of the variable grows (heteroscedasticity). We demonstrate using simulations that for heteroscedastic data (modelled by a multiplicative error factor) the proposed metric is far superior to MAPE for model selection. Another use for accuracy measures is in fitting parameters to prediction models Minimum MAPE models do not predict a simple statistic and so theoretical analysis is limited. We prove that when the proposed metric is used instead, the resulting least squares regression model predicts the geometric mean. This important property allows its theoretical properties to be understood.

引用

页码：1352 / 1362

页数：11

共 50 条

[31] Statistical estimation with model selection
Birge, Lucien
INDAGATIONES MATHEMATICAE-NEW SERIES, 2006, 17 (04): : 497 - 537
[32] Economics of genomic selection: the role of prediction accuracy and relative genotyping costs
Predrag Rajsic
Alfons Weersink
Alireza Navabi
K. Peter Pauls
Euphytica, 2016, 210 : 259 - 276
[33] Economics of genomic selection: the role of prediction accuracy and relative genotyping costs
Rajsic, Predrag
Weersink, Alfons
Navabi, Alireza
Pauls, K. Peter
EUPHYTICA, 2016, 210 (02) : 259 - 276
[34] Prediction and Model Selection in Experiments
Breig, Zachary
ECONOMIC RECORD, 2020, 96 (313) : 153 - 176
[35] Canopy coverage of wheat measured with high accuracy using the HSV colour model and relative depth estimation model, MiDaS
Mizuta, Keisuke
Sato, Yuichi
Hayashi, Jun-Ichiro
Toyota, Masanori
Morokuma, Masahiro
PLANT PRODUCTION SCIENCE, 2024, 27 (04) : 294 - 303
[36] Is random model better? On its accuracy and efficiency
Fan, W
Wang, HX
Yu, PS
Ma, S
THIRD IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2003, : 51 - 58
[37] Measure Estimation in the Barycentric Coding Model
Werenski, Matthew
Jiang, Ruijie
Tasissa, Abiy
Aeron, Shuchin
Murphy, James M.
INTERNATIONAL CONFERENCE ON MACHINE LEARNING, VOL 162, 2022,
[38] Developing cancer prediction model based on stepwise selection by AUC measure for proteomics data
Kim, Yongkang
Lee, Seungyeoun
Kwon, Min-Seok
Na, Ahrum
Chop, Yonghwan
Yp, Sung Gon
Namkung, Junghyun
Han, Sangjo
Kang, Meejoo
Kim, Sun Whe
Jang, Jin-Young
Kim, Yikwon
Kim, Youngsoo
Park, Taesung
PROCEEDINGS 2015 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE, 2015, : 1345 - 1350
[39] Benchmarks and the accuracy of GARCH model estimation
Brooks, C
Burke, SP
Persand, G
INTERNATIONAL JOURNAL OF FORECASTING, 2001, 17 (01) : 45 - 56
[40] Model Selection of Symbolic Regression to Improve the Accuracy of PM2.5 Concentration Prediction
Yang, Guangfei
Huang, Jian
TRENDS AND APPLICATIONS IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2015, 2015, 9441 : 189 - 197

← 1 2 3 4 5 →