Big Data Impacts on Stochastic Forecast Models: Evidence from FX Time Series

被引:1
|
作者
Dietz, Sebastian [1 ]
机构
[1] Univ Passau, Dept Business Adm & Econ, Passau, Germany
关键词
FX prediction; High Frequency Data; Big Data Analytics; Autoregressive Neural Networks; Support Vector Machines; Computational Intelligence;
D O I
10.18187/pjsor.v9i3.587
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
With the rise of the Big Data paradigm new tasks for prediction models appeared. In addition to the volume problem of such data sets nonlinearity becomes important, as the more detailed data sets contain also more comprehensive information, e.g. about non regular seasonal or cyclical movements as well as jumps in time series. This essay compares two nonlinear methods for predicting a high frequency time series, the USD/Euro exchange rate. The first method investigated is Autoregressive Neural Network Processes (ARNN), a neural network based nonlinear extension of classical autoregressive process models from time series analysis (see Dietz 2011). Its advantage is its simple but scalable time series process model architecture, which is able to include all kinds of nonlinearities based on the universal approximation theorem of Hornik, Stinchcombe and White 1989 and the extensions of Hornik 1993. However, restrictions related to the numeric estimation procedures limit the flexibility of the model. The alternative is a Support Vector Machine Model (SVM, Vapnik 1995). The two methods compared have different approaches of error minimization (Empirical error minimization at the ARNN vs. structural error minimization at the SVM). Our new finding is, that time series data classified as "Big Data" need new methods for prediction. Those new methods should be able to be "customized" to nonlinearity and other non-standard effects, which come along with increasing data volume and can not be standardized to be included in tradional time series models. Estimation and prediction was performed using the statistical programming language R. Besides prediction results we will also discuss the impact of Big Data on data preparation and model validation steps.
引用
收藏
页码:277 / 291
页数:15
相关论文
共 50 条
  • [31] Discovering Stochastic Dynamical Equations from Ecological Time Series Data
    Nabeel, Arshed
    Karichannavar, Ashwin
    Palathingal, Shuaib
    Jhawar, Jitesh
    Bruckner, David B.
    Raj, M. Danny
    Guttal, Vishwesha
    AMERICAN NATURALIST, 2025, 205 (04): : E100 - E117
  • [32] The Distance Between: An Algorithmic Approach to Comparing Stochastic Models to Time-Series Data
    Sherlock, Brock D.
    Boon, Marko A. A.
    Vlasiou, Maria
    Coster, Adelle C. F.
    BULLETIN OF MATHEMATICAL BIOLOGY, 2024, 86 (09)
  • [33] Inference of stochastic time series with missing data
    Lee, Sangwon
    Periwal, Vipul
    Jo, Junghyo
    PHYSICAL REVIEW E, 2021, 104 (02)
  • [34] Assessment of forecasts and forecast uncertainty using generalized linear regression models for time series count data
    Vijapurkar, UP
    Gotway, CA
    JOURNAL OF STATISTICAL COMPUTATION AND SIMULATION, 2001, 68 (04) : 321 - 349
  • [35] TRAINING KSIM MODELS FROM TIME-SERIES DATA
    BLACK, RL
    OLDHAM, WJB
    MARCY, WM
    TECHNOLOGICAL FORECASTING AND SOCIAL CHANGE, 1994, 47 (03) : 293 - 307
  • [36] Discovering ecosystem models from time-series data
    George, D
    Saito, K
    Langley, P
    Bay, S
    Arrigo, KR
    DISCOVERY SCIENCE, PROCEEDINGS, 2003, 2843 : 141 - 152
  • [37] Challenges in Constructing Time Series Models from Process Data
    Ledolter, Johannes
    Bisgaard, Soren
    QUALITY AND RELIABILITY ENGINEERING INTERNATIONAL, 2011, 27 (02) : 165 - 178
  • [38] Determinants of suicides in Denmark: Evidence from time series data
    Andres, Antonio R.
    Halicioglu, Ferda
    HEALTH POLICY, 2010, 98 (2-3) : 263 - 269
  • [39] Preference shocks from aggregation: time series data evidence
    Maliar, L
    Maliar, S
    CANADIAN JOURNAL OF ECONOMICS-REVUE CANADIENNE D ECONOMIQUE, 2004, 37 (03): : 768 - 781
  • [40] Real Time Interpretation and Optimization of Time Series Data Stream in Big Data
    Jiang, Zheyuan
    Liu, Ke
    2018 IEEE 3RD INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND BIG DATA ANALYSIS (ICCCBDA), 2018, : 243 - 247