Improving Time Series Regression Model Accuracy via Systematic Training Dataset Augmentation and Sampling

Cited by: 1
Authors
Stroebel, Robin [1]
Mau, Marcus [1]
Puchta, Alexander [1]
Fleischer, Juergen [1]
Affiliations
[1] Karlsruhe Inst Technol, Wbk Inst Prod Sci, Kaiserstr 12, D-76131 Karlsruhe, Germany
Keywords
time series regression; data augmentation; model accuracy; training datasets
DOI
10.3390/make6020049
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This study addresses a significant gap in time series regression modeling by highlighting the central role of data augmentation in improving model accuracy. The primary objective is to present a detailed methodology for systematically sampling training datasets through data augmentation so that time series regression models become more accurate. To this end, different augmentation techniques are compared to evaluate their impact on model accuracy across different datasets and model architectures. In addition, this research highlights the need for a standardized approach to creating training datasets using multiple augmentation methods, since the lack of a clear framework hinders the integration of data augmentation into time series regression pipelines. The proposed systematic methodology improves model accuracy while giving practitioners a robust foundation for integrating data augmentation into their modeling practices. The effectiveness of the approach is demonstrated using process data from two milling machines. Experiments show that the optimized training dataset improves the generalization ability of machine learning models in 86.67% of the evaluated scenarios. However, the prediction accuracy of models trained on an already sufficient dataset remains largely unaffected. Based on these results, sophisticated sampling strategies such as Quadratic Weighting of multiple augmentation approaches may be beneficial.
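The idea of sampling a training dataset from several augmentation methods with quadratic weights can be illustrated with a minimal sketch. The abstract does not define Quadratic Weighting or name the augmentation techniques used, so everything below is an assumption made for illustration: the three augmentations (`jitter`, `scale`, `magnitude_offset`), the per-method `scores`, the choice to leave targets unchanged, and the rule that each method's sampling probability is proportional to its squared score. It is not the authors' implementation.

```python
import numpy as np

# Hypothetical augmentations; the abstract does not name the methods used.
def jitter(x, rng, sigma=0.05):
    """Add independent Gaussian noise to every time step."""
    return x + rng.normal(0.0, sigma, size=x.shape)

def scale(x, rng, sigma=0.1):
    """Multiply the whole series by one random factor close to 1."""
    return x * rng.normal(1.0, sigma)

def magnitude_offset(x, rng, sigma=0.1):
    """Shift the whole series by one random constant."""
    return x + rng.normal(0.0, sigma)

def quadratic_weights(scores):
    """Assumed rule: sampling probability proportional to score squared."""
    sq = np.asarray(scores, dtype=float) ** 2
    return sq / sq.sum()

def augment_dataset(X, y, methods, scores, n_new, seed=0):
    """Append n_new augmented series, split across methods by quadratic weights.

    Targets are copied unchanged from the source series (an assumption).
    """
    rng = np.random.default_rng(seed)
    counts = rng.multinomial(n_new, quadratic_weights(scores))
    X_parts, y_parts = [X], [y]
    for method, count in zip(methods, counts):
        if count == 0:
            continue
        idx = rng.integers(0, len(X), size=count)  # source series to augment
        X_parts.append(np.stack([method(X[i], rng) for i in idx]))
        y_parts.append(y[idx])
    return np.concatenate(X_parts), np.concatenate(y_parts)

# Toy data: 10 sine-like series of 50 steps with a scalar target each.
X = np.sin(np.linspace(0, 6, 50))[None, :] * np.arange(1, 11)[:, None]
y = X.mean(axis=1)
X_aug, y_aug = augment_dataset(
    X, y, [jitter, scale, magnitude_offset], scores=[1.0, 2.0, 3.0], n_new=20
)
print(X_aug.shape, y_aug.shape)  # (30, 50) (30,)
```

With scores of 1, 2 and 3, the assumed weights are 1/14, 4/14 and 9/14, so the best-scoring method contributes most of the new samples while weaker methods still appear occasionally.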
Pages: 1072-1086
Page count: 15