Improving Time Series Regression Model Accuracy via Systematic Training Dataset Augmentation and Sampling

被引:1
|
作者
Stroebel, Robin [1 ]
Mau, Marcus [1 ]
Puchta, Alexander [1 ]
Fleischer, Juergen [1 ]
机构
[1] Karlsruhe Inst Technol, Wbk Inst Prod Sci, Kaiserstr 12, D-76131 Karlsruhe, Germany
来源
关键词
time series regression; data augmentation; model accuracy; training datasets;
D O I
10.3390/make6020049
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This study addresses a significant gap in the field of time series regression modeling by highlighting the central role of data augmentation in improving model accuracy. The primary objective is to present a detailed methodology for systematic sampling of training datasets through data augmentation to improve the accuracy of time series regression models. Therefore, different augmentation techniques are compared to evaluate their impact on model accuracy across different datasets and model architectures. In addition, this research highlights the need for a standardized approach to creating training datasets using multiple augmentation methods. The lack of a clear framework hinders the easy integration of data augmentation into time series regression pipelines. Our systematic methodology promotes model accuracy while providing a robust foundation for practitioners to seamlessly integrate data augmentation into their modeling practices. The effectiveness of our approach is demonstrated using process data from two milling machines. Experiments show that the optimized training dataset improves the generalization ability of machine learning models in 86.67% of the evaluated scenarios. However, the prediction accuracy of models trained on a sufficient dataset remains largely unaffected. Based on these results, sophisticated sampling strategies such as Quadratic Weighting of multiple augmentation approaches may be beneficial.
引用
收藏
页码:1072 / 1086
页数:15
相关论文
共 50 条
  • [1] EMG dataset augmentation approaches for improving the multi-DOF wrist movement regression accuracy and robustness
    Yang, Wei
    Yang, Dapeng
    Li, Jiaming
    Liu, Yu
    Liu, Hong
    2018 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND BIOMIMETICS (ROBIO), 2018, : 1268 - 1273
  • [2] Improving the accuracy of global forecasting models using time series data augmentation
    Bandara, Kasun
    Hewamalage, Hansika
    Liu, Yuan-Hao
    Kang, Yanfei
    Bergmeir, Christoph
    PATTERN RECOGNITION, 2021, 120
  • [3] Accelerating Dataset Distillation via Model Augmentation
    Zhang, Lei
    Zhang, Jie
    Lei, Bowen
    Mukherjee, Subhabrata
    Pan, Xiang
    Zhao, Bo
    Ding, Caiwen
    Li, Yao
    Xu, Dongkuan
    2023 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2023, : 11950 - 11959
  • [4] The influence of parameter initialization on the training time and accuracy of a nonlinear regression model
    E. V. Burnaev
    P. D. Erofeev
    Journal of Communications Technology and Electronics, 2016, 61 : 646 - 660
  • [5] The influence of parameter initialization on the training time and accuracy of a nonlinear regression model
    Burnaev, E. V.
    Erofeev, P. D.
    JOURNAL OF COMMUNICATIONS TECHNOLOGY AND ELECTRONICS, 2016, 61 (06) : 646 - 660
  • [6] Time Series Model Interpretation via Temporal Feature Sampling
    Liu, Zhaoyang
    Li, Xiaodong
    Cui, Yanping
    WEB AND BIG DATA, PT I, APWEB-WAIM 2023, 2024, 14331 : 346 - 360
  • [7] Improving Accuracy and Efficiency in Time Series Forecasting with an Optimized Transformer Model
    Chen, Junhong
    Dai, Hong
    Wang, Shuang
    Liu, Chengrui
    ENGINEERING LETTERS, 2024, 32 (01) : 1 - 11
  • [8] A Hybrid Regression Model for Improving Prediction Accuracy
    Poojari, Satyanarayana
    Ismail, B.
    ELECTRONIC JOURNAL OF APPLIED STATISTICAL ANALYSIS, 2023, 16 (03) : 784 - 801
  • [9] Improving disk failure detection accuracy via data augmentation
    Wang, Wang
    Tang, Xuehai
    Zhou, Biyu
    Xiao, Wenjie
    Han, Jizhong
    Hu, Songlin
    2022 IEEE/ACM 30TH INTERNATIONAL SYMPOSIUM ON QUALITY OF SERVICE (IWQOS), 2022,
  • [10] Fast Training Logistic Regression via Adaptive Sampling
    Song, Yunsheng
    Kong, Xiaohan
    Huang, Shuoping
    Zhang, Chao
    SCIENTIFIC PROGRAMMING, 2021, 2021