Smart Organization of Imbalanced Traffic Datasets for Long-Term Traffic Forecasting

被引:0
|
作者
Kara, Mustafa M. [1 ]
Turkmen, H. Irem [1 ]
Guvensan, M. Amac [1 ]
机构
[1] Yildiz Tech Univ, Dept Comp Engn, TR-34220 Istanbul, Turkiye
关键词
long-term traffic speed prediction; intelligent transportation systems; deep learning; data preprocessing; imbalanced datasets; data grouping; training enhancements; NEAREST NEIGHBOR MODEL; NEURAL-NETWORK; PREDICTION;
D O I
10.3390/s25041225
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Predicting traffic speed is an important issue, especially in urban regions. Precise long-term forecasts would enable individuals to conserve time and financial resources while diminishing air pollution. Despite extensive research on this subject, to our knowledge, no publications investigate or tackle the issue of imbalanced datasets in traffic speed prediction. Traffic speed data are often biased toward high numbers because low traffic speeds are infrequent. The temporal aspect of traffic carries two important factors for low-speed value. The daily population movement, captured by the time of day, and the weather data, recorded by month, are both considered in this study. Hour-wise Pattern Organization and Month-wise Pattern Organization techniques were devised, which organize the speed data using these two factors as a metric with a view to providing a superior representation of data characteristics that are in the minority. In addition to these two methods, a Speed-wise Pattern Organization strategy is proposed, which arranges train and test samples by setting boundaries on speed while taking the volatile nature of traffic into consideration. We evaluated these strategies using four popular model types: long short-term memory (LSTM), gated recurrent unit networks (GRUs), bi-directional LSTM, and convolutional neural networks (CNNs). GRU had the best performance, achieving a MAPE (Mean Absolute Percentage Error) of 13.51%, whereas LSTM demonstrated the lowest performance, with a MAPE of 13.74%. We validated their robustness through our studies and observed improvements in model accuracy across all categories. While the average improvement was approximately 4%, our methodologies demonstrated superior performance in low-traffic speed scenarios, augmenting model prediction accuracy by 11.2%. The presented methodologies in this study are applied in the pre-processing steps, allowing their application with various models and additional pre-processing procedures to attain comparable performance improvements.
引用
收藏
页数:27
相关论文
共 50 条
  • [1] Long-term forecasting of Internet backbone traffic
    Papagiannaki, K
    Taft, N
    Zhang, ZL
    Diot, C
    IEEE TRANSACTIONS ON NEURAL NETWORKS, 2005, 16 (05): : 1110 - 1124
  • [2] LONG-TERM FORECASTING OF VEHICLE OWNERSHIP AND ROAD TRAFFIC
    TANNER, JC
    JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 1978, 141 : 14 - 63
  • [3] TRECK: Long-Term Traffic Forecasting With Contrastive Representation Learning
    Zheng, Xiao
    Bagloee, Saeed Asadi
    Sarvi, Majid
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 16964 - 16977
  • [4] Constructing Geographic and Long-term Temporal Graph for Traffic Forecasting
    Sun, Yiwen
    Wang, Yulu
    Fu, Kun
    Wang, Zheng
    Zhang, Changshui
    Ye, Jieping
    2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 3483 - 3490
  • [5] Long-term forecasting of Internet traffic for Pakistan Internet exchange
    Alam, Mehreen
    2007 INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, 2007, : 326 - 331
  • [6] Long-term forecasting oriented to urban expressway traffic situation
    Su, Fei
    Dong, Honghui
    Jia, Limin
    Qin, Yong
    Tian, Zhao
    ADVANCES IN MECHANICAL ENGINEERING, 2016, 8 (01)
  • [7] Bayes classifiers for imbalanced traffic accidents datasets
    Mujalli, Randa Oqab
    Lopez, Griselda
    Garach, Laura
    ACCIDENT ANALYSIS AND PREVENTION, 2016, 88 : 37 - 51
  • [8] Long-term traffic pattern forecasting using dynamic classifier selection
    Kianifar, Mohammad Ali
    Motallebi, Hassan
    Bardsiri, Vahid Khatibi
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2023, 44 (06) : 9783 - 9797
  • [9] Long-term Traffic Forecasting in Optical Networks Using Machine Learning
    Walkowiak, Krzysztof
    Szostak, Daniel
    Wlodarczyk, Adam
    Kasprzak, Andrzej
    INTERNATIONAL JOURNAL OF ELECTRONICS AND TELECOMMUNICATIONS, 2023, 69 (04) : 751 - 762
  • [10] A recurrent neural network for urban long-term traffic flow forecasting
    Asma Belhadi
    Youcef Djenouri
    Djamel Djenouri
    Jerry Chun-Wei Lin
    Applied Intelligence, 2020, 50 : 3252 - 3265