Smart Organization of Imbalanced Traffic Datasets for Long-Term Traffic Forecasting

被引:0
|
作者
Kara, Mustafa M. [1 ]
Turkmen, H. Irem [1 ]
Guvensan, M. Amac [1 ]
机构
[1] Yildiz Tech Univ, Dept Comp Engn, TR-34220 Istanbul, Turkiye
关键词
long-term traffic speed prediction; intelligent transportation systems; deep learning; data preprocessing; imbalanced datasets; data grouping; training enhancements; NEAREST NEIGHBOR MODEL; NEURAL-NETWORK; PREDICTION;
D O I
10.3390/s25041225
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Predicting traffic speed is an important issue, especially in urban regions. Precise long-term forecasts would enable individuals to conserve time and financial resources while diminishing air pollution. Despite extensive research on this subject, to our knowledge, no publications investigate or tackle the issue of imbalanced datasets in traffic speed prediction. Traffic speed data are often biased toward high numbers because low traffic speeds are infrequent. The temporal aspect of traffic carries two important factors for low-speed value. The daily population movement, captured by the time of day, and the weather data, recorded by month, are both considered in this study. Hour-wise Pattern Organization and Month-wise Pattern Organization techniques were devised, which organize the speed data using these two factors as a metric with a view to providing a superior representation of data characteristics that are in the minority. In addition to these two methods, a Speed-wise Pattern Organization strategy is proposed, which arranges train and test samples by setting boundaries on speed while taking the volatile nature of traffic into consideration. We evaluated these strategies using four popular model types: long short-term memory (LSTM), gated recurrent unit networks (GRUs), bi-directional LSTM, and convolutional neural networks (CNNs). GRU had the best performance, achieving a MAPE (Mean Absolute Percentage Error) of 13.51%, whereas LSTM demonstrated the lowest performance, with a MAPE of 13.74%. We validated their robustness through our studies and observed improvements in model accuracy across all categories. While the average improvement was approximately 4%, our methodologies demonstrated superior performance in low-traffic speed scenarios, augmenting model prediction accuracy by 11.2%. The presented methodologies in this study are applied in the pre-processing steps, allowing their application with various models and additional pre-processing procedures to attain comparable performance improvements.
引用
收藏
页数:27
相关论文
共 50 条
  • [21] Forecasting Traffic Flow: Short Term, Long Term, and When It Rains
    Peng, Hao
    Bobade, Santosh U.
    Cotterell, Michael E.
    Miller, John A.
    BIG DATA - BIGDATA 2018, 2018, 10968 : 57 - 71
  • [22] Long-term traffic forecasting based on adaptive graph cross strided convolution network
    Li, Zhao
    Zhang, Yong
    Guo, Da
    Zhou, Xu
    Wang, Xing
    Zhu, Lin
    APPLIED INTELLIGENCE, 2023, 53 (04) : 3672 - 3686
  • [23] Network traffic forecasting model based on long-term intuitionistic fuzzy time series
    Fan, Xiaoshi
    Wang, Yanan
    Zhang, Mengyu
    INFORMATION SCIENCES, 2020, 506 : 131 - 147
  • [24] Multivariate Long-Term Traffic Forecasting with Graph Convolutional Network and Historical Attention Mechanism
    Wang, Zhaohuan
    Xu, Yi
    Han, Liangzhe
    Zhu, Tongyu
    Sun, Leilei
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, PT IV, KSEM 2023, 2023, 14120 : 112 - 123
  • [25] Long-term traffic forecasting based on adaptive graph cross strided convolution network
    Zhao Li
    Yong Zhang
    Da Guo
    Xu Zhou
    Xing Wang
    Lin Zhu
    Applied Intelligence, 2023, 53 : 3672 - 3686
  • [26] Long-term traffic flow forecasting using a hybrid CNN-BiLSTM model
    Mendez, Manuel
    Merayo, Mercedes G.
    Nunez, Manuel
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2023, 121
  • [27] STPSformer: Spatial-Temporal ProbSparse Transformer for Long-Term Traffic Flow Forecasting
    Wang, Zhanquan (zhqwang@ecust.edu.cn), 1600, Institute of Electrical and Electronics Engineers Inc.
  • [28] Long-Term Data Traffic Forecasting for Network Dimensioning in LTE with Short Time Series
    Gijon, Carolina
    Toril, Matias
    Luna-Ramirez, Salvador
    Mari-Altozano, Maria Luisa
    Ruiz-Aviles, Jose Maria
    ELECTRONICS, 2021, 10 (10)
  • [29] Efficient Generative Adversarial Networks for Imbalanced Traffic Collision Datasets
    Chen, Mu-Yen
    Chiang, Hsiu-Sen
    Huang, Wei-Kai
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (10) : 19864 - 19873
  • [30] Automated Traffic Incident Detection: Coping With Imbalanced and Small Datasets
    Xie, Tian
    Shang, Qiang
    Yu, Yang
    IEEE ACCESS, 2022, 10 : 35521 - 35540