Traffic Flow Prediction With Missing Data Imputed by Tensor Completion Methods

被引:27
|
作者
Li, Qin [1 ]
Tan, Huachun [2 ]
Wu, Yuankai [3 ]
Ye, Linhui [4 ]
Ding, Fan [2 ]
机构
[1] Beijing Inst Technol, Sch Mech Engn, Beijing 100081, Peoples R China
[2] Joint Res Inst Internet Mobil Univ Wisconsin Madi, Nanjing 211102, Peoples R China
[3] McGill Univ, Dept Civil Engn, Montreal, PQ H3A 0G4, Canada
[4] Univ Wisconsin, Dept Civil & Environm Engn, Madison, WI 53705 USA
基金
中国国家自然科学基金;
关键词
Missing data imputation; missing traffic data; tensor completion; traffic flow prediction; BIG DATA; DECOMPOSITION; IMPUTATION; MODELS;
D O I
10.1109/ACCESS.2020.2984588
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Missing data is inevitable and ubiquitous in intelligent transportation systems (ITSs). A handful of completion methods have been proposed, among which the tensor-based models have been shown to be the most advantageous for missing traffic data imputation. Despite their superior imputation accuracies, the adoption of these imputed data is not uniform in modern ITSs applications. The primary goal of this paper is to explore how to use tensor completion methods to support ITSs. In particular, we study how to improve traffic flow prediction accuracy under different missing scenarios. Specifically, three common missing scenarios including element-wise random missing, time-structured missing, and space-structured missing are considered. Four classical tensor completion models including Smooth PARAFAC Decomposition based Completion (SPC), CP Decomposition-based (CP-WOPT) Completion, Tucker Decomposition-based Completion (TDI), and High-accuracy Low-rank Tensor Completion (HaLRTC) are used to impute the missing data. Four well-known prediction methods including Support Vector Regression (SVR), K-nearest Neighbor (KNN), Gradient Boost Regression Tree (GBRT), and Long Short-term Memory (LSTM) are tested. The simple mean value interpolation completed traffic data is regarded as the baseline data. The extensive experiments show that improvements of traffic flow prediction can be achieved by increasing missing traffic data imputation accuracy at most cases. Interestingly we find that prediction accuracy cannot be improved by an imputation model when the sparsely observed training datasets already provide sufficient training samples.
引用
收藏
页码:63188 / 63201
页数:14
相关论文
共 50 条
  • [41] Accurate Recovery of Internet Traffic Data: A Sequential Tensor Completion Approach
    Xie, Kun
    Wang, Lele
    Wang, Xin
    Xie, Gaogang
    Wen, Jigang
    Zhang, Guangxing
    Cao, Jiannong
    Zhang, Dafang
    IEEE-ACM TRANSACTIONS ON NETWORKING, 2018, 26 (02) : 793 - 806
  • [42] Internet traffic tensor completion with tensor nuclear norm
    Li, Can
    Chen, Yannan
    Li, Dong-Hui
    COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 2024, 87 (03) : 1033 - 1057
  • [43] Internet traffic tensor completion with tensor nuclear norm
    Can Li
    Yannan Chen
    Dong-Hui Li
    Computational Optimization and Applications, 2024, 87 : 1033 - 1057
  • [44] Tensor Completion for Three-dimensional Radar Imaging with Randomly Missing Data
    Qiu, Wei
    Zhou, Jianxiong
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON DIGITAL SIGNAL PROCESSING (DSP), 2018,
  • [45] RETRACTED: Low Multilinear Rank Tensor Completion with Missing Data (Retracted Article)
    Tan, Huachun
    Feng, Jianshuai
    Li, Feng
    Zhang, Yujin
    Chen, Tao
    2011 INTERNATIONAL CONFERENCE ON ENERGY AND ENVIRONMENTAL SCIENCE-ICEES 2011, 2011, 11
  • [46] A Method Based on Tensor Decomposition for Missing Multi-dimensional Data Completion
    Chen, Jianke
    Chen, Pinghua
    2017 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA), 2017, : 149 - 153
  • [47] A tensor completion algorithm for missing user data in spot trading of electricity market
    Yang, Ting
    Liu, Guoliang
    Wang, Yong
    Suo, Siyuan
    Zhang, Meiling
    Yang, Zhenning
    COMPUTERS & ELECTRICAL ENGINEERING, 2025, 122
  • [48] Tucker factorization with missing data with application to low--rank tensor completion
    Filipovic, Marko
    Jukic, Ante
    MULTIDIMENSIONAL SYSTEMS AND SIGNAL PROCESSING, 2015, 26 (03) : 677 - 692
  • [49] A tensor-based K-nearest neighbors method for traffic speed prediction under data missing
    Zheng, Liang
    Huang, Huimin
    Zhu, Chuang
    Zhang, Kunpeng
    TRANSPORTMETRICA B-TRANSPORT DYNAMICS, 2020, 8 (01) : 182 - 199
  • [50] A comparison of model selection methods for prediction in the presence of multiply imputed data
    Le Thi Phuong Thao
    Geskus, Ronald
    BIOMETRICAL JOURNAL, 2019, 61 (02) : 343 - 356