Ensemble correlation-based low-rank matrix completion with applications to traffic data imputation

被引:51
|
作者
Chen, Xiaobo [1 ]
Wei, Zhongjie [2 ]
Li, Zuoyong [3 ]
Liang, Jun [1 ]
Cai, Yingfeng [1 ]
Zhang, Bob [4 ]
机构
[1] Jiangsu Univ, Automot Engn Res Ins, Zhenjiang 212013, Peoples R China
[2] Jiangsu Univ, Sch Automot & Traff Engn, Zhenjiang 212013, Peoples R China
[3] Minjiang Univ, Fujian Prov Key Lab Informat Proc & Intelligent C, Fuzhou 350108, Fujian, Peoples R China
[4] Univ Macau, Dept Comp & Informat Sci, Macau, Peoples R China
基金
中国国家自然科学基金; 中国博士后科学基金;
关键词
Missing data; Low-rank matrix completion; Nearest neighbor; Pearson's correlation; Ensemble learning; MISSING VALUE ESTIMATION;
D O I
10.1016/j.knosys.2017.06.010
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Low-rank matrix completion (LRMC) is a recently emerging technique which has achieved promising performance in many real-world applications, such as traffic data imputation. In order to estimate missing values, the current LRMC based methods optimize the rank of the matrix comprising the whole traffic data, potentially assuming that all traffic data is equally important. As a result, it puts more emphasis on the commonality of traffic data while ignoring its subtle but crucial difference due to different locations of loop detectors as well as dates of sampling. To handle this problem and further improve imputation performance, a novel correlation-based LRMC method is proposed in this paper. Firstly, LRMC is applied to get initial estimations of missing values. Then, a distance matrix containing pairwise distance between samples is built based on a weighted Pearson's correlation which strikes a balance between observed values and imputed values. For a specific sample, its most similar samples based on the distance matrix constructed are chosen by using an adaptive K-nearest neighboring (KNN) search. LRMC is then applied on these samples with much stronger correlation to obtain refined estimations of missing values. Finally, we also propose a simple but effective ensemble learning strategy to integrate multiple imputed values for a specific sample for further improving imputation performance. Extensive numerical experiments are performed on both traffic flow volume data as well as standard benchmark datasets. The results confirm that the proposed correlation-based LRMC and its ensemble learning version achieve better imputation performance than competing methods. (C) 2017 Elsevier B.V. All rights reserved.
引用
收藏
页码:249 / 262
页数:14
相关论文
共 50 条
  • [1] Low-Rank Autoregressive Tensor Completion for Spatiotemporal Traffic Data Imputation
    Chen, Xinyu
    Lei, Mengying
    Saunier, Nicolas
    Sun, Lijun
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (08) : 12301 - 12310
  • [2] Traffic Data Imputation Algorithm Based on Improved Low-Rank Matrix Decomposition
    Luo, Xianglong
    Meng, Xue
    Gan, Wenjuan
    Chen, Yonghong
    JOURNAL OF SENSORS, 2019, 2019
  • [3] A nonconvex low-rank tensor completion model for spatiotemporal traffic data imputation
    Chen, Xinyu
    Yang, Jinming
    Sun, Lijun
    TRANSPORTATION RESEARCH PART C-EMERGING TECHNOLOGIES, 2020, 117
  • [4] A nonconvex low-rank tensor completion model for spatiotemporal traffic data imputation
    Chen, Xinyu
    Yang, Jinming
    Sun, Lijun
    Transportation Research Part C: Emerging Technologies, 2020, 117
  • [5] Low-Rank Representation based Traffic Data Completion Method
    Du, Rong
    Zhang, Yong
    Wang, Boyue
    Liu, Hao
    Qi, Guanglei
    Yin, Baocai
    2016 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2016, : 5127 - 5134
  • [6] Low-rank traffic matrix completion with marginal information
    Xiong, Zikai
    Wei, Yimin
    Xu, Renjie
    Xu, Yanwei
    JOURNAL OF COMPUTATIONAL AND APPLIED MATHEMATICS, 2022, 410
  • [7] Low-Rank Tensor Completion With 3-D Spatiotemporal Transform for Traffic Data Imputation
    Shu, Hao
    Wang, Hailin
    Peng, Jiangjun
    Meng, Deyu
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (11) : 18673 - 18687
  • [8] Interpolation method of traffic volume missing data based on improved low-rank matrix completion
    Chen, Xiao-Bo
    Chen, Cheng
    Chen, Lei
    Wei, Zhong-Jie
    Cai, Ying-Feng
    Zhou, Jun-Jie
    Jiaotong Yunshu Gongcheng Xuebao/Journal of Traffic and Transportation Engineering, 2019, 19 (05): : 180 - 190
  • [9] Low-Rank Matrix Completion
    Chi, Yuejie
    IEEE SIGNAL PROCESSING MAGAZINE, 2018, 35 (05) : 178 - 181
  • [10] Low-Rank Autoregressive Tucker Decomposition for Traffic Data Imputation
    Lu, Jiaxin
    Gong, Wenwu
    Yang, Lili
    2024 29TH INTERNATIONAL CONFERENCE ON AUTOMATION AND COMPUTING, ICAC 2024, 2024, : 236 - 241