Missing Traffic Data Imputation with a Linear Generative Model Based on Probabilistic Principal Component Analysis

被引:2
|
作者
Huang, Liping [1 ]
Li, Zhenghuan [1 ]
Luo, Ruikang [1 ]
Su, Rong [1 ]
机构
[1] Nanyang Technol Univ, Sch Elect & Elect Engn, Singapore 639798, Singapore
关键词
missing data; urban traffic sensing; probabilistic; principal component analysis; PREDICTION;
D O I
10.3390/s23010204
中图分类号
O65 [分析化学];
学科分类号
070302 ; 081704 ;
摘要
Even with the ubiquitous sensing data in intelligent transportation systems, such as the mobile sensing of vehicle trajectories, traffic estimation is still faced with the data missing problem due to the detector faults or limited number of probe vehicles as mobile sensors. Such data missing issue poses an obstacle for many further explorations, e.g., the link-based traffic status modeling. Although many studies have focused on tackling this kind of problem, existing studies mainly focus on the situation in which data are missing at random and ignore the distinction between links of missing data. In the practical scenario, traffic speed data are always missing not at random (MNAR). The distinction for recovering missing data on different links has not been studied yet. In this paper, we propose a general linear model based on probabilistic principal component analysis (PPCA) for solving MNAR traffic speed data imputation. Furthermore, we propose a metric, i.e., Pearson score (p-score), for distinguishing links and investigate how the model performs on links with different p-score values. Experimental results show that the new model outperforms the typically used PPCA model, and missing data on links with higher p-score values can be better recovered.
引用
收藏
页数:13
相关论文
共 50 条
  • [31] Principal component analysis of dissolution data with missing elements
    Adams, E
    Walczak, B
    Vervaet, C
    Risha, PG
    Massart, DL
    INTERNATIONAL JOURNAL OF PHARMACEUTICS, 2002, 234 (1-2) : 169 - 178
  • [32] Missing Data Repairs for Traffic Flow With Self-Attention Generative Adversarial Imputation Net
    Zhang, Weibin
    Zhang, Pulin
    Yu, Yinghao
    Li, Xiying
    Biancardo, Salvatore Antonio
    Zhang, Junyi
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 23 (07) : 7919 - 7930
  • [33] Multiple imputation in principal component analysis
    Julie Josse
    Jérôme Pagès
    François Husson
    Advances in Data Analysis and Classification, 2011, 5 : 231 - 246
  • [34] Identifiable Generative Models for Missing Not at Random Data Imputation
    Ma, Chao
    Zhang, Cheng
    ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
  • [35] Improved generative adversarial imputation networks for missing data
    Qin, Xiwen
    Shi, Hongyu
    Dong, Xiaogang
    Zhang, Siqi
    Yuan, Liping
    APPLIED INTELLIGENCE, 2024, 54 (21) : 11068 - 11082
  • [36] MissII: Missing Information Imputation for Traffic Data
    Hou, Mingliang
    Tang, Tao
    Xia, Feng
    Sultan, Ibrahim
    Kaur, Roopdeep
    Kong, Xiangjie
    IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING, 2024, 12 (03) : 752 - 765
  • [37] A Comprehensive Survey on Traffic Missing Data Imputation
    Zhang, Yimei
    Kong, Xiangjie
    Zhou, Wenfeng
    Liu, Jin
    Fu, Yanjie
    Shen, Guojiang
    IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 25 (12) : 19252 - 19275
  • [38] Missing traffic data: comparison of imputation methods
    Li, Yuebiao
    Li, Zhiheng
    Li, Li
    IET INTELLIGENT TRANSPORT SYSTEMS, 2014, 8 (01) : 51 - 57
  • [39] Multiple imputation in principal component analysis
    Josse, Julie
    Pages, Jerome
    Husson, Francois
    ADVANCES IN DATA ANALYSIS AND CLASSIFICATION, 2011, 5 (03) : 231 - 246
  • [40] Microarray missing value imputation by bayesian principal component analysis and local least squares
    Meng, Fanchi
    Cai, Cheng
    Li, Shuqin
    ICIC Express Letters, Part B: Applications, 2013, 4 (03): : 699 - 703