Time Series Data Cleaning Based on Dynamic Speed Constraints

Cited by: 0
Authors
Ding, Guohui [1]
Li, Chenyang [1]
Wei, Ru [1]
Sun, Shasha [1]
Liu, Zhaoyu [1]
Fan, Chunlong [1]
Affiliations
[1] Shenyang Aerospace University, Shenyang, People's Republic of China
Funding
National Natural Science Foundation of China;
Keywords
Data cleaning; Speed constraint; Minimum change principle;
DOI
10.1007/978-3-030-62008-0_33
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Errors are ubiquitous in time series data because sensors are often unstable. Existing constraint-based approaches can repair abnormal values effectively. The constraint typically refers to the permitted speed range of data changes: if the speed of change falls outside this range, the data point is identified as violating the constraint and needs repair. For example, if the hourly oil consumption of a sedan is negative or greater than 15 gallons, the reading is probably abnormal. However, existing methods are limited to data whose speed of change is stable. They perform poorly on data streams with sharp fluctuations, because constraints based on an a priori, fixed speed range might miss most abnormal data. To fill this gap, an online cleaning approach based on dynamic speed constraints is proposed for time series data with a fluctuating speed of change. The proposed dynamic constraints are not determined in advance but adapt as the data changes over time. A dual-window mechanism is devised to transform the global optimization problem of data repair into a local optimization problem. The classic minimum change principle and the median principle are used for data repair. Because the minimum change principle fails to repair consecutive data points that violate the constraints, we propose using the boundary of the corresponding candidate repair set as the repair strategy. Extensive experiments on real datasets demonstrate that the proposed approach achieves higher repair accuracy than traditional approaches.
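
To make the notion of a speed constraint concrete, the following is a minimal sketch of a fixed-range check with a boundary repair. It is not the dynamic, dual-window method of the paper. It assumes a fixed range [s_min, s_max] and a greedy left-to-right pass, and the names `candidate_range` and `repair_fixed_speed` are hypothetical, introduced only for illustration.

```python
from typing import List, Sequence, Tuple


def candidate_range(prev_t: float, prev_v: float, t: float,
                    s_min: float, s_max: float) -> Tuple[float, float]:
    """Values at time t that satisfy s_min <= (v - prev_v) / (t - prev_t) <= s_max."""
    dt = t - prev_t
    return prev_v + s_min * dt, prev_v + s_max * dt


def repair_fixed_speed(times: Sequence[float], values: Sequence[float],
                       s_min: float, s_max: float) -> List[float]:
    """Greedy repair under a FIXED speed range (an illustrative assumption).

    Each point is compared with the previously repaired point. A value
    outside the candidate range is moved to the nearest boundary of that
    range, which is the smallest change that satisfies the constraint
    with respect to the previous point.
    """
    if not values:
        return []
    repaired = [values[0]]  # the first point is trusted as-is
    for i in range(1, len(values)):
        lo, hi = candidate_range(times[i - 1], repaired[-1], times[i],
                                 s_min, s_max)
        repaired.append(min(max(values[i], lo), hi))
    return repaired


# Cumulative fuel readings (gallons) taken hourly; the permitted speed is
# 0 to 15 gallons per hour, as in the sedan example in the abstract.
hours = [0, 1, 2, 3]
readings = [0, 5, 40, 25]
print(repair_fixed_speed(hours, readings, s_min=0, s_max=15))
```

In this sketch the reading 40 implies a consumption of 35 gallons in one hour, so it is moved to the upper boundary 20. A fixed range like this is exactly what the abstract argues is insufficient for sharply fluctuating streams, where the range itself would need to adapt over time.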
Pages: 475 - 487
Page count: 13
Related papers
50 items in total
  • [1] Time Series Data Cleaning under Multi-speed Constraints
    Gao F.
    Song S.-X.
    Wang J.-M.
    Ruan Jian Xue Bao/Journal of Software, 2021, 32 (03): 689 - 711
  • [2] Time Series Data Cleaning Method Based on Optimized ELM Prediction Constraints
    Ding, Guohui
    Zhu, Yueyi
    Li, Chenyang
    Wang, Jinwei
    Wei, Ru
    Liu, Zhaoyu
    JOURNAL OF INFORMATION PROCESSING SYSTEMS, 2023, 19 (02): 149 - 163
  • [3] A Dynamic Path Data Cleaning Algorithm Based on Constraints for RFID Data Cleaning
    Hu, Kongfa
    Li, Long
    Hu, Chengjun
    Xie, Jiadong
    Lu, Zhipeng
    2014 11TH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (FSKD), 2014: 537 - 541
  • [4] SCREEN: Stream Data Cleaning under Speed Constraints
    Song, Shaoxu
    Zhang, Aoqian
    Wang, Jianmin
    Yu, Philip S.
    SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015: 827 - 841
  • [5] Stream Data Cleaning under Speed and Acceleration Constraints
    Song, Shaoxu
    Gao, Fei
    Zhang, Aoqian
    Wang, Jianmin
    Yu, Philip S.
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2021, 46 (03):
  • [6] Time Series Data Cleaning: A Survey
    Wang, Xi
    Wang, Chen
    IEEE ACCESS, 2020, 8 : 1866 - 1881
  • [7] Cleaning of Multi-Source Uncertain Time Series Data Based on PageRank
    高嘉伟
    孙纪舟
    Journal of Donghua University (English Edition), 2023, 40 (06): 695 - 700
  • [8] Photovoltaic Generation Data Cleaning Method Based on Approximately Periodic Time Series
    Zhang, J.
    Zhang, Sh
    Liang, J.
    Tian, B.
    Hou, Z.
    Liu, B. Zh
    2017 INTERNATIONAL CONFERENCE ON ENVIRONMENTAL AND ENERGY ENGINEERING (IC3E 2017), 2017, 63
  • [9] Streaming data cleaning based on speed change
    Haoyu Wang
    Aoqian Zhang
    Shaoxu Song
    Jianmin Wang
    The VLDB Journal, 2024, 33 : 1 - 24
  • [10] Streaming data cleaning based on speed change
    Wang, Haoyu
    Zhang, Aoqian
    Song, Shaoxu
    Wang, Jianmin
    VLDB JOURNAL, 2024, 33 (01): 1 - 24