Time Series Data Cleaning Based on Dynamic Speed Constraints

Authors
Ding, Guohui [1 ]
Li, Chenyang [1 ]
Wei, Ru [1 ]
Sun, Shasha [1 ]
Liu, Zhaoyu [1 ]
Fan, Chunlong [1 ]
Affiliations
[1] Shenyang Aerospace University, Shenyang, People's Republic of China
Source
WEB INFORMATION SYSTEMS ENGINEERING, WISE 2020, PT II | 2020 / Vol. 12343
Funding
National Natural Science Foundation of China;
Keywords
Data cleaning; Speed constraint; Minimum change principle;
DOI
10.1007/978-3-030-62008-0_33
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Errors are ubiquitous in time series data because sensors are often unstable. Existing constraint-based approaches repair abnormal values well. The constraint typically refers to the range of speeds at which data values may change. If the speed of a change falls outside this range, the data point is identified as violating the constraint and needs repair; for example, if the hourly oil consumption of a sedan is negative or greater than 15 gallons, the reading is probably abnormal. However, existing methods are limited to specific types of data whose value change speed is stable. They perform poorly on data streams with sharp fluctuations, because constraints based on an a priori, fixed speed range might miss most abnormal data. To fill this gap, an online cleaning approach based on dynamic speed constraints is proposed for time series data with fluctuating value change speed. The proposed dynamic constraints are not determined in advance but adapt as the data changes over time. A dual window mechanism is devised to transform the global optimum of the data repair problem into a local optimum problem. The classic minimum change principle and median principle are introduced for data repair. Because the minimum change principle fails to repair consecutive data points that violate the constraints, we propose using the boundary of the corresponding candidate repair set as the repair strategy. Extensive experiments on real datasets demonstrate that the proposed approach achieves higher repair accuracy than traditional approaches.
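As a rough illustration of the fixed speed constraint that the abstract takes as its starting point, the sketch below flags a point whose change speed falls outside a given range and repairs it by moving it to the nearest boundary of its candidate range (a minimum-change style repair). This is not the paper's dynamic-constraint algorithm or its dual window mechanism, which the abstract does not specify. The function names `violates` and `repair`, the parameters `s_min` and `s_max`, and the assumptions that the first point is correct and timestamps strictly increase are all hypothetical choices made for this example.

```python
from typing import List, Tuple

Point = Tuple[float, float]  # (timestamp, value)


def violates(prev: Point, cur: Point, s_min: float, s_max: float) -> bool:
    """Return True if the change speed from prev to cur is outside [s_min, s_max]."""
    (t0, v0), (t1, v1) = prev, cur
    speed = (v1 - v0) / (t1 - t0)  # assumes t1 > t0
    return speed < s_min or speed > s_max


def repair(series: List[Point], s_min: float, s_max: float) -> List[Point]:
    """Clamp each value into the range allowed by the previous repaired point.

    Assumption for this sketch: the first point is trusted as correct.
    """
    repaired = [series[0]]
    for t, v in series[1:]:
        t_prev, v_prev = repaired[-1]
        dt = t - t_prev
        lo = v_prev + s_min * dt  # lower boundary of the candidate range
        hi = v_prev + s_max * dt  # upper boundary of the candidate range
        # A value already inside [lo, hi] is kept; otherwise it moves to
        # the nearest boundary, which changes it as little as possible.
        repaired.append((t, min(max(v, lo), hi)))
    return repaired


series = [(0, 10.0), (1, 11.0), (2, 30.0), (3, 13.0)]
cleaned = repair(series, s_min=-2.0, s_max=2.0)
```

With the speed range [-2, 2] per time unit, the spike at timestamp 2 is pulled down to the upper boundary 13.0 and the other points are left unchanged. A fixed range like this is exactly what the abstract argues breaks down on sharply fluctuating data, where the bounds would need to adapt over time.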
Pages: 475 - 487
Number of pages: 13
Related papers
50 in total
  • [21] Individual privacy constraints on time-series data
    Laforet, Fabian
    Buchmann, Erik
    Boehm, Klemens
    INFORMATION SYSTEMS, 2015, 54 : 74 - 91
  • [22] Dynamic analysis of time series data based on state space model
    Zhizhong, Yang, 1600, Science and Engineering Research Support Society (07):
  • [23] Data Cleaning Method Based on Time Series Similarity Measurement for Large Scale Smart Grid Load Data
    Lei, Yu
    Lin, RongHeng
    Zou, Hua
    Zhou, Shiqi
    Zhang, Yong
    2017 5TH INTERNATIONAL CONFERENCE ON ENTERPRISE SYSTEMS (ES), 2017, : 7 - 12
  • [24] Data Cleaning Method of No-tillage Seeder Monitoring Data Based on Multi-conditional Time Series
    Jiang H.
    Zhou L.
    Ma M.
    Li Y.
    Zhou Y.
    Yuan Y.
    Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2022, 53 (01): : 85 - 91
  • [25] Dynamic time warping based on cubic spline interpolation for time series data mining
    Li, Hailin
    Wan, Xiaoji
    Liang, Ye
    Gao, Shile
    2014 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOP (ICDMW), 2014, : 19 - 26
  • [26] Invariant subspace learning for time series data based on dynamic time warping distance
    Deng, Huiqi
    Chen, Weifu
    Shen, Qi
    Ma, Andy J.
    Yuen, Pong C.
    Feng, Guocan
    PATTERN RECOGNITION, 2020, 102
  • [27] Time works well: Dynamic time warping based on time weighting for time series data mining
    Li, Hailin
    INFORMATION SCIENCES, 2021, 547 : 592 - 608
  • [28] Time Series Analysis and Forecasting of Wind Speed Data
    Elsaraiti, Meftah
    Merabet, Adel
    Al-Durra, Ahmed
    2019 IEEE INDUSTRY APPLICATIONS SOCIETY ANNUAL MEETING, 2019,
  • [29] Online summarization of dynamic time series data
    Ogras, UY
    Ferhatosmanoglu, H
    VLDB JOURNAL, 2006, 15 (01): : 84 - 98
  • [30] Online summarization of dynamic time series data
    Umit Y. Ogras
    Hakan Ferhatosmanoglu
    The VLDB Journal, 2006, 15 : 84 - 98