Fast similarity search in the presence of longitudinal scaling in time series databases

Cited by: 61
Author
Keogh, E
DOI
10.1109/TAI.1997.632306
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The problem of finding patterns of interest in time series databases (query by content) is an important one, with applications in virtually every field of science. A variety of approaches have been suggested. These approaches are robust to noise, offset translation, and amplitude scaling to varying degrees. However, they are all extremely sensitive to scaling in the time axis (longitudinal scaling). We present a method for similarity search that is robust to scaling in the time axis, in addition to noise, offset translation, and amplitude scaling. The method has been tested on medical, financial, space telemetry, and artificial data. Furthermore, the method is exceptionally fast, with the predicted 2 to 4 orders of magnitude speedup actually observed. The method uses a piecewise linear representation of the original data. We also introduce a new algorithm that both decides the optimal number of linear segments to use and produces the actual linear representation.
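The abstract does not spell out the segmentation algorithm, so the following is only a minimal sketch of what a piecewise linear representation can look like, not the paper's method. It assumes a generic bottom-up merging scheme with a user-supplied error threshold; the names `fit_line`, `bottom_up_segments`, and `max_error` are hypothetical, and the paper's algorithm chooses the number of segments by its own criterion.

```python
from typing import List, Sequence, Tuple


def fit_line(ys: Sequence[float], start: int, end: int) -> Tuple[float, float, float]:
    """Least-squares line through ys[start..end] (inclusive).

    Returns (slope, intercept, sum of squared residuals).
    """
    n = end - start + 1
    xs = range(start, end + 1)
    mean_x = sum(xs) / n
    mean_y = sum(ys[start:end + 1]) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (ys[x] - mean_y) for x in xs)
    slope = sxy / sxx if sxx else 0.0  # a single point gets a flat line
    intercept = mean_y - slope * mean_x
    err = sum((ys[x] - (slope * x + intercept)) ** 2 for x in xs)
    return slope, intercept, err


def bottom_up_segments(ys: Sequence[float], max_error: float) -> List[Tuple[int, int]]:
    """Assumed bottom-up scheme: start from two-point segments and repeatedly
    merge the cheapest adjacent pair until every merge would exceed max_error.

    Returns a list of (start_index, end_index) pairs, both inclusive.
    """
    n = len(ys)
    segs = [(i, min(i + 1, n - 1)) for i in range(0, n, 2)]
    while len(segs) > 1:
        # Cost of merging each adjacent pair = residual of one line over both.
        costs = [fit_line(ys, segs[i][0], segs[i + 1][1])[2]
                 for i in range(len(segs) - 1)]
        best = min(range(len(costs)), key=costs.__getitem__)
        if costs[best] > max_error:
            break
        segs[best:best + 2] = [(segs[best][0], segs[best + 1][1])]
    return segs


# A triangular peak and the same peak stretched 2x along the time axis.
peak = [0, 1, 2, 3, 4, 5, 4, 3, 2, 1, 0]
stretched = [i * 0.5 for i in range(11)] + [5 - i * 0.5 for i in range(1, 11)]

print(bottom_up_segments(peak, 0.01))
print(bottom_up_segments(stretched, 0.01))
```

In this toy case both the original and the time-stretched peak reduce to two segments (one rising, one falling), which illustrates why comparing segment-level shape rather than raw samples can tolerate longitudinal scaling. How segments are actually compared in the paper is not described in this record.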
Pages: 578-584
Number of pages: 7