Fast similarity search in the presence of longitudinal scaling in time series databases

被引:61
|
作者
Keogh, E
机构
关键词
D O I
10.1109/TAI.1997.632306
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of finding patterns of interest in time series databases (query by content) is an important one, with applications in virtually every field of science. A variety of approaches have been suggested. These approaches are robust to noise, offset translation, and amplitude scaling to varying degrees. However, they are all extremely sensitive to scaling in the lime axis (longitudinal scaling). We present a method for similarity search that is robust to scaling in the time axis, in addition to noise, offset translation, and amplitude scaling. The method has been tested on medical, financial, space telemetry and artificial data. Furthermore the method is exceptionally fast, with the predicted 2 to 4 orders of magnitude speedup actually observed. The method uses a piecewise linear representation of the original data. We also introduce a new algorithm which both decides the optimal number of linear segments to use, and produces the actual linear representation.
引用
收藏
页码:578 / 584
页数:7
相关论文
共 50 条
  • [31] Similarity search in trajectory Databases
    Pelekis, Nikos
    Kopanakis, Ioannis
    Marketos, Gerasimos
    Ntoutsi, Irene
    Andrienko, Gennady
    Theodoridis, Yannis
    TIME 2007: 14TH INTERNATIONAL SYMPOSIUM ON TEMPORAL REPRESENTATION AND REASONING, PROCEEDINGS, 2007, : 129 - +
  • [32] Similarity search in multimedia databases
    Keim, DA
    Bustos, B
    20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, : 873 - 873
  • [33] Scaling Search with Pattern Databases
    Edelkamp, Stefan
    Jabbar, Shahid
    Kissmann, Peter
    MODEL CHECKING AND ARTIFICIAL INTELLIGENCE, 2009, 5348 : 49 - 64
  • [34] Image Similarity Search in Large Databases Using a Fast Machine Learning Approach
    Sinjur, Smiljan
    Zazula, Damjan
    NEW DIRECTIONS IN INTELLIGENT INTERACTIVE MULTIMEDIA, 2008, 142 : 85 - 93
  • [35] SketchSort: Fast All Pairs Similarity Search for Large Databases of Molecular Fingerprints
    Tabei, Yasuo
    Tsuda, Koji
    MOLECULAR INFORMATICS, 2011, 30 (09) : 801 - 807
  • [36] An Efficient Similarity Search For Financial Multivariate Time Series
    Zhou, Dazhuo
    Li, Minqiang
    Yan, Hongcan
    2008 4TH INTERNATIONAL CONFERENCE ON WIRELESS COMMUNICATIONS, NETWORKING AND MOBILE COMPUTING, VOLS 1-31, 2008, : 11161 - 11164
  • [37] Similarity search on time series based on threshold queries
    Assfalg, J
    Kriegel, HP
    Kröger, P
    Kunath, P
    Pryakhin, A
    Renz, M
    ADVANCES IN DATABASE TECHNOLOGY - EDBT 2006, 2006, 3896 : 276 - 294
  • [38] GPU Acceleration of Similarity Search for Uncertain Time Series
    Hwang, Jun
    Kozawa, Yusuke
    Amagasa, Toshiyuki
    Kitagawa, Hiroyuki
    2014 17TH INTERNATIONAL CONFERENCE ON NETWORK-BASED INFORMATION SYSTEMS (NBIS 2014), 2014, : 626 - 631
  • [39] Similarity Search of Bounded TIDASETS within Large Time Interval Databases
    Meisen, Philipp
    Keng, Diane
    Meisen, Tobias
    Recchioni, Marco
    Jeschke, Sabina
    2015 INTERNATIONAL CONFERENCE ON COMPUTATIONAL SCIENCE AND COMPUTATIONAL INTELLIGENCE (CSCI), 2015, : 24 - 29
  • [40] Set-based Similarity Search for Time Series
    Peng, Jinglin
    Wang, Hongzhi
    Li, Jianzhong
    Gao, Hong
    SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2016, : 2039 - 2052