Fast similarity search in the presence of longitudinal scaling in time series databases

被引:61
|
作者
Keogh, E
机构
关键词
D O I
10.1109/TAI.1997.632306
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of finding patterns of interest in time series databases (query by content) is an important one, with applications in virtually every field of science. A variety of approaches have been suggested. These approaches are robust to noise, offset translation, and amplitude scaling to varying degrees. However, they are all extremely sensitive to scaling in the lime axis (longitudinal scaling). We present a method for similarity search that is robust to scaling in the time axis, in addition to noise, offset translation, and amplitude scaling. The method has been tested on medical, financial, space telemetry and artificial data. Furthermore the method is exceptionally fast, with the predicted 2 to 4 orders of magnitude speedup actually observed. The method uses a piecewise linear representation of the original data. We also introduce a new algorithm which both decides the optimal number of linear segments to use, and produces the actual linear representation.
引用
收藏
页码:578 / 584
页数:7
相关论文
共 50 条
  • [21] Fast similarity search in databases of 3D objects
    Wang, X
    Wang, JTL
    TENTH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1998, : 16 - 23
  • [22] A fast heuristic algorithm for similarity search in large DNA databases
    Jeong, In-Seon
    Park, Kyoung-Wook
    Lim, Hyeong-Seok
    PROCEEDINGS OF THE FRONTIERS IN THE CONVERGENCE OF BIOSCIENCE AND INFORMATION TECHNOLOGIES, 2007, : 335 - 340
  • [23] Fast similarity search in three-dimensional structure databases
    Wang, X
    Wang, JTL
    JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES, 2000, 40 (02): : 442 - 451
  • [24] Trend similarity and prediction in time-series databases
    Yoon, JP
    Lee, J
    Kim, S
    DATA MINING AND KNOWLEDGE DISCOVERY: THEORY, TOOLS, AND TECHNOLOGY II, 2000, 4057 : 201 - 212
  • [25] Similarity-based queries for time series Databases
    Wu, F
    Plihon, V
    Gardarin, G
    ENABLING SOCIETY WITH INFORMATION TECHNOLOGY, 2002, : 13 - 26
  • [26] Similarity Measure Selection for Clustering Time Series Databases
    Mori, Usue
    Mendiburu, Alexander
    Lozano, Jose A.
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2016, 28 (01) : 181 - 195
  • [27] Fast Similarity Search of Multi-dimensional Time Series via Segment Rotation
    Gong, Xudong
    Xiong, Yan
    Huang, Wenchao
    Chen, Lei
    Lu, Qiwei
    Hu, Yiqing
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT1, 2015, 9049 : 108 - 124
  • [28] Fast and effective similarity search in medical tumor databases using morphology
    Korn, F
    Sidiropoulos, N
    Faloutsos, C
    Siegel, E
    Protopapas, Z
    MULTIMEDIA STORAGE AND ARCHIVING SYSTEMS, 1996, 2916 : 116 - 129
  • [29] Probabilistic Similarity Search for Uncertain Time Series
    Assfalg, Johannes
    Kriegel, Hans-Peter
    Kroeger, Peer
    Benz, Matthias
    SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, 2009, 5566 : 435 - 443
  • [30] TimeExplorer: Similarity Search Time Series by Their Signatures
    Tuan Nhon Dang
    Wilkinson, Leland
    ADVANCES IN VISUAL COMPUTING, ISVC 2013, PT I, 2013, 8033 : 280 - 289