An efficient similarity searching algorithm based on clustering for time series

Authors
Feng, Yucai [1]
Jiang, Tao [1]
Zhou, Yingbiao [1]
Li, Junkui [1]
Affiliations
[1] Huazhong Univ Sci & Technol, Coll Comp Sci & Technol, Wuhan 430074, Peoples R China
Keywords
time series; clustering; similarity search; indexing
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Indexing large time series databases is crucial for answering time series queries efficiently. In this paper, we propose a novel indexing scheme, RQI (Range Query based on Index), which combines three filtering methods: first-k filtering, index-based lower and upper bounding, and triangle inequality pruning. The basic idea is to compute the Haar wavelet transform of each time series and use its first k coefficients to form an MBR (minimum bounding rectangle), to which a point filtering method is then applied. At the same time, the lower-bounding and upper-bounding features of each time series are computed in advance and stored in the index structure. Finally, triangle inequality pruning is applied using distances between time series that are computed beforehand. We then introduce a novel lower-bounding distance function, SLBS (Symmetrical Lower Bounding based on Segment), and a novel clustering algorithm, CSA (Clustering based on Segment Approximation), which further improve the search efficiency of point filtering by preserving good clustering properties in the index structure. Extensive experiments on both synthetic and real datasets show that our techniques provide very high pruning power and can achieve an order-of-magnitude performance improvement for time series queries over traditional naive evaluation techniques.
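The filtering idea described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' RQI implementation: it assumes Euclidean distance, series whose length is a power of two, and a single reference series for the triangle inequality step, and it omits the MBR index, SLBS, and CSA entirely. The function names `haar_transform` and `range_query`, and the parameters `k` and `ref_index`, are hypothetical. Because the Haar transform used here is orthonormal, the distance over the first k coefficients can never exceed the true distance, so it is a valid lower bound.

```python
import numpy as np

def haar_transform(x):
    """Orthonormal Haar transform; coarsest coefficients come first."""
    out = np.asarray(x, dtype=float).copy()
    n = out.size
    if n == 0 or n & (n - 1):
        raise ValueError("series length must be a power of two")
    while n > 1:
        half = n // 2
        pairs = out[:n].reshape(half, 2)
        avg = (pairs[:, 0] + pairs[:, 1]) / np.sqrt(2.0)
        diff = (pairs[:, 0] - pairs[:, 1]) / np.sqrt(2.0)
        out[:half] = avg      # scaled averages, refined on the next pass
        out[half:n] = diff    # detail coefficients at this resolution
        n = half
    return out

def range_query(query, database, eps, k=4, ref_index=0):
    """Return indices of series within Euclidean distance eps of query."""
    data = np.asarray(database, dtype=float)
    q = np.asarray(query, dtype=float)

    # Precomputed offline in a real index (assumption: done inline here).
    coeffs = np.array([haar_transform(s)[:k] for s in data])
    ref = data[ref_index]
    dist_to_ref = np.linalg.norm(data - ref, axis=1)

    q_coeffs = haar_transform(q)[:k]
    q_to_ref = np.linalg.norm(q - ref)

    hits = []
    stats = {"pruned_first_k": 0, "pruned_triangle": 0, "verified": 0}
    for i, series in enumerate(data):
        # First-k filter: distance on k coefficients lower-bounds the true one.
        if np.linalg.norm(coeffs[i] - q_coeffs) > eps:
            stats["pruned_first_k"] += 1
            continue
        # Triangle inequality: |d(q, r) - d(x, r)| <= d(q, x).
        if abs(q_to_ref - dist_to_ref[i]) > eps:
            stats["pruned_triangle"] += 1
            continue
        # Survivors are checked with the exact distance.
        stats["verified"] += 1
        if np.linalg.norm(series - q) <= eps:
            hits.append(i)
    return hits, stats
```

Calling `range_query(q, db, eps=2.0)` returns the matching indices together with counts of how many series each filter discarded, which gives a rough view of pruning power on a given dataset. Both filters only discard series that cannot be within `eps`, so the result matches a brute-force scan.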
Pages: 360-373
Page count: 14
Related papers
50 in total
  • [31] An approach based on data mining and genetic algorithm to optimizing time series clustering for efficient segmentation of customer behavior
    Hamidi, Hodjat
    Haghi, Bahare
    COMPUTERS IN HUMAN BEHAVIOR REPORTS, 2024, 16
  • [32] A clustering with slope algorithm based on item similarity
    Wu Huiyun
    Wang Yuping
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2016, 31 (04) : 2177 - 2185
  • [33] A Similarity Based Agglomerative Clustering Algorithm in Networks
    Liu, Zhiyuan
    Wang, Xiujuan
    Ma, Yinghong
    NINTH INTERNATIONAL CONFERENCE ON GRAPHIC AND IMAGE PROCESSING (ICGIP 2017), 2018, 10615
  • [34] A Clustering Algorithm Based on Variance-Similarity
    Li, Zhendong
    Li, Fei
    MEASUREMENT TECHNOLOGY AND ENGINEERING RESEARCHES IN INDUSTRY, PTS 1-3, 2013, 333-335 : 1306 - +
  • [35] Incremental Clustering for Time Series Data based on an Improved Leader Algorithm
    Huynh Thi Thu Thuy
    Duong Tuan Anh
    Vo Thi Ngoc Chau
    2019 IEEE - RIVF INTERNATIONAL CONFERENCE ON COMPUTING AND COMMUNICATION TECHNOLOGIES (RIVF), 2019, : 13 - 18
  • [36] Fuzzy clustering algorithm for time series based on adaptive incremental learning
    Wang, Wei
    Hu, Xiaohui
    Wang, Mingye
    JOURNAL OF INTELLIGENT & FUZZY SYSTEMS, 2020, 38 (04) : 3991 - 3998
    Journal of Classification, 2018, 35 : 230 - 249
  • [38] A Copula Based ICA Algorithm and Its Application to Time Series Clustering
    Rahmanishamsi, Jafar
    Dolati, Ali
    Aghabozorgi, Masoudreza R.
    JOURNAL OF CLASSIFICATION, 2018, 35 (02) : 230 - 249
  • [40] Equivalence partition based morphological similarity clustering for large-scale time series
    Hu, Shaolin
    SCIENTIFIC REPORTS, 2023, 13 (01)