A Fast Semi-Supervised Clustering Framework for Large-Scale Time Series Data

被引:18
|
作者
He, Guoliang [1 ]
Pan, Yanzhou [2 ]
Xia, Xuewen [3 ]
He, Jinrong [4 ]
Peng, Rong [1 ]
Xiong, Neal N. [5 ]
机构
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430079, Peoples R China
[2] Rice Univ, Engn Dept, Houston, TX 77005 USA
[3] Minnan Normal Univ, Coll Phys & Informat Engn, Zhangzhou 363000, Peoples R China
[4] Yanan Univ, Coll Math & Comp Sci, Yanan 716000, Peoples R China
[5] Northeastern State Univ, Dept Math & Comp Sci, Tahlequah, OK 74464 USA
基金
中国国家自然科学基金;
关键词
Time series analysis; Clustering algorithms; Time measurement; Velocity measurement; Shape measurement; Clustering methods; Contracts; Constraint propagation; semi-supervised learning; similarity measure; time series clustering; CLASSIFICATION;
D O I
10.1109/TSMC.2019.2931731
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Semi-supervised clustering algorithms have several limitations: 1) the computation complexity of them is very high, because calculating the similarity distances of pairs of examples is time-consuming; 2) traditional semi-supervised clustering methods have not considered how to make full use of must-link and cannot-link constraints. In the clustering, the contribution of a few pairwise constraints to the clustering performance is very limited, and some may negatively affect the outcome; and 3) these methods are not effective to handle high dimensional data, especially for time series data. Up to now, few work touched semi-supervised clustering on time series data. To efficiently cluster large-scale time series data, we first tackle contract time series clustering to produce the most accurate clustering results under a contracted time. We propose a semi-supervised time series clustering framework (STSC), which integrates a fast similarity measure and a constraint propagation approach. Based on the proposed framework, two valid semi-supervised clustering algorithms including fssK-means and fssDBSCAN are designed. Experiments on 11 datasets show that our proposed method is efficient and effective for clustering large-scale time series data.
引用
收藏
页码:4201 / 4216
页数:16
相关论文
共 50 条
  • [31] Efficient semi-supervised clustering with pairwise constraint propagation for multivariate time series
    He, Guoliang
    Jin, Dawei
    Jiang, Wenjun
    Zhao, Zongkun
    Dai, Lifang
    Yu, Zhiwen
    Chen, C. L. Philip
    INFORMATION SCIENCES, 2024, 681
  • [32] Deep semi-supervised clustering for multi-variate time-series
    Ienco, Dino
    Interdonato, Roberto
    NEUROCOMPUTING, 2023, 516 : 36 - 47
  • [33] A Semi-Supervised Weighted Clustering Framework Facing to Hybrid Attributes Data Streams
    Chen, Xinquan
    2010 8TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA), 2010, : 5988 - 5993
  • [34] Semi-supervised clustering for gene-expression data in multiobjective optimization framework
    Alok, Abhay Kumar
    Saha, Sriparna
    Ekbal, Asif
    INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS, 2017, 8 (02) : 421 - 439
  • [35] Semi-supervised clustering for gene-expression data in multiobjective optimization framework
    Abhay Kumar Alok
    Sriparna Saha
    Asif Ekbal
    International Journal of Machine Learning and Cybernetics, 2017, 8 : 421 - 439
  • [36] A Framework for Semi-Supervised Clustering Based on Dimensionality Reduction
    Cui Peng
    Zhang Ru-bo
    FIRST INTERNATIONAL WORKSHOP ON DATABASE TECHNOLOGY AND APPLICATIONS, PROCEEDINGS, 2009, : 192 - +
  • [37] An Analysis Framework for Large-Scale Time Series
    Teng F.
    Huang Q.-C.
    Li T.-R.
    Wang C.
    Tian C.-H.
    Jisuanji Xuebao/Chinese Journal of Computers, 2020, 43 (07): : 1279 - 1292
  • [38] Graph-Based Semi-Supervised Learning with Bipartite Graph for Large-Scale Data and Prediction of Unseen Data
    Alemi, Mohammad
    Bosaghzadeh, Alireza
    Dornaika, Fadi
    INFORMATION, 2024, 15 (10)
  • [39] Semi-Supervised clustering and Local Scale Learning algorithm
    Bchir, Ouiem
    Frigui, Hichem
    Ben Ismail, Mohamed Maher
    WORLD CONGRESS ON COMPUTER & INFORMATION TECHNOLOGY (WCCIT 2013), 2013,
  • [40] Large-scale image recognition based on parallel kernel supervised and semi-supervised subspace learning
    Fei Wu
    Xiao-Yuan Jing
    Qian Liu
    Song-Song Wu
    Guo-Liang He
    Neural Computing and Applications, 2017, 28 : 483 - 498