A Fast Semi-Supervised Clustering Framework for Large-Scale Time Series Data

Cited by: 18
Authors
He, Guoliang [1]
Pan, Yanzhou [2]
Xia, Xuewen [3]
He, Jinrong [4]
Peng, Rong [1]
Xiong, Neal N. [5]
Affiliations
[1] Wuhan Univ, Sch Comp Sci, Wuhan 430079, Peoples R China
[2] Rice Univ, Engn Dept, Houston, TX 77005 USA
[3] Minnan Normal Univ, Coll Phys & Informat Engn, Zhangzhou 363000, Peoples R China
[4] Yanan Univ, Coll Math & Comp Sci, Yanan 716000, Peoples R China
[5] Northeastern State Univ, Dept Math & Comp Sci, Tahlequah, OK 74464 USA
Funding
National Natural Science Foundation of China
Keywords
Time series analysis; Clustering algorithms; Time measurement; Velocity measurement; Shape measurement; Clustering methods; Contracts; Constraint propagation; semi-supervised learning; similarity measure; time series clustering; CLASSIFICATION;
DOI
10.1109/TSMC.2019.2931731
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Subject classification code
0812
Abstract
Semi-supervised clustering algorithms have several limitations: 1) their computational complexity is very high, because calculating the similarity distances between pairs of examples is time-consuming; 2) traditional semi-supervised clustering methods have not considered how to make full use of must-link and cannot-link constraints: a few pairwise constraints contribute very little to clustering performance, and some may negatively affect the outcome; and 3) these methods do not handle high-dimensional data effectively, especially time series data. To date, few works have addressed semi-supervised clustering of time series data. To cluster large-scale time series data efficiently, we first tackle contract time series clustering, which aims to produce the most accurate clustering results within a contracted time. We propose a semi-supervised time series clustering framework (STSC), which integrates a fast similarity measure and a constraint propagation approach. Based on the proposed framework, two semi-supervised clustering algorithms, fssK-means and fssDBSCAN, are designed. Experiments on 11 datasets show that the proposed method is efficient and effective for clustering large-scale time series data.
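The abstract does not spell out how STSC propagates constraints, so the following is only a minimal sketch of the textbook idea behind making fuller use of a few must-link and cannot-link pairs: take the transitive closure of the must-link pairs, then extend every cannot-link pair to all members of the two groups involved. The function name `propagate_constraints` and its parameters are hypothetical and are not taken from the paper.

```python
from itertools import combinations


def propagate_constraints(n, must_link, cannot_link):
    """Expand pairwise constraints by transitive closure (illustrative only).

    n: number of series; must_link / cannot_link: iterables of (i, j) index pairs.
    Returns (ml, cl), two sets of index pairs (a, b) with a < b.
    """
    parent = list(range(n))

    def find(x):
        # Union-find lookup with path halving.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    # Merge the endpoints of every must-link pair into one component.
    for i, j in must_link:
        parent[find(i)] = find(j)

    # Group indices by must-link component (members stay in ascending order).
    groups = {}
    for i in range(n):
        groups.setdefault(find(i), []).append(i)

    # Every pair inside a component is an implied must-link.
    ml = set()
    for members in groups.values():
        ml.update(combinations(members, 2))

    # A cannot-link between two components applies to all cross pairs.
    cl = set()
    for i, j in cannot_link:
        ri, rj = find(i), find(j)
        if ri == rj:
            raise ValueError(f"inconsistent constraints on ({i}, {j})")
        for a in groups[ri]:
            for b in groups[rj]:
                cl.add((min(a, b), max(a, b)))
    return ml, cl


# Hypothetical input: two must-link pairs and one cannot-link pair over 5 series.
ml, cl = propagate_constraints(5, must_link=[(0, 1), (1, 2)], cannot_link=[(2, 3)])
```

Here three supplied pairs expand into three must-link and three cannot-link pairs, which illustrates why propagation can make a small constraint set more useful to a clustering algorithm.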
Pages: 4201-4216
Page count: 16
Related papers
50 in total
  • [41] Large-scale image recognition based on parallel kernel supervised and semi-supervised subspace learning
    Wu, Fei
    Jing, Xiao-Yuan
    Liu, Qian
    Wu, Song-Song
    He, Guo-Liang
    NEURAL COMPUTING & APPLICATIONS, 2017, 28 (03) : 483 - 498
  • [42] Active Semi-supervised Framework with Data Editing
    Zhang, Xue
    Xiao, Wangxin
    COMPUTER SCIENCE AND INFORMATION SYSTEMS, 2012, 9 (04) : 1513 - 1532
  • [43] Noise-Robust Semi-Supervised Learning by Large-Scale Sparse Coding
    Lu, Zhiwu
    Gao, Xin
    Wang, Liwei
    Wen, Ji-Rong
    Huang, Songfang
    PROCEEDINGS OF THE TWENTY-NINTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2015, : 2828 - 2834
  • [44] A survey of large-scale graph-based semi-supervised classification algorithms
    Song Y.
    Zhang J.
    Zhang C.
    International Journal of Cognitive Computing in Engineering, 2022, 3 : 188 - 198
  • [45] LARGE-SCALE SEMI-SUPERVISED LEARNING BY APPROXIMATE LAPLACIAN EIGENMAPS, VLAD AND PYRAMIDS
    Mantziou, Eleni
    Papadopoulos, Symeon
    Kompatsiaris, Yiannis
    2013 14TH INTERNATIONAL WORKSHOP ON IMAGE ANALYSIS FOR MULTIMEDIA INTERACTIVE SERVICES (WIAMIS), 2013,
  • [46] queryCategorizr: A Large-Scale Semi-Supervised System for Categorization of Web Search Queries
    Grbovic, Mihajlo
    Djuric, Nemanja
    Radosavljevic, Vladan
    Bhamidipati, Narayan
    Hawker, Jordan
    Johnson, Caleb
    WWW'15 COMPANION: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2015, : 199 - 202
  • [47] Large-Scale Semi-Supervised Training in Deep Learning Acoustic Model for ASR
    Long, Yanhua
    Li, Yijie
    Wei, Shuang
    Zhang, Qiaozheng
    Yang, Chunxia
    IEEE ACCESS, 2019, 7 : 133615 - 133627
  • [48] Semi-Supervised anchor graph ensemble for large-scale hyperspectral image classification
    He, Ziping
    Xia, Kewen
    Hu, Yuhen
    Yin, Zhixian
    Wang, Sijie
    Zhang, Jiangnan
    INTERNATIONAL JOURNAL OF REMOTE SENSING, 2022, 43 (05) : 1894 - 1918
  • [49] Semi-supervised Clustering Algorithm for Retention Time Alignment of Gas Chromatographic Data
    Hamadi, Omar Peter
    Varga, Tamas
    PERIODICA POLYTECHNICA-CHEMICAL ENGINEERING, 2022, 66 (03) : 414 - 421
  • [50] Enhancing Time Series Clustering by Incorporating Multiple Distance Measures with Semi-Supervised Learning
    Jing Zhou
    Shan-Feng Zhu
    Xiaodi Huang
    Yanchun Zhang
    Journal of Computer Science and Technology, 2015, 30 : 859 - 873