Adaptive Segmentation of Multivariate Time Series with FastICA and G-G Clustering

被引:0
|
作者
Wang L. [1 ,2 ]
Li Z.-Z. [1 ,2 ]
机构
[1] School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing
[2] Key Laboratory of Knowledge Automation of Industrial Processes of Ministry of Education, School of Automation and Electrical Engineering, University of Science and Technology Beijing, Beijing
来源
基金
中国国家自然科学基金;
关键词
adaptive segmentation; fast independent component analysis; Gath-Geva clustering; minimum description length; multivariate time series;
D O I
10.12263/DZXB.20220649
中图分类号
学科分类号
摘要
The existing segmentation methods detect the statistical or shape changes of multivariate time series, and perform crisp segmentation on the location of change points. However, these methods fail to estimate the length of the transition interval between two segments, cannot accurately segment multivariate time series with high dimension, strong noise, and need to set parameters in advance. To address such matters, an adaptive multivariate time series segmentation method based on FastICA (Fast Independent Component Analysis) and G-G (Gath-Geva) clustering is proposed. In this method, the key features of multivariate time series are extracted via FastICA, and DW (Durbin-Watson) criterion is used to automatically select main components with high signal-to-noise ratio. According to the minimum description length (MDL), an adaptive multivariate time series segmentation model based on G-G clustering is designed, which is able to perform soft segmentation of multivariate time series. The experimental analysis is carried out on real datasets in many different fields. Compared with state-of-art benchmarks, the average F1 and MAE (Mean Absolute Error) of the proposed method on the above-mentioned datasets improve 8.4%~16.8% and 3.06%~6.56%, respectively. © 2023 Chinese Institute of Electronics. All rights reserved.
引用
收藏
页码:1235 / 1244
页数:9
相关论文
共 29 条
  • [11] LIM H, CHOI H, CHOI Y, Et al., Memetic algorithm for multivariate time-series segmentation, Pattern Recognition Letters, 138, pp. 60-67, (2020)
  • [12] KAWABATA Y, MATSUBARA Y, SAKURAI Y., Automatic sequential pattern mining in data streams, Proceedings of the 28th ACM International Conference on Information and Knowledge Management, pp. 1733-1742, (2019)
  • [13] GHARGHABI S, YEH C C M, Ding Y, Et al., Domain agnostic online semantic segmentation for multi-dimensional time series, Data Mining and Knowledge Discovery, 33, 1, pp. 96-130, (2019)
  • [14] GHARGHABI S, DING Y, YEH C C M, Et al., Matrix profile VIII: domain agnostic online semantic segmentation at superhuman performance levels, 2017 IEEE International Conference on Data Mining, pp. 117-126, (2017)
  • [15] DELDARI S, SMITH D V, SADRI A, Et al., Espresso: Entropy and shape aware time-series segmentation for processing heterogeneous sensor data, Proceedings of the ACM on Interactive Mobile Wearable and Ubiquitous Technologies, 4, 3, pp. 1-24, (2020)
  • [16] HALLAC D, VARE S, BOYD S, Et al., Toeplitz inverse covariance-based clustering of multivariate time series data, Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 215-223, (2017)
  • [17] WANG Ling, ZHU Hui, Fuzzy segmentation of multivariate time series with KPCA and G-G clustering, Control and Decision, 36, 1, pp. 115-124, (2021)
  • [18] ZHOU F, DE LA TORRE F, HODGINS J K., Hierarchical aligned cluster analysis for temporal clustering of human motion, IEEE Transactions on Pattern Analysis and Machine Intelligence, 35, 3, pp. 582-596, (2013)
  • [19] HOAI M, DE LA TORRE F., Maximum margin temporal clustering, Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics (AIST-AT), pp. 520-528, (2012)
  • [20] GRUNWALD P D., The Minimum Description Length Principle, (2007)