Scalable classifier-agnostic channel selection for multivariate time series classification

被引:9
|
作者
Dhariyal, Bhaskar [1 ]
Le Nguyen, Thach [1 ]
Ifrim, Georgiana [1 ]
机构
[1] Univ Coll Dublin, Insight Ctr Data Analyt, Sch Comp Sci, Dublin, Ireland
基金
爱尔兰科学基金会;
关键词
Multivariate time series; Channel selection; Scalability; Classification; STATISTICAL COMPARISONS;
D O I
10.1007/s10618-022-00909-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Accuracy is a key focus of current work in time series classification. However, speed and data reduction are equally important in many applications, especially when the data scale and storage requirements rapidly increase. Current multivariate time series classification (MTSC) algorithms need hundreds of compute hours to complete training and prediction. This is due to the nature of multivariate time series data which grows with the number of time series, their length and the number of channels. In many applications, not all the channels are useful for the classification task, hence we require methods that can efficiently select useful channels and thus save computational resources. We propose and evaluate two methods for channel selection. Our techniques work by representing each class by a prototype time series and performing channel selection based on the prototype distance between classes. The main hypothesis is that useful channels enable better separation between classes; hence, channels with a larger distance between class prototypes are more useful. On the UEA MTSC benchmark, we show that these techniques achieve significant data reduction and classifier speedup for similar levels of classification accuracy. Channel selection is applied as a pre-processing step before training state-of-the-art MTSC algorithms and saves about 70% of computation time and data storage with preserved accuracy. Furthermore, our methods enable efficient classifiers, such as ROCKET, to achieve better accuracy than using no selection or greedy forward channel selection. To further study the impact of our techniques, we present experiments on classifying synthetic multivariate time series datasets with more than 100 channels, as well as a real-world case study on a dataset with 50 channels. In both cases, our channel selection methods result in significant data reduction with preserved or improved accuracy.
引用
收藏
页码:1010 / 1054
页数:45
相关论文
共 50 条
  • [31] Scalar attributes and time series combining patterns for multivariate time series classification
    Fekete, Gyorgy
    Molnar, Andras
    2022 IEEE 22ND INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND INFORMATICS AND 8TH IEEE INTERNATIONAL CONFERENCE ON RECENT ACHIEVEMENTS IN MECHATRONICS, AUTOMATION, COMPUTER SCIENCE AND ROBOTICS (CINTI-MACRO), 2022, : 335 - 339
  • [32] A Novel Channel and Temporal-Wise Attention in Convolutional Networks for Multivariate Time Series Classification
    Cheng, Xu
    Han, Peihua
    Li, Guoyuan
    Chen, Shengyong
    Zhang, Houxiang
    IEEE ACCESS, 2020, 8 : 212247 - 212257
  • [33] Feature Selection-Based Multiview Concentration for Multivariate Time Series Classification and Its Application
    He, Changchun
    Huo, Xin
    Zhu, Chao
    Chen, Songlin
    IEEE SENSORS JOURNAL, 2024, 24 (04) : 4798 - 4806
  • [34] Boosting for Real-Time Multivariate Time Series Classification
    Wang, Haishuai
    Wu, Jun
    THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 4999 - 5000
  • [35] Boosting Commit Classification Based on Multivariate Mixed Features and Heterogeneous Classifier Selection
    Wu, Yuhan
    Li, Yingling
    Wang, Ziao
    Tan, Qushan
    Liu, Jing
    Jiang, Yuao
    INTERNATIONAL JOURNAL OF SOFTWARE ENGINEERING AND KNOWLEDGE ENGINEERING, 2024, 34 (12) : 1949 - 1970
  • [36] Regularized shapelet learning for scalable time series classification
    Zhao, Huiyun
    Pan, Zhisong
    Tao, Wei
    COMPUTER NETWORKS, 2020, 173 (173)
  • [37] Scalable and accurate subsequence transform for time series classification
    Mbouopda, Michael Franklin
    Nguifo, Engelbert Mephu
    PATTERN RECOGNITION, 2024, 147
  • [38] RED CoMETS: An Ensemble Classifier for Symbolically Represented Multivariate Time Series
    Bennett, Luca A.
    Abdallah, Zahraa S.
    ADVANCED ANALYTICS AND LEARNING ON TEMPORAL DATA, AALTD 2023, 2023, 14343 : 76 - 91
  • [39] Distance function selection for multivariate time-series
    Morgachev, Gleb
    Goncharov, Alexey
    Strijov, Vadim
    2019 INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE: APPLICATIONS AND INNOVATIONS (IC-AIAI 2019), 2019, : 66 - 70
  • [40] Decomposability and selection of graphical models for multivariate time series
    Fried, R
    Didelez, V
    BIOMETRIKA, 2003, 90 (02) : 251 - 267