Matrix Profile XIII: Time Series Snippets: A New Primitive for Time Series Data Mining

被引:28
|
作者
Imani, Shima [1 ]
Madrid, Frank [1 ]
Ding, Wei [2 ]
Crouter, Scott [3 ]
Keogh, Eamonn [1 ]
机构
[1] Univ Calif Riverside, Dept Comp Sci & Engn, Riverside, CA 92521 USA
[2] Univ Massachusetts Boston, Dept Comp Sci, Boston, MA USA
[3] Univ Tennessee, Coll Educ Hlth & Human Sci, Knoxville, TN USA
关键词
time series; motifs; sampling; diversification;
D O I
10.1109/ICBK.2018.00058
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Perhaps the most basic query made by a data analyst confronting a new data source is "Show me some representative/typical data." Answering this question is trivial in many domains, but surprisingly, it is very difficult in large time series datasets. The major difficulty is not time or space complexity, but defining what it means to be representative data in this domain. In this work, we show that the obvious candidate definitions: motifs, shapelets, cluster centers, random samples etc., are all poor choices. Thus motivated, we introduce time series snippets, a novel representation of typical time series subsequences. Beyond their utility for visualizing and summarizing massive time series collections, we show that time series snippets have utility for high-level comparison of large time series collections.
引用
收藏
页码:382 / 389
页数:8
相关论文
共 50 条
  • [11] Data mining in medical time series
    Mikut, Ralf
    Reischl, Markus
    Burmeister, Ole
    Loose, Tobias
    BIOMEDIZINISCHE TECHNIK, 2006, 51 (5-6): : 288 - 293
  • [12] A review on time series data mining
    Fu, Tak-chung
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2011, 24 (01) : 164 - 181
  • [13] Process Mining for Time Series Data
    Ziolkowski, Tobias
    Koschmider, Agnes
    Schubert, Rene
    Renz, Matthias
    ENTERPRISE, BUSINESS-PROCESS AND INFORMATION SYSTEMS MODELING, 2022, 450 : 347 - 350
  • [14] Time series financial data mining
    Tseng, CC
    Kang, CT
    PROCEEDINGS OF THE 8TH JOINT CONFERENCE ON INFORMATION SCIENCES, VOLS 1-3, 2005, : 1035 - 1038
  • [15] A Survey on Time Series Data Mining
    Fakhrazari, Amin
    Vakilzadian, Hamid
    2017 IEEE INTERNATIONAL CONFERENCE ON ELECTRO INFORMATION TECHNOLOGY (EIT), 2017, : 476 - 481
  • [16] On privacy in time series data mining
    Zhu, Ye
    Fu, Yongjian
    Fu, Huirong
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2008, 5012 : 479 - +
  • [17] Time-Series Data Mining
    Esling, Philippe
    Agon, Carlos
    ACM COMPUTING SURVEYS, 2012, 45 (01)
  • [18] A new segmented time warping distance for data mining in time series database
    Xiao, H
    Feng, XF
    Hu, YF
    PROCEEDINGS OF THE 2004 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2004, : 1277 - 1281
  • [19] A New Model for Multiple Time Series Based on Data Mining
    Chen Zhuo
    Yang Bing-ru
    Zhou Fa-guo
    Li Lin-na
    Zhao Yun-feng
    KAM: 2008 INTERNATIONAL SYMPOSIUM ON KNOWLEDGE ACQUISITION AND MODELING, PROCEEDINGS, 2008, : 39 - 43
  • [20] DLCSS: A new similarity measure for time series data mining
    Soleimani, Gholamreza
    Abessi, Masoud
    ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2020, 92