Random pairwise shapelets forest: an effective classifier for time series

被引:8
|
作者
Yuan, Jidong [1 ]
Shi, Mohan [2 ]
Wang, Zhihai [1 ]
Liu, Haiyang [1 ]
Li, Jinyang [3 ]
机构
[1] Beijing Jiaotong Univ, Sch Comp & Informat Technol, Beijing 100044, Peoples R China
[2] Beijing Jingdong 360 Degree E Commerce Co Ltd, Beijing, Peoples R China
[3] Univ Hong Kong, Dept Comp Sci, Pok Fu Lam, Hong Kong, Peoples R China
基金
中国国家自然科学基金;
关键词
Time series classification; Pairwise shapelets; Random forest; Decomposed mean decrease impurity; REPRESENTATION; SIMILARITY; FEATURES;
D O I
10.1007/s10115-021-01630-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Shapelet is a discriminative subsequence of time series. An advanced shapelet-based method is to embed shapelet into the accurate and fast random forest. However, there are several limitations. First, random shapelet forest requires a large training cost for split threshold searching. Second, a single shapelet provides limited information for only one branch of the decision tree, resulting in insufficient accuracy. Third, the randomized ensemble decreases comprehensibility. For that, this paper presents Random Pairwise Shapelets Forest (RPSF). RPSF combines a pair of shapelets from different classes to construct random forest. It omits threshold searching to be more efficient, includes more information about each node of the forest to be more effective. Moreover, a discriminability measure, Decomposed Mean Decrease Impurity, is proposed to identify the influential region for each class. Extensive experiments show that RPSF is competitive compared with other methods, while it improves the training speed of shapelet-based forest.
引用
收藏
页码:143 / 174
页数:32
相关论文
共 50 条
  • [1] Random pairwise shapelets forest: an effective classifier for time series
    Jidong Yuan
    Mohan Shi
    Zhihai Wang
    Haiyang Liu
    Jinyang Li
    Knowledge and Information Systems, 2022, 64 : 143 - 174
  • [2] Random Pairwise Shapelets Forest
    Shi, Mohan
    Wang, Zhihai
    Yuan, Jidong
    Liu, Haiyang
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PAKDD 2018, PT I, 2018, 10937 : 68 - 80
  • [3] Proximity Forest: an effective and scalable distance-based classifier for time series
    Benjamin Lucas
    Ahmed Shifaz
    Charlotte Pelletier
    Lachlan O’Neill
    Nayyar Zaidi
    Bart Goethals
    François Petitjean
    Geoffrey I. Webb
    Data Mining and Knowledge Discovery, 2019, 33 : 607 - 635
  • [4] Proximity Forest: an effective and scalable distance-based classifier for time series
    Lucas, Benjamin
    Shifaz, Ahmed
    Pelletier, Charlotte
    O'Neill, Lachlan
    Zaidi, Nayyar
    Goethals, Bart
    Petitjean, Francois
    Webb, Geoffrey I.
    DATA MINING AND KNOWLEDGE DISCOVERY, 2019, 33 (03) : 607 - 635
  • [5] Random forest prediction of Alzheimer's disease using pairwise selection from time series data
    Moore, P. J.
    Lyons, T. J.
    Gallacher, J.
    Weiner, Michael W.
    Aisen, Paul
    Petersen, Ronald
    Jack, Clifford R., Jr.
    Jagust, William
    Trojanowki, John Q.
    Toga, Arthur W.
    Beckett, Laurel
    Green, Robert C.
    Saykin, Andrew J.
    Morris, John
    Shaw, Leslie M.
    Khachaturian, Zaven
    Sorensen, Greg
    Carrillo, Maria
    Kuller, Lew
    Raichle, Marc
    Paul, Steven
    Davies, Peter
    Fillit, Howard
    Hefti, Franz
    Holtzman, David
    Mesulam, M. Marcel
    Potter, William
    Snyder, Peter
    Montine, Tom
    Jimenez, Gustavo
    Donohue, Michael
    Gessert, Devon
    Harless, Kelly
    Salazar, Jennifer
    Cabrera, Yuliana
    Walter, Sarah
    Hergesheimer, Lindsey
    Harvey, Danielle
    Donohue, Michael
    Bernstein, Matthew
    Fox, Nick
    Thompson, Paul
    Schuff, Norbert
    DeCArli, Charles
    Borowski, Bret
    Gunter, Jeff
    Senjem, Matt
    Vemuri, Prashanthi
    Jones, David
    Kantarci, Kejal
    PLOS ONE, 2019, 14 (02):
  • [6] Discovering Malware with Time Series Shapelets
    Patri, Om P.
    Wojnowicz, Michael T.
    Wolff, Matt
    PROCEEDINGS OF THE 50TH ANNUAL HAWAII INTERNATIONAL CONFERENCE ON SYSTEM SCIENCES, 2017, : 6079 - 6088
  • [7] Learning Time-Series Shapelets
    Grabocka, Josif
    Schilling, Nicolas
    Wistuba, Martin
    Schmidt-Thieme, Lars
    PROCEEDINGS OF THE 20TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY AND DATA MINING (KDD'14), 2014, : 392 - 401
  • [8] Proximity forest 2.0: a new effective and scalable similarity-based classifier for time series
    Tan, Chang Wei
    Herrmann, Matthieu
    Salehi, Mahsa
    Webb, Geoffrey I.
    DATA MINING AND KNOWLEDGE DISCOVERY, 2025, 39 (02)
  • [9] GARLIC MAPPING FOR SENTINEL-2 TIME-SERIES DATA USING A RANDOM FOREST CLASSIFIER
    Chai, Zhaoyang
    Zhang, Hongyan
    Xu, Xiong
    Zhang, Liangpei
    2019 IEEE INTERNATIONAL GEOSCIENCE AND REMOTE SENSING SYMPOSIUM (IGARSS 2019), 2019, : 7224 - 7227
  • [10] Time-series Shapelets with Learnable Lengths
    Yamaguchi, Akihiro
    Ueno, Ken
    Kashima, Hisashi
    PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 2866 - 2876