PSP-AMS: Progressive Mining of Sequential Patterns Across Multiple Streams

Cited by: 13
Authors
Jaysawal, Bijay Prasad [1]
Huang, Jen-Wei [1]
Affiliation
[1] Natl Cheng Kung Univ, Inst Comp & Commun Engn, 1 Univ Rd, Tainan 701, Taiwan
Keywords
Progressive mining; sequential patterns; multiple data streams; across data streams; across-streams sequential patterns; sliding window
DOI
10.1145/3281632
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Sequential pattern mining is used to find frequent data sequences over time. When sequential patterns are generated, the newly arriving patterns may not be identified as frequent sequential patterns due to the existence of old data and sequences. Progressive sequential pattern mining aims to find the most up-to-date sequential patterns given that obsolete items will be deleted from the sequences. When sequences come with multiple data streams, it is difficult to maintain and update the current sequential patterns. Even worse, when we consider the sequences across multiple streams, previous methods cannot efficiently compute the frequent sequential patterns. In this work, we propose an efficient algorithm PSP-AMS to address this problem. PSP-AMS uses a novel data structure PSP-MS-tree to insert new items, update current items, and delete obsolete items. By maintaining a PSP-MS-tree, PSP-AMS efficiently finds the frequent sequential patterns across multiple streams. The experimental results show that PSP-AMS significantly outperforms previous algorithms for mining of progressive sequential patterns across multiple streams on synthetic data as well as real data.
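The idea of deleting obsolete items before counting support can be illustrated with a minimal sketch. This is a naive baseline, not PSP-AMS and not the PSP-MS-tree: the class name `SlidingWindowMiner`, the `window` and `min_support` parameters, and the restriction to two-item patterns are all assumptions made for illustration. It also assumes events are added in timestamp order within each sequence.

```python
from collections import defaultdict, deque


class SlidingWindowMiner:
    """Naive baseline (hypothetical, not PSP-AMS): recount pairs on demand."""

    def __init__(self, window, min_support):
        self.window = window            # number of most recent timestamps kept
        self.min_support = min_support  # fraction of live sequences required
        # sequence id -> deque of (timestamp, stream id, item), oldest first
        self.events = defaultdict(deque)

    def add(self, seq_id, timestamp, stream_id, item):
        """Record one item arriving on one stream for one sequence."""
        self.events[seq_id].append((timestamp, stream_id, item))

    def expire(self, now):
        """Delete obsolete items, i.e. those older than the window."""
        oldest = now - self.window + 1
        for seq_id in list(self.events):
            queue = self.events[seq_id]
            while queue and queue[0][0] < oldest:
                queue.popleft()
            if not queue:
                del self.events[seq_id]

    def frequent_pairs(self):
        """Return ordered pairs ((stream, item), (stream, item)) that occur
        in at least min_support of the sequences still in the window."""
        counts = defaultdict(int)
        for queue in self.events.values():
            evs = list(queue)
            seen = set()
            for i, (t1, s1, a) in enumerate(evs):
                for t2, s2, b in evs[i + 1:]:
                    if t1 < t2:  # strictly later, possibly on another stream
                        seen.add(((s1, a), (s2, b)))
            for pair in seen:    # count each pair once per sequence
                counts[pair] += 1
        total = len(self.events)
        return {p: c for p, c in counts.items()
                if total and c / total >= self.min_support}
```

Recounting every pair on each call is exactly the cost that an incrementally maintained structure is meant to avoid, so this sketch only shows what is being computed, not how the paper computes it efficiently.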
Pages: 23
Related papers
50 in total
  • [31] A Geometric Approach for Mining Sequential Patterns in Interval-Based Data Streams
    Hassani, Marwan
    Lu, Yifeng
    Wischnewsky, Jens
    Seidl, Thomas
    2016 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS (FUZZ-IEEE), 2016, : 2128 - 2135
  • [32] Parallel mining of maximal sequential patterns using multiple samples
    Congnan Luo
    Soon M. Chung
    The Journal of Supercomputing, 2012, 59 : 852 - 881
  • [33] Efficient Mining of Maximal Sequential Patterns Using Multiple Samples
    Luo, Congnan
    Chung, Soon M.
    PROCEEDINGS OF THE FIFTH SIAM INTERNATIONAL CONFERENCE ON DATA MINING, 2005, : 415 - 426
  • [34] Parallel mining of maximal sequential patterns using multiple samples
    Luo, Congnan
    Chung, Soon M.
    JOURNAL OF SUPERCOMPUTING, 2012, 59 (02): : 852 - 881
  • [35] Statistical supports for mining sequential patterns and improving the incremental update process on data streams
    Laur, Pierre-Alain
    Symphor, Jean-Emile
    Nock, Richard
    Poncelet, Pascal
    INTELLIGENT DATA ANALYSIS, 2007, 11 (01) : 29 - 47
  • [36] Mining fuzzy sequential patterns from multiple-item transactions
    Hong, TP
    Lin, KY
    Wang, SL
    JOINT 9TH IFSA WORLD CONGRESS AND 20TH NAFIPS INTERNATIONAL CONFERENCE, PROCEEDINGS, VOLS. 1-5, 2001, : 1317 - 1321
  • [37] Mining High Utility Sequential Patterns Using Multiple Minimum Utility
    Xu, Tiantian
    Xu, Jianliang
    Dong, Xiangjun
    INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2018, 32 (10)
  • [38] Mining Sequential Patterns from Web Click-Streams Based on Position Linked List
    Sun, Jintao
    Xie, Huosheng
    ASIA-PACIFIC YOUTH CONFERENCE ON COMMUNICATION TECHNOLOGY 2010 (APYCCT 2010), 2010, : 466 - 470
  • [39] Hiding co-occurring prioritized sensitive patterns over distributed progressive sequential data streams
    Keshavamurthy, Bettahally N.
    Toshniwal, Durga
    Eshwar, Bhavani K.
    JOURNAL OF NETWORK AND COMPUTER APPLICATIONS, 2012, 35 (03) : 1116 - 1129
  • [40] Frequent Subgraph Mining of Functional Interaction Patterns Across Multiple Cancers
    Durmaz, Arda
    Henderson, Tim A. D.
    Bebek, Gurkan
    PACIFIC SYMPOSIUM ON BIOCOMPUTING 2021, 2021, : 261 - 272