Incremental update on sequential patterns in large databases by implicit merging and efficient counting

被引:29
|
作者
Lin, MY [1 ]
Lee, SY [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Comp Sci & Informat Engn, Hsinchu 30050, Taiwan
关键词
data mining; sequential patterns; incremental update; sequence discovery; sequence merging;
D O I
10.1016/S0306-4379(03)00036-X
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Current approaches for sequential pattern mining usually assume that the mining is performed in a static sequence database. However, databases are not static due to update so that the discovered patterns might become invalid and new patterns could be created. In addition to higher complexity, the maintenance of sequential patterns is more challenging than that of association rules owing to sequence merging. Sequence merging, which is unique in sequence databases, requires the appended new sequences to be merged with the existing ones if their customer ids are the same. Re-mining of the whole database appears to be inevitable since the information collected in previous discovery will be corrupted by sequence merging. Instead of re-mining, the proposed IncSP (Incremental Sequential Pattern Update) algorithm solves the maintenance problem through effective implicit merging and efficient separate counting over appended sequences. Patterns found previously are incrementally updated rather than re-mined from scratch. Moreover, the technique of early candidate pruning further speeds up the discovery of new patterns. Empirical evaluation using comprehensive synthetic data shows that IncSP is fast and scalable. (C) 2003 Elsevier Ltd. All rights reserved.
引用
收藏
页码:385 / 404
页数:20
相关论文
共 50 条
  • [1] Incremental update on sequential patterns in large databases
    Lin, MY
    Lee, SY
    TENTH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 1998, : 24 - 31
  • [2] Efficient algorithms for incremental maintenance of closed sequential patterns in large databases
    Chang, Lei
    Wang, Tengjiao
    Yang, Dongqing
    Luan, Hua
    Tang, Shiwei
    DATA & KNOWLEDGE ENGINEERING, 2009, 68 (01) : 68 - 106
  • [3] Incremental mining of sequential patterns in large databases
    Masseglia, F
    Poncelet, P
    Teisseire, M
    DATA & KNOWLEDGE ENGINEERING, 2003, 46 (01) : 97 - 121
  • [4] An Efficient Approach to Discovering Sequential Patterns in Large Databases
    Yen, Show-Jane
    Cho, Chung-Wen
    LECTURE NOTES IN COMPUTER SCIENCE <D>, 2000, 1910 : 685 - 690
  • [5] Incremental Mining of High Utility Sequential Patterns in Incremental Databases
    Wang, Jun-Zhe
    Huang, Jiun-Long
    CIKM'16: PROCEEDINGS OF THE 2016 ACM CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, 2016, : 2341 - 2346
  • [6] An Efficient Algorithm for Mining Maximal Frequent Sequential Patterns in Large Databases
    Su, Qiu-bin
    Lu, Lu
    Cheng, Bin
    2018 INTERNATIONAL CONFERENCE ON COMMUNICATION, NETWORK AND ARTIFICIAL INTELLIGENCE (CNAI 2018), 2018, : 404 - 410
  • [7] Mining weighted sequential patterns in incremental uncertain databases
    Roy, Kashob Kumar
    Moon, Md Hasibul Haque
    Rahman, Md Mahmudur
    Ahmed, Chowdhury Farhan
    Leung, Carson Kai-Sang
    INFORMATION SCIENCES, 2022, 582 : 865 - 896
  • [8] An efficient algorithm for incremental mining of sequential patterns
    Ren, Jia-Dong
    Zhou, Xiao-Lei
    ADVANCES IN MACHINE LEARNING AND CYBERNETICS, 2006, 3930 : 179 - 188
  • [9] An Incremental Technique for Mining Coverage Patterns in Large Databases
    Ralla, Akhil
    Reddy, P. Krishna
    Mondal, Anirban
    2019 IEEE INTERNATIONAL CONFERENCE ON DATA SCIENCE AND ADVANCED ANALYTICS (DSAA 2019), 2019, : 211 - 220
  • [10] An efficient mining method for incremental updation in large databases
    Lee, WJ
    Lee, SJ
    INTELLIGENT DATA ENGINEERING AND AUTOMATED LEARNING, 2003, 2690 : 630 - 637