Incremental update on sequential patterns in large databases by implicit merging and efficient counting

被引:29
|
作者
Lin, MY [1 ]
Lee, SY [1 ]
机构
[1] Natl Chiao Tung Univ, Dept Comp Sci & Informat Engn, Hsinchu 30050, Taiwan
关键词
data mining; sequential patterns; incremental update; sequence discovery; sequence merging;
D O I
10.1016/S0306-4379(03)00036-X
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Current approaches for sequential pattern mining usually assume that the mining is performed in a static sequence database. However, databases are not static due to update so that the discovered patterns might become invalid and new patterns could be created. In addition to higher complexity, the maintenance of sequential patterns is more challenging than that of association rules owing to sequence merging. Sequence merging, which is unique in sequence databases, requires the appended new sequences to be merged with the existing ones if their customer ids are the same. Re-mining of the whole database appears to be inevitable since the information collected in previous discovery will be corrupted by sequence merging. Instead of re-mining, the proposed IncSP (Incremental Sequential Pattern Update) algorithm solves the maintenance problem through effective implicit merging and efficient separate counting over appended sequences. Patterns found previously are incrementally updated rather than re-mined from scratch. Moreover, the technique of early candidate pruning further speeds up the discovery of new patterns. Empirical evaluation using comprehensive synthetic data shows that IncSP is fast and scalable. (C) 2003 Elsevier Ltd. All rights reserved.
引用
收藏
页码:385 / 404
页数:20
相关论文
共 50 条
  • [21] Improvements of IncSpan: Incremental mining of sequential patterns in large database
    Nguyen, SN
    Sun, XZ
    Orlowska, ME
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2005, 3518 : 442 - 451
  • [22] An efficient approach for finding weighted sequential patterns from sequence databases
    Lan, Guo-Cheng
    Hong, Tzung-Pei
    Lee, Hong-Yu
    APPLIED INTELLIGENCE, 2014, 41 (02) : 439 - 452
  • [23] An efficient approach for finding weighted sequential patterns from sequence databases
    Guo-Cheng Lan
    Tzung-Pei Hong
    Hong-Yu Lee
    Applied Intelligence, 2014, 41 : 439 - 452
  • [24] Mining Positive and Negative Fuzzy Sequential Patterns in Large Transaction Databases
    Ouyang, Weimin
    Huang, Qinhua
    Luo, Shuanghu
    FIFTH INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY, VOL 5, PROCEEDINGS, 2008, : 18 - +
  • [25] Mining direct and indirect fuzzy sequential patterns in large transaction databases
    Ouyang, Weimin
    Huang, Qinhua
    ADVANCED INTELLIGENT COMPUTING THEORIES AND APPLICATIONS: WITH ASPECTS OF CONTEMPORARY INTELLIGENT COMPUTING TECHNIQUES, 2007, 2 : 180 - +
  • [26] A new framework for detecting weighted sequential patterns in large sequence databases
    Yun, Unil
    KNOWLEDGE-BASED SYSTEMS, 2008, 21 (02) : 110 - 122
  • [27] PAID: Mining sequential patterns by PAssed Item Deduction in large databases
    Yang, Zhenglu
    Kitsuregawa, Masaru
    Wang, Yitong
    10TH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2006, : 113 - 120
  • [28] Efficient Discovery of Partial Periodic Patterns in Large Temporal Databases
    Kiran, Rage Uday
    Veena, Pamalla
    Ravikumar, Penugonda
    Saideep, Chennupati
    Zettsu, Koji
    Shang, Haichuan
    Toyoda, Masashi
    Kitsuregawa, Masaru
    Reddy, P. Krishna
    ELECTRONICS, 2022, 11 (10)
  • [29] Statistical supports for mining sequential patterns and improving the incremental update process on data streams
    Laur, Pierre-Alain
    Symphor, Jean-Emile
    Nock, Richard
    Poncelet, Pascal
    INTELLIGENT DATA ANALYSIS, 2007, 11 (01) : 29 - 47
  • [30] Efficient Incremental Mining of Qualified Web Traversal Patterns without Scanning Original Databases
    Ying, Jia-Ching
    Tseng, Vincent S.
    Yu, Philip S.
    2009 IEEE INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW 2009), 2009, : 338 - +