Discovering Patterns from Large and Dynamic Sequential Data

Cited by: 5
Authors
Wang K. [1]
Affiliations
[1] Dept. of Info. Syst. and Comp. Sci., National University of Singapore, Lower Kent Ridge Road, Singapore 119260
Keywords
Combinatorial pattern matching; Data mining; Sequential pattern; Suffix tree; Update
DOI
10.1023/A:1008689103430
Abstract
Most everyday and scientific data are sequential in nature. Discovering important patterns in such data benefits users and scientists: patterns help predict upcoming activities, interpret recurring phenomena, highlight notable similarities and differences for closer attention, compress data, and detect intrusions. We consider the following incremental discovery problem for large and dynamic sequential data. Suppose that patterns have already been discovered and materialized, and an update is then made to the sequential database. Incremental discovery takes advantage of the previously discovered patterns and computes only the change, accessing only the affected part of the database and data structures. In addition to the patterns themselves, their statistics and position information must be updated to allow further analysis and processing. We present an efficient algorithm for this incremental discovery problem. The algorithm is applied to sequential data that conforms to several sequential patterns modeling weather changes in Singapore, and it finds the patterns it is expected to find. Experiments show that, for small updates and large databases, the running time of the incremental discovery algorithm is independent of the data size.
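The incremental idea in the abstract can be illustrated with a minimal Python sketch. It is not the paper's algorithm: a plain dictionary of bounded-length substrings stands in for the suffix tree named in the keywords, and the class name `IncrementalPatternIndex`, the parameter `max_len`, and the restriction to append-only updates are all assumptions made for illustration. The point it shows is that, after an append, only substrings ending inside the newly added region need to be indexed, so the work depends on the update size and `max_len` rather than on the total data size.

```python
from collections import defaultdict


class IncrementalPatternIndex:
    """Tracks start positions of every substring of length <= max_len.

    Hypothetical illustration only: a dictionary replaces the suffix tree,
    and only appends at the end of the sequence are supported.
    """

    def __init__(self, max_len):
        self.max_len = max_len
        self.data = []                      # the sequence seen so far
        self.positions = defaultdict(list)  # pattern -> ascending start positions

    def append(self, new_items):
        """Append items and index only the substrings that touch them."""
        old_n = len(self.data)
        self.data.extend(new_items)
        n = len(self.data)
        # A substring is new only if it ends inside the appended region, so
        # it starts at most max_len - 1 items before the old end.
        first_start = max(0, old_n - self.max_len + 1)
        for start in range(first_start, n):
            min_end = max(start + 1, old_n + 1)   # must reach the new region
            max_end = min(n, start + self.max_len)
            for end in range(min_end, max_end + 1):
                pattern = tuple(self.data[start:end])
                self.positions[pattern].append(start)

    def frequent(self, min_count):
        """Return patterns occurring at least min_count times, with positions."""
        return {p: pos for p, pos in self.positions.items()
                if len(pos) >= min_count}


# Toy weather-like sequence: S = sunny, R = rainy (made-up data).
index = IncrementalPatternIndex(max_len=3)
index.append(list("SRRS"))   # initial discovery
index.append(list("RR"))     # small update; only affected windows are scanned
print(index.positions[("S", "R", "R")])  # [0, 3]
```

Because both the occurrence counts (list lengths) and the start positions are kept per pattern, later analysis can run on the materialized index without rescanning the sequence. Handling deletions or updates in the middle of the sequence would need a different structure than this sketch.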
Pages: 33-56
Number of pages: 23