Effective database transformation and efficient support computation for mining sequential patterns

被引:0
|
作者
Cho, CW [1 ]
Wu, YH
Chen, ALP
机构
[1] Natl Tsing Hua Univ, Dept Comp Sci, Hsinchu, Taiwan
[2] Natl Chengchi Univ, Dept Comp Sci, Taipei 11623, Taiwan
关键词
data mining; sequential patterns; database transformation; frequent k-sequences;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we introduce a novel algorithm for mining sequential patterns from transaction databases. Since the FP-tree based approach is efficient in mining frequent itemsets, we adapt it to find frequent 1-sequences. For efficient frequent k-sequence mining, every frequent 1-sequence is encoded as a unique symbol and the database is transformed into one in the symbolic form. We observe that it is unnecessary to encode all the frequent 1-seqences, and make full use of the discovered frequent I-sequences to transform the database into one with a smallest size. To discover the frequent k-sequences, we design a tree structure to store the candidates. Each customer sequence is then scanned to decide whether the candidates are frequent k-sequences. We propose a technique to avoid redundantly enumerating the identical k-subsequences from a customer sequence to speed up the process. Moreover, the tree structure is designed in a way such that the supports of the candidates can be incremented for a customer sequence by a single sequential traversal of the tree. The experiment results show that our approach outperforms the previous works in various aspects including the scalability and the execution time.
引用
收藏
页码:163 / 174
页数:12
相关论文
共 50 条
  • [1] Effective database transformation and efficient support computation for mining sequential patterns
    Chung-Wen Cho
    Yi-Hung Wu
    Arbee L. P. Chen
    Journal of Intelligent Information Systems, 2009, 32 (1) : 23 - 51
  • [2] Effective database transformation and efficient support computation for mining sequential patterns
    Cho, Chung-Wen
    Wu, Yi-Hung
    Chen, Arbee L. P.
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2009, 32 (01) : 23 - 51
  • [3] Database support for data mining patterns
    Kotsifakos, E
    Ntoutsi, I
    Theodoridis, Y
    ADVANCES IN INFORMATICS, PROCEEDINGS, 2005, 3746 : 14 - 24
  • [4] An Effective Approach for Mining Weighted Sequential Patterns
    Patel, Mukesh
    Modi, Nilesh
    Passi, Kalpdrum
    SMART TRENDS IN INFORMATION TECHNOLOGY AND COMPUTER COMMUNICATIONS, SMARTCOM 2016, 2016, 628 : 904 - 915
  • [5] Approximate sequential patterns for incomplete sequence database mining
    Fiot, Celine
    Laurent, Anne
    Teisseire, Maguelonne
    2007 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOLS 1-4, 2007, : 663 - 668
  • [6] An efficient algorithm for incremental mining of sequential patterns
    Ren, Jia-Dong
    Zhou, Xiao-Lei
    ADVANCES IN MACHINE LEARNING AND CYBERNETICS, 2006, 3930 : 179 - 188
  • [7] A high efficient algorithm of mining sequential patterns
    Qin, F
    Yang, XB
    PROCEEDINGS OF THE 3RD WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION, VOLS 1-5, 2000, : 3750 - 3752
  • [8] Efficient Mining of Outlying Sequential Behavior Patterns
    Xu, Yifan
    Duan, Lei
    Xie, Guicai
    Fu, Min
    Li, Longhai
    Nummenmaa, Jyrki
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT II, 2021, 12682 : 325 - 341
  • [9] An efficient method for mining sequential patterns with indices
    Huynh, Huy Minh
    Nguyen, Loan T. T.
    Pham, Nam Ngoc
    Oplatkova, Zuzana Kominkova
    Yun, Unil
    Vo, Bay
    KNOWLEDGE-BASED SYSTEMS, 2022, 239
  • [10] NSPIS: Mining Negative Sequential Patterns with Individual Support
    Huang, Gengsen
    Gan, Wensheng
    Huang, Shan
    Chen, Jiahui
    Chen, Chien-Ming
    2021 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2021, : 5507 - 5516