A GSP-based efficient algorithm for mining frequent sequences

被引:0
|
作者
Zhang, MH [1 ]
Kao, B [1 ]
Yip, CL [1 ]
Cheung, D [1 ]
机构
[1] Univ Hong Kong, Dept Comp Sci & Informat Syst, Hong Kong, Hong Kong, Peoples R China
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper studies the problem of mining frequent sequences in transactional databases. In [6], Agrawal and Srikant proposed the GSP algorithm for extracting frequently occurring sequences. GSP is an iterative algorithm. It scans the database a number of times depending on the length of the longest frequent sequences in the database. The I/O cost is thus substantial if the database contains very long frequent sequences. In this paper, we extend the candidate generating function used by GSP and propose a new two-stage algorithm ATS. Our algorithm first mines a sample of the database to obtain a rough estimate of the frequent sequences and then refines the solution. Experiment results show that MFS saves I/O cost significantly compared with GSP.
引用
收藏
页码:497 / 503
页数:7
相关论文
共 50 条
  • [1] SPADE: An Efficient Algorithm for Mining Frequent Sequences
    Mohammed J. Zaki
    Machine Learning, 2001, 42 : 31 - 60
  • [2] SPADE: An efficient algorithm for mining frequent sequences
    Zaki, MJ
    MACHINE LEARNING, 2001, 42 (1-2) : 31 - 60
  • [3] An Efficient Algorithm for Mining Frequent Sequences in Dynamic Environment
    Li, Guangyuan
    Xiao, Qin
    Hu, Qinbin
    Yuan, Changan
    2009 IEEE INTERNATIONAL CONFERENCE ON GRANULAR COMPUTING ( GRC 2009), 2009, : 329 - 333
  • [4] An efficient mining maximal frequent traversal sequences algorithm based on bidirectional constraint
    Ren, Jia-Dong
    Zhang, Xiao-Jian
    Peng, Hui-Li
    PROCEEDINGS OF 2006 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-7, 2006, : 1575 - +
  • [5] PROWL: An efficient frequent continuity mining algorithm on event sequences
    Huang, KY
    Chang, CH
    Lin, KZ
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2004, 3181 : 351 - 360
  • [6] A probabilistic algorithm for mining frequent sequences
    Tumasonis, R
    Dzemyda, G
    ADBIS' 04: EIGHTH EAST-EUROPEAN CONFERENCE ON ADVANCES IN DATABASES AND INFORMATION SYSTEMS, PROCEEDINGS, 2004, : 89 - 98
  • [7] Study and implementation of frequent sequences mining based prefetching algorithm
    Wang F.
    Wang P.
    Zhu C.
    Jisuanji Yanjiu yu Fazhan/Computer Research and Development, 2016, 53 (02): : 443 - 448
  • [8] An efficient algorithm of frequent itemsets mining based on MapReduce
    Wang, Le
    Feng, Lin
    Zhang, Jing
    Liao, Pengyu
    Journal of Information and Computational Science, 2014, 11 (08): : 2809 - 2816
  • [9] A Novel Frequent Trajectory Mining Method Based on GSP
    Li, Junhuai
    Wang, Jinqin
    Yu, Lei
    Zhang, Jing
    WEB INFORMATION SYSTEMS AND MINING, PT I, 2011, 6987 : 134 - 140
  • [10] An efficient algorithm for mining frequent sequences by a new strategy without support counting
    Chiu, DY
    Wu, YH
    Chen, ALP
    20TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2004, : 375 - 386