A GSP-based efficient algorithm for mining frequent sequences

被引:0
|
作者
Zhang, MH [1 ]
Kao, B [1 ]
Yip, CL [1 ]
Cheung, D [1 ]
机构
[1] Univ Hong Kong, Dept Comp Sci & Informat Syst, Hong Kong, Hong Kong, Peoples R China
来源
IC-AI'2001: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOLS I-III | 2001年
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper studies the problem of mining frequent sequences in transactional databases. In [6], Agrawal and Srikant proposed the GSP algorithm for extracting frequently occurring sequences. GSP is an iterative algorithm. It scans the database a number of times depending on the length of the longest frequent sequences in the database. The I/O cost is thus substantial if the database contains very long frequent sequences. In this paper, we extend the candidate generating function used by GSP and propose a new two-stage algorithm ATS. Our algorithm first mines a sample of the database to obtain a rough estimate of the frequent sequences and then refines the solution. Experiment results show that MFS saves I/O cost significantly compared with GSP.
引用
收藏
页码:497 / 503
页数:7
相关论文
共 50 条
  • [21] An efficient algorithm for mining frequent closed itemsets
    Fang, Gang
    Wu, Yue
    Li, Ming
    Chen, Jia
    Informatica (Slovenia), 2015, 39 (01): : 87 - 98
  • [22] An efficient algorithm for approximate frequent intemset mining
    Uppal, Veepu
    International Journal of Database Theory and Application, 2015, 8 (03): : 279 - 288
  • [23] BitTableFI: An efficient mining frequent itemsets algorithm
    Dong, Jie
    Han, Min
    KNOWLEDGE-BASED SYSTEMS, 2007, 20 (04) : 329 - 335
  • [24] An Efficient Close Frequent Pattern Mining Algorithm
    Tan, Jun
    Bu, Yingyong
    Yang, Bo
    ICICTA: 2009 SECOND INTERNATIONAL CONFERENCE ON INTELLIGENT COMPUTATION TECHNOLOGY AND AUTOMATION, VOL I, PROCEEDINGS, 2009, : 528 - 531
  • [25] An Efficient Algorithm for Mining Frequent Closed Itemsets
    Fang, Gang
    Wu, Yue
    Li, Ming
    Chen, Jia
    INFORMATICA-JOURNAL OF COMPUTING AND INFORMATICS, 2015, 39 (01): : 87 - 98
  • [26] Efficient Data Streams Based Closed Frequent Itemsets Mining Algorithm
    Tan, Jun
    ADVANCES IN CIVIL ENGINEERING II, PTS 1-4, 2013, 256-259 : 2910 - 2913
  • [27] An Efficient Algorithm for Frequent Pattern Mining based on Privacy-preserving
    Zhang, Yaling
    Wang, Ting
    Wang, Shangping
    2017 13TH INTERNATIONAL CONFERENCE ON NATURAL COMPUTATION, FUZZY SYSTEMS AND KNOWLEDGE DISCOVERY (ICNC-FSKD), 2017,
  • [28] MFS-SubSC: an efficient algorithm for mining frequent sequences with sub-sequence constraint
    Duong, Hai
    Tran, Anh
    KNOWLEDGE AND INFORMATION SYSTEMS, 2024, 66 (10) : 6151 - 6186
  • [29] An efficient algorithm for mining maximal frequent sequences by Top-Down Delay Decomposition method
    Ren, J. (jdren@ysu.edu.cn), 2007, ICIC Express Letters Office (08):
  • [30] Efficient algorithms for mining and incremental update of maximal frequent sequences
    Kao, B
    Zhang, MH
    Yip, CL
    Cheung, DW
    DATA MINING AND KNOWLEDGE DISCOVERY, 2005, 10 (02) : 87 - 116