An adaptive approximation method to discover frequent itemsets over sliding-window-based data streams

被引:12
|
作者
Li, Chao-Wei [1 ]
Jea, Kuen-Fang [1 ]
机构
[1] Natl Chung Hsing Univ, Dept Comp Sci & Engn, Taichung 40227, Taiwan
关键词
Data stream; Frequent itemset; Sliding window; Combinatorial Approximation; Adaptive approximation; Concept drift;
D O I
10.1016/j.eswa.2011.04.167
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Frequent-pattern discovery in data streams is more challenging than that in traditional databases since several requirements need to be additionally satisfied. For the sliding-window model of data streams, transactions both enter into and leave from the window at each sliding. In this paper, we propose an approximation method for mining frequent itemsets over the sliding window of a data stream. The proposed method could approximate itemsets' counts from the counts of their subsets instead of scanning the transactions for them. By noticing the more dynamic feature of sliding-window model, we have made an effort to devise a promising technique which enables the proposed method to approximate for itemsets adaptively. In addition, another technique which may adjust and correct the approximations is also designed. Empirical results have shown that the performance of proposed method is quite efficient and stable; moreover, the mining result from adaptive approximation (and approximation adjustment) achieves high accuracy. (C) 2011 Elsevier Ltd. All rights reserved.
引用
收藏
页码:13386 / 13404
页数:19
相关论文
共 50 条
  • [41] A Sliding Window-Based Approach for Mining Frequent Weighted Patterns Over Data Streams
    Bui, Huong
    Nguyen-Hoang, Tu-Anh
    Vo, Bay
    Nguyen, Ham
    Le, Tuong
    IEEE ACCESS, 2021, 9 : 56318 - 56329
  • [42] estWin:: Online data stream mining of recent frequent itemsets by sliding window method
    Chang, JH
    Lee, WS
    JOURNAL OF INFORMATION SCIENCE, 2005, 31 (02) : 76 - 90
  • [43] A dynamic layout of sliding window for frequent itemset mining over data streams
    Deypir, Mahmood
    Sadreddini, Mohammad Hadi
    JOURNAL OF SYSTEMS AND SOFTWARE, 2012, 85 (03) : 746 - 759
  • [44] Mining the frequent patterns in an arbitrary sliding window over online data streams
    Li, Guo-Hui
    Chen, Hui
    Ruan Jian Xue Bao/Journal of Software, 2008, 19 (10): : 2585 - 2596
  • [45] Mining frequent itemsets over data streams with multiple time-sensitive sliding windows
    Jin, Long
    Chai, Duck Jin
    Lee, Yang Koo
    Ryu, Keun Ho
    ALPIT 2007: PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON ADVANCED LANGUAGE PROCESSING AND WEB INFORMATION TECHNOLOGY, 2007, : 486 - +
  • [46] Variable slide window based frequent itemsets mining algorithm on large data streams
    Zhu, Xiao-Dong
    Huang, Zhi-Qiu
    Shen, Guo-Hua
    Yuan, Min
    Kongzhi yu Juece/Control and Decision, 2009, 24 (06): : 832 - 836
  • [47] Maintaining Only Frequent Itemsets to Mine Approximate Frequent Itemsets over Online Data Streams
    Wang, Yongyan
    Li, Kun
    Wang, Hongan
    2009 IEEE SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DATA MINING, 2009, : 381 - 388
  • [48] Moment: Maintaining closed frequent itemsets over a stream sliding window
    Chi, Y
    Wang, HX
    Yu, PS
    Muntz, RR
    FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, : 59 - 66
  • [49] Online mining closed frequent itemsets over a stream sliding window
    Ao, Fu-Jiang
    Du, Jing
    Yan, Yue-Jin
    Huang, Ke-Di
    Xi Tong Gong Cheng Yu Dian Zi Ji Shu/Systems Engineering and Electronics, 2009, 31 (05): : 1235 - 1240
  • [50] Frequent itemsets mining based on concept lattice and sliding window
    Chang-Sheng, Zhang
    Jing, Ruan
    Hai-Long, Huang
    long-Chang, Li
    Bing-Ru, Yang
    Telkomnika - Indonesian Journal of Electrical Engineering, 2013, 11 (08): : 4780 - 4787