UP-tree & UP-Mine: A fast method based on upper bound for frequent pattern mining from uncertain data

Cited by: 5
Author
Davashi, Razieh [1, 2]
Affiliations
[1] Islamic Azad Univ, Najafabad Branch, Fac Comp Engn, Najafabad, Iran
[2] Islamic Azad Univ, Najafabad Branch, Big Data Res Ctr, Najafabad, Iran
Keywords
Data mining; Frequent pattern mining; Uncertain frequent pattern mining; Uncertain data; Expected support; EFFICIENT APPROACH; ALGORITHM
DOI
10.1016/j.engappai.2021.104477
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
In recent years, frequent pattern mining from uncertain data has been an active research topic in data mining. Numerous exact and upper bound-based approaches exist for uncertain frequent pattern mining. Exact algorithms may build large data structures and require time-consuming calculations, while upper bound-based algorithms may generate many false positives. As a result, both kinds of algorithm demand considerable time and memory. Efforts have been made to address the weakness of upper bound-based algorithms, but all of these methods only tighten the upper bound of expected support for long patterns, even though pruning infrequent short patterns has a greater impact on reducing false positives. To overcome these drawbacks, this paper proposes an efficient upper bound-based method for mining uncertain frequent patterns. The proposed method uses a new tightened upper bound on the expected support of patterns (Tup), which significantly reduces the number of false positives by tightening the upper bound of expected support and by pruning infrequent 2-itemsets and their supersets early. Comprehensive experimental results show that the proposed method reduces memory consumption in most cases and dramatically outperforms exact and upper bound-based methods in runtime and scalability on both dense and sparse uncertain data.
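To make the terms in the abstract concrete, the following is a minimal Python sketch of expected support and of why pruning infrequent 2-itemsets also rules out their supersets. It is not the paper's Tup bound or the UP-Mine algorithm, neither of which is specified in this record. It assumes the standard model in which each item in a transaction carries an independent existential probability, and the toy database `DB`, the function names, and the threshold are hypothetical illustrations.

```python
from itertools import combinations
from math import prod

# Hypothetical toy database: each transaction maps item -> existential probability.
DB = [
    {"a": 0.9, "b": 0.8, "c": 0.1},
    {"a": 0.5, "b": 0.2},
    {"a": 0.6, "c": 0.3},
]


def expected_support(itemset, db):
    """Sum, over transactions containing every item of the itemset, of the
    product of those items' probabilities (items assumed independent)."""
    return sum(
        prod(t[i] for i in itemset)
        for t in db
        if all(i in t for i in itemset)
    )


def frequent_pairs(db, minsup):
    """Return the 2-itemsets whose expected support reaches minsup."""
    items = sorted({i for t in db for i in t})
    return {p for p in combinations(items, 2) if expected_support(p, db) >= minsup}


def may_be_frequent(itemset, pairs):
    """Candidate filter (an assumption for illustration, not Tup): an itemset
    can only be frequent if every one of its 2-subsets is frequent, because
    adding items never increases expected support."""
    return all(p in pairs for p in combinations(sorted(itemset), 2))


pairs = frequent_pairs(DB, minsup=0.5)
print(pairs)                                   # surviving 2-itemsets
print(may_be_frequent(("a", "b", "c"), pairs))  # whether {a, b, c} is still a candidate
```

Because every probability is at most 1, extending an itemset can only shrink each per-transaction product, so an itemset's expected support never exceeds that of any of its subsets. This is why discarding an infrequent 2-itemset safely discards all of its supersets without introducing false negatives.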
Pages: 20