Frequent pattern mining on stream data using Hadoop CanTree-GTree

被引:7
|
作者
Kusumakumari, Vanteru [1 ]
Sherigar, Deepthi [1 ]
Chandran, Roshni [1 ]
Patil, Nagamma [1 ]
机构
[1] Natl Inst Technol Karnataka, Surathkal 575025, India
关键词
Stream data mining; Frequent item sets; GTree; CanTree; Hadoop; ITEMSETS;
D O I
10.1016/j.procs.2017.09.134
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The need for knowledge discovery from real-time stream data is continuously increasing nowadays and processing of transactions for mining patterns needs efficient data structures and algorithms. We propose a time-efficient Hadoop CanTree-GTree algorithm, using Apache Hadoop. This algorithm mines the complete frequent item sets (patterns) from real time transactions, by utilizing the sliding window technique. These are used to mine for closed frequent item sets and then, association rules are derived. It makes use of two data structures - CanTree and GTree. The results show that the Hadoop implementation of the algorithm performs 5 times better than in Java. (C) 2017 The Authors. Published by Elsevier B.V. Peer-review under responsibility of the scientific committee of the 7th International Conference on Advances in Computing & Communications.
引用
收藏
页码:266 / 273
页数:8
相关论文
共 50 条
  • [41] Constrained frequent pattern mining on univariate uncertain data
    Liu, Ying-Ho
    Wang, Chun-Sheng
    JOURNAL OF SYSTEMS AND SOFTWARE, 2013, 86 (03) : 759 - 778
  • [42] Vertical Frequent Pattern Mining from Uncertain Data
    Budhia, Bhavek P.
    Cuzzocrea, Alfredo
    Leung, Carson K.
    ADVANCES IN KNOWLEDGE-BASED AND INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, 2012, 243 : 1273 - 1282
  • [43] A Pattern Decomposition Algorithm for Data Mining of Frequent Patterns
    Zou, Qinghua
    Chu, Wesley
    Johnson, David
    Chiu, Henry
    Knowledge and Information Systems, 2002, 4 (04) : 466 - 482
  • [44] A Comparative Study of Frequent Pattern Mining with Trajectory Data
    Ding, Shiting
    Li, Zhiheng
    Zhang, Kai
    Mao, Feng
    SENSORS, 2022, 22 (19)
  • [45] Survey of the study on frequent pattern mining in data streams
    Wang, JL
    Xu, CF
    Chen, WD
    Pan, YH
    2004 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN & CYBERNETICS, VOLS 1-7, 2004, : 5917 - 5922
  • [46] Performance Evaluation of Frequent Pattern Mining Algorithms using Web Log Data for Web Usage Mining
    Gashaw, Yonas
    Liu, Fang
    2017 10TH INTERNATIONAL CONGRESS ON IMAGE AND SIGNAL PROCESSING, BIOMEDICAL ENGINEERING AND INFORMATICS (CISP-BMEI), 2017,
  • [47] Hybrid time decay model and probability decay window model for data stream closed frequent pattern mining
    Yang, Rui
    Ye, Dong
    JOURNAL OF APPLIED SCIENCE AND ENGINEERING, 2020, 23 (04): : 611 - 618
  • [48] MISFP-Growth: Hadoop-Based Frequent Pattern Mining with Multiple Item Support
    Wang, Chen-Shu
    Chang, Jui-Yen
    APPLIED SCIENCES-BASEL, 2019, 9 (10):
  • [49] Mining frequent closed itemsets using conditional frequent pattern tree
    Singh, SR
    Patra, BK
    Giri, D
    Proceedings of the IEEE INDICON 2004, 2004, : 501 - 504
  • [50] A Big Data Framework for Mining Sensor Data Using Hadoop
    El-Shafeiy, Engy A.
    El-Desouky, Ali I.
    STUDIES IN INFORMATICS AND CONTROL, 2017, 26 (03): : 365 - 376