GPApriori: GPU-Accelerated Frequent Itemset Mining

Cited by: 35
Authors
Zhang, Fan [1]
Zhang, Yan [1]
Bakos, Jason [1]
Affiliations
[1] Univ S Carolina, Dept Comp Sci, Columbia, SC 29208 USA
Keywords
Association rule mining; Frequent itemset mining; CUDA GPU computing; Parallel computing
DOI
10.1109/CLUSTER.2011.61
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
In this paper we describe GPApriori, a GPU-accelerated implementation of Frequent Itemset Mining (FIM). We tested our implementation on an Nvidia Tesla T10 graphics processor and demonstrate up to 100x speedup compared with several state-of-the-art FIM algorithms on a CPU. To map the Apriori algorithm onto the SIMD execution model, we designed a "static bitset" memory structure to represent the input database. This data structure improves upon the traditional vertical data layout used in state-of-the-art Apriori implementations. In our implementation, the support counting step is parallelized on the GPU. Experimental results show that GPApriori consistently outperforms CPU-based Apriori implementations. Our results demonstrate the potential of GPGPUs for speeding up data mining algorithms.
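The abstract does not spell out the layout of the "static bitset" structure, so the following is only a minimal CPU-side sketch of the general idea behind bitset-based support counting, not the authors' implementation. It assumes a vertical layout in which each item owns a fixed-length vector of 32-bit words, with one bit per transaction; the support of a candidate itemset is then the population count of the bitwise AND of its items' vectors. The names `WORD_BITS`, `build_bitsets`, and `support` are hypothetical and chosen for illustration.

```python
from typing import Dict, List, Sequence

WORD_BITS = 32  # assumed word width, for illustration only


def build_bitsets(transactions: Sequence[Sequence[str]]) -> Dict[str, List[int]]:
    """Build one fixed-length bit vector per item (vertical layout).

    Bit t of an item's vector is set when transaction t contains the item.
    """
    n_words = (len(transactions) + WORD_BITS - 1) // WORD_BITS
    bitsets: Dict[str, List[int]] = {}
    for tid, items in enumerate(transactions):
        word, bit = divmod(tid, WORD_BITS)
        for item in items:
            vec = bitsets.setdefault(item, [0] * n_words)
            vec[word] |= 1 << bit
    return bitsets


def support(bitsets: Dict[str, List[int]], itemset: Sequence[str]) -> int:
    """Count the transactions that contain every item of `itemset`."""
    if not itemset or any(item not in bitsets for item in itemset):
        return 0
    vectors = [bitsets[item] for item in itemset]
    total = 0
    # Each word position is independent of the others, which is the
    # property a data-parallel (SIMD-style) version would exploit.
    for words in zip(*vectors):
        acc = words[0]
        for w in words[1:]:
            acc &= w
        total += bin(acc).count("1")  # population count of the AND result
    return total


transactions = [
    ["a", "b", "c"],
    ["a", "c"],
    ["b", "c"],
    ["a", "b", "c"],
    ["b"],
]
bitsets = build_bitsets(transactions)
print(support(bitsets, ["a", "c"]))  # prints 3
```

Because every word of the result can be computed independently, the loop over word positions is the natural unit to distribute across GPU threads, followed by a reduction of the partial counts.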
Pages: 590-594
Number of pages: 5