Parallel mining of maximal frequent itemsets from databases

被引:4
|
作者
Chung, SM [1 ]
Luo, C [1 ]
机构
[1] Wright State Univ, Dept Comp Sci & Engn, Dayton, OH 45435 USA
关键词
parallel data mining; maximal frequent itemsets; association rules; scalability;
D O I
10.1109/TAI.2003.1250181
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we propose a parallel algorithm for mining maximal frequent itemsets from databases. A frequent itemset is maximal if none of its supersets is frequent. The new parallel algorithm is named Parallel Max-Miner (PMM), and it is a parallel version of the sequential Max-Miner algorithm [3]. Most of existing mining algorithms discover the frequent k-itemsets on the kth pass over the databases, and then generate the candidate (k + 1)-itemsets for the next pass. Compared to those level-wise algorithms, PMM looks ahead at each pass and prunes more candidate itemsets by checking the frequences of their supersets. We implemented PMM on a cluster of workstations, and evaluated its performance for various cases. PMM demonstrated better performance than other sequential and parallel algorithms, and its performance is quite scalable, even when there are large maximal frequent itemsets (i.e., long patterns) in databases.
引用
收藏
页码:134 / 139
页数:6
相关论文
共 50 条
  • [21] Efficient mining of frequent itemsets in distorted databases
    Wang, Jinlong
    Xu, Congfu
    AI 2006: ADVANCES IN ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2006, 4304 : 352 - +
  • [22] EFFICIENTLY MINING FREQUENT ITEMSETS IN TRANSACTIONAL DATABASES
    Alghyaline, Salah
    Hsieh, Jun-Wei
    Lai, Jim Z. C.
    JOURNAL OF MARINE SCIENCE AND TECHNOLOGY-TAIWAN, 2016, 24 (02): : 184 - 191
  • [23] P-MinMax: A novel parallel algorithm for mining maximal frequent itemsets
    Wang, Hui
    Qu, Qiang
    INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS/INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL II, 2007, : 224 - +
  • [24] Parallel algorithm for mining frequent itemsets
    Ruan, YL
    Liu, G
    Li, QH
    Proceedings of 2005 International Conference on Machine Learning and Cybernetics, Vols 1-9, 2005, : 2118 - 2121
  • [25] Mining Frequent Itemsets in Correlated Uncertain Databases
    Tong, Yong-Xin
    Chen, Lei
    She, Jieying
    JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2015, 30 (04) : 696 - 712
  • [26] Mining for contiguous frequent itemsets in transaction databases
    Berberidis, Christos
    Tzanis, George
    Vlahavas, Ioannis
    2005 IEEE INTELLIGENT DATA ACQUISITION AND ADVANCED COMPUTING SYSTEMS: TECHNOLOGY AND APPLICATIONS, 2005, : 679 - 685
  • [27] Mining maximal frequent itemsets for intrusion detection
    Wang, H
    Li, QH
    Xiong, HY
    Jiang, SY
    GRID AND COOPERATIVE COMPUTING GCC 2004 WORKSHOPS, PROCEEDINGS, 2004, 3252 : 422 - 429
  • [28] A New Method for Mining Maximal Frequent Itemsets
    Nadimi-Shahraki, Mohammad
    Mustapha, Norwati
    Sulaiman, Md Nasir B.
    Mamat, Ali B.
    INTERNATIONAL SYMPOSIUM OF INFORMATION TECHNOLOGY 2008, VOLS 1-4, PROCEEDINGS: COGNITIVE INFORMATICS: BRIDGING NATURAL AND ARTIFICIAL KNOWLEDGE, 2008, : 1064 - 1067
  • [29] Mining maximal frequent itemsets in uncertain data
    Tang, Xianghong
    Yang, Quanwei
    Zheng, Yang
    Huazhong Keji Daxue Xuebao (Ziran Kexue Ban)/Journal of Huazhong University of Science and Technology (Natural Science Edition), 2015, 43 (09): : 29 - 34
  • [30] Scalable algorithm for mining maximal frequent itemsets
    Li, QH
    Wang, H
    He, Y
    Jiang, SY
    2003 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-5, PROCEEDINGS, 2003, : 143 - 146