TFI-Apriori: Using new encoding to optimize the apriori algorithm

Cited by: 5
Authors
Ansari, Ebrahim [1]
Sadreddini, M. H. [2]
Mirsadeghi, S. M. H. [1]
Keshtkaran, Morteza [2]
Wallace, Richard [3]
Affiliations
[1] Inst Adv Studies Basic Sci, Dept Comp Sci & Informat Technol, Zanjan, Iran
[2] Shiraz Univ, Dept Comp Sci & Engn, Shiraz, Iran
[3] Univ Complutense Madrid, Distributed Syst Architecture Res Grp, Madrid, Spain
Keywords
Frequent pattern mining; association rule mining; apriori; knowledge discovery; data mining; MINING ASSOCIATION RULES; RELATIONAL DATABASES; FREQUENT PATTERNS; EFFICIENT METHOD; TRIE
DOI
10.3233/IDA-173473
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
In this paper we propose a new optimization for Apriori-based association rule mining algorithms in which the frequency of items can be encoded and treated in a special manner, drastically increasing the efficiency of the frequent itemset mining process. An efficient algorithm, called TFI-Apriori, is developed for mining the complete set of frequent itemsets. In the preprocessing phase of the proposed algorithm, the most frequent items in the database are selected and encoded. The TFI-Apriori algorithm then uses the encoded information to decrease the number of candidate itemsets generated during mining, and consequently drastically reduces execution time in the candidate generation and support counting phases. Experimental results on real datasets (databases from applications with very frequent items) show that the proposed algorithm is an order of magnitude faster than the classical Apriori approach, with no loss in generating the complete set of frequent itemsets. Additionally, TFI-Apriori requires less memory than traditional Apriori-based algorithms, and because the optimization can be embedded in well-known implementations of the Apriori algorithm, existing processing flows can be reused.
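The abstract does not spell out the encoding itself, so the following is only a minimal sketch of the general idea, not the authors' TFI-Apriori. It shows the classical level-wise Apriori loop that the paper optimizes, followed by one possible preprocessing step that encodes the most frequent items of each transaction as a bitmask. The function names `encode_top_items` and `support_from_masks`, the parameter `top_k`, and the sample transactions are all assumptions made for illustration.

```python
from collections import Counter
from itertools import combinations


def apriori(transactions, min_support):
    """Classical level-wise Apriori; returns {frozenset: support count}."""
    transactions = [frozenset(t) for t in transactions]
    # Level 1: count single items.
    counts = Counter(item for t in transactions for item in t)
    frequent = {frozenset([i]): c for i, c in counts.items() if c >= min_support}
    result = dict(frequent)
    k = 2
    while frequent:
        prev = set(frequent)
        # Join step: union pairs of frequent (k-1)-itemsets into k-itemsets.
        candidates = {a | b for a in prev for b in prev if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in prev for s in combinations(c, k - 1))
        }
        # Support counting: one pass over the transactions per level.
        level = Counter()
        for t in transactions:
            for c in candidates:
                if c <= t:
                    level[c] += 1
        frequent = {c: n for c, n in level.items() if n >= min_support}
        result.update(frequent)
        k += 1
    return result


def encode_top_items(transactions, top_k):
    """Assumed preprocessing: one bit per item for the top_k most frequent items."""
    counts = Counter(item for t in transactions for item in set(t))
    # Sort by descending count, then by item, so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda p: (-p[1], p[0]))
    top = [item for item, _ in ranked[:top_k]]
    bit = {item: 1 << pos for pos, item in enumerate(top)}
    # Histogram of bitmasks: how many transactions share each combination.
    masks = Counter()
    for t in transactions:
        mask = 0
        for item in set(t):
            mask |= bit.get(item, 0)
        masks[mask] += 1
    return top, bit, masks


def support_from_masks(itemset, bit, masks):
    """Support of an itemset of encoded items, read from the mask histogram."""
    want = 0
    for item in itemset:
        want |= bit[item]
    return sum(n for mask, n in masks.items() if mask & want == want)


# Hypothetical sample data.
transactions = [
    ["a", "b", "c"],
    ["a", "b"],
    ["a", "c"],
    ["a", "b", "d"],
    ["b", "c"],
]
frequent = apriori(transactions, min_support=2)
top, bit, masks = encode_top_items(transactions, top_k=2)
encoded_support = support_from_masks({"a", "b"}, bit, masks)
```

In this sketch, the support of any itemset made up only of encoded items is read from the small mask histogram instead of being recounted against every transaction, which is one way an encoding of very frequent items could cut candidate generation and support counting work. How TFI-Apriori actually combines the encoded items with the remaining items is described in the paper, not here.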
Pages: 807-827
Page count: 21