TFI-Apriori: Using new encoding to optimize the apriori algorithm

被引:5
|
作者
Ansari, Ebrahim [1 ]
Sadreddini, M. H. [2 ]
Mirsadeghi, S. M. H. [1 ]
Keshtkaran, Morteza [2 ]
Wallace, Richard [3 ]
机构
[1] Inst Adv Studies Basic Sci, Dept Comp Sci & Informat Technol, Zanjan, Iran
[2] Shiraz Univ, Dept Comp Sci & Engn, Shiraz, Iran
[3] Univ Complutense Madrid, Distributed Syst Architecture Res Grp, Madrid, Spain
关键词
Frequent pattern mining; association rule mining; apriori; knowledge discovery; data mining; MINING ASSOCIATION RULES; RELATIONAL DATABASES; FREQUENT PATTERNS; EFFICIENT METHOD; TRIE;
D O I
10.3233/IDA-173473
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we propose a new optimization for Apriori-based association rule mining algorithms where the frequency of items can be encoded and treated in a special manner drastically increasing the efficiency of the frequent itemset mining process. An efficient algorithm, called TFI-Apriori, is developed for mining the complete set of frequent itemsets. In the preprocessing phase of the proposed algorithm, the most frequent items from the database are selected and encoded. The TFI-Apriori algorithm then takes advantage of the encoded information to decrease the number of candidate itemsets generated in the mining process, and consequently drastically reduces execution time in candidate generation and support counting phases. Experimental results on actual datasets - databases coming from applications with very frequent items - demonstrate how the proposed algorithm is an order of magnitude faster than the classical Apriori approach without any loss in generation of the complete set of frequent itemsets. Additionally, TFI-Apriori has a smaller memory requirement than the traditional Apriori-based algorithms and embedding this new optimization approach in well-known implementations of the Apriori algorithm allows reuse of existing processing flows.
引用
收藏
页码:807 / 827
页数:21
相关论文
共 50 条
  • [41] Improvement of Apriori Algorithm for Association Rules
    Li, Xiaohui
    2011 INTERNATIONAL CONFERENCE ON FUTURE COMPUTER SCIENCE AND APPLICATION (FCSA 2011), VOL 4, 2011, : 312 - 315
  • [42] IMPLEMENTATION OF IMPROVED APRIORI ALGORITHM IN INTERNAL
    Kumar, Vijay
    Gopal, Veni Devi
    IIOAB JOURNAL, 2016, 7 (09) : 97 - 105
  • [43] An Improved Apriori Algorithm on the Frequent Itemse
    Fang, Xiang
    PROCEEDINGS OF THE 2013 THE INTERNATIONAL CONFERENCE ON EDUCATION TECHNOLOGY AND INFORMATION SYSTEM (ICETIS 2013), 2013, 65 : 845 - 848
  • [44] Improving algorithm Apriori for data mining
    Zhang, Zhuo
    Zhang, Lu
    Zhong, Shao-Chun
    Guan, Jiwen
    COMPUTATIONAL INTELLIGENCE IN DECISION AND CONTROL, 2008, 1 : 17 - 22
  • [45] An improved Apriori algorithm based on the matrix
    Wang, Feng
    Li, Yong-hua
    FBIE: 2008 INTERNATIONAL SEMINAR ON FUTURE BIOMEDICAL INFORMATION ENGINEERING, PROCEEDINGS, 2008, : 152 - 155
  • [46] An apriori algorithm to improve teaching effectiveness
    Xu S.
    International Journal of Performability Engineering, 2020, 16 (05) : 792 - 799
  • [47] Application of Apriori Algorithm in Diagnosis of Stroke
    Zhang, Weipeng
    MECHATRONICS AND INTELLIGENT MATERIALS II, PTS 1-6, 2012, 490-495 : 2017 - 2021
  • [48] Improvement of Apriori Algorithm for Association Rules
    Li Xiaohui
    2012 WORLD AUTOMATION CONGRESS (WAC), 2012,
  • [49] An application of Apriori algorithm on a diabetic database
    Duru, N
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 1, PROCEEDINGS, 2005, 3681 : 398 - 404
  • [50] A Methodological Approach for Mining the User Requirements Using Apriori Algorithm
    Soni, Anuja
    Saxena, Anand
    Bajaj, Parul
    JOURNAL OF CASES ON INFORMATION TECHNOLOGY, 2020, 22 (04) : 1 - 30