Perfect hashing schemes for mining association rules

Cited by: 18
Authors
Chang, CC [1]
Lin, CY [1]
Affiliations
[1] Natl Chung Cheng Univ, Dept Comp Sci & Informat Engn, Chaiyi 621, Taiwan
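Source
COMPUTER JOURNAL | 2005 / Vol. 48 / No. 02
Keywords
DOI
10.1093/comjnl/bxh074
Chinese Library Classification (CLC) number
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract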
Hashing schemes are widely used to speed up the mining of association rules. The DHP algorithm, for example, uses a hash table to judge the validity of candidate itemsets from the number of accesses to each bucket. However, because the hash table in DHP suffers from collisions, generating the large itemsets at each level requires two database scans, which degrades performance. In this paper we propose perfect hashing schemes that avoid collisions in the hash table. The main idea is a refined encoding scheme that transforms large itemsets into large 2-itemsets, which makes perfect hashing feasible. Our experimental results show that the new method is efficient (about three times faster than DHP) and scales as the database size increases. We also propose a variant of the perfect hashing scheme with reduced memory requirements, and we investigate and compare the properties and performance of several perfect hashing schemes.
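To illustrate why a collision-free table matters for 2-itemsets, here is a minimal sketch in Python. It is not the encoding scheme from the paper, whose details the abstract does not give. It assumes items are already numbered 0..n-1 and uses a simple triangular index as a stand-in perfect hash, so each bucket count is the exact support of one pair rather than an upper bound shared by colliding pairs. The names `pair_index`, `count_pairs`, and `frequent_pairs` are hypothetical.

```python
from itertools import combinations


def pair_index(i, j, n):
    """Map a pair (i, j) with 0 <= i < j < n to a unique slot in
    0 .. n*(n-1)//2 - 1. Distinct pairs never share a slot."""
    return i * (2 * n - i - 1) // 2 + (j - i - 1)


def count_pairs(transactions, n):
    """Count the support of every 2-itemset in one pass over the data."""
    table = [0] * (n * (n - 1) // 2)
    for t in transactions:
        items = sorted(set(t))  # drop duplicates, enforce i < j
        for i, j in combinations(items, 2):
            table[pair_index(i, j, n)] += 1
    return table


def frequent_pairs(table, n, min_support):
    """Read the large 2-itemsets straight from the table. No second
    scan is needed because each count is exact, not an upper bound."""
    return [
        (i, j)
        for i, j in combinations(range(n), 2)
        if table[pair_index(i, j, n)] >= min_support
    ]


transactions = [[0, 1, 2], [0, 2], [1, 2, 3], [0, 2, 3]]
table = count_pairs(transactions, n=4)
large_2_itemsets = frequent_pairs(table, n=4, min_support=2)
```

The table has n(n-1)/2 slots, which grows quadratically with the number of items. That cost is the kind of memory pressure a reduced-memory variant would need to address.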
Pages: 168-179
Page count: 12
Related papers
50 in total
  • [1] Perfect hashing schemes for mining association rules
    Chang, C.-C. (ccc@cs.ccu.edu.tw), 1600, Oxford University Press (48):
  • [2] Perfect hashing schemes for mining traversal patterns
    Chang, CC
    Lin, CY
    Chou, H
    FUNDAMENTA INFORMATICAE, 2006, 70 (03) : 185 - 202
  • [3] A minimal perfect hashing scheme to mining association rules from frequently updated data
    Tseng, Judy C. R.
    Hwang, Gwo-Jen
    Tsai, Wen-Fu
    JOURNAL OF THE CHINESE INSTITUTE OF ENGINEERS, 2006, 29 (03) : 391 - 401
  • [4] Mining association rules using inverted hashing and pruning
    Holt, JD
    Chung, SM
    INFORMATION PROCESSING LETTERS, 2002, 83 (04) : 211 - 220
  • [5] A New Perfect Hashing and Pruning Algorithm for Mining Association Rule
    Najadat, Hassan
    Amani, Shatnawi
    Ghadeer, Obiedat
    BUSINESS TRANSFORMATION THROUGH INNOVATION AND KNOWLEDGE MANAGEMENT: AN ACADEMIC PERSPECTIVE, VOLS 3 AND 4, 2010, : 2524 - 2531
  • [6] An algorithm for mining association rules using an efficient hashing with transaction trimming
    Senthil, Kumar A. V.
    Wahidabanu, R. S. D.
    IMECS 2007: INTERNATIONAL MULTICONFERENCE OF ENGINEERS AND COMPUTER SCIENTISTS, VOLS I AND II, 2007, : 853 - +
  • [7] Mining of association rules in text databases using Inverted Hashing and Pruning
    Holt, JD
    Chung, SM
    DATA WAREHOUSING AND KNOWLEDGE DISCOVERY, PROCEEDINGS, 2000, 1874 : 290 - 300
  • [8] Mining association rules in text databases using multipass with inverted hashing and pruning
    Holt, JD
    Chung, SM
    14TH IEEE INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2002, : 49 - 56
  • [9] Mining association rules
    Aly, HH
    Taha, Y
    Amr, A
    COMPUTERS AND THEIR APPLICATIONS, 2001, : 298 - 303
  • [10] Deduction Schemes for Association Rules
    Balcazar, Jose L.
    DISCOVERY SCIENCE, PROCEEDINGS, 2008, 5255 : 124 - 135