Perfect hashing schemes for mining association rules

被引:18
|
作者
Chang, CC [1 ]
Lin, CY [1 ]
机构
[1] Natl Chung Cheng Univ, Dept Comp Sci & Informat Engn, Chaiyi 621, Taiwan
来源
COMPUTER JOURNAL | 2005年 / 48卷 / 02期
关键词
D O I
10.1093/comjnl/bxh074
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Hashing schemes are widely used to improve the performance of data mining association rules, as in the DHP algorithm that utilizes the hash table in identifying the validity of candidate itemsets according to the number of the table's bucket accesses. However, since the hash table used in DHP is plagued by the collision problem, the process of generating large itemsets at each level requires two database scans, which leads to poor performance. In this paper we propose perfect hashing schemes to avoid collisions in the hash table. The main idea is to employ a refined encoding scheme, which transforms large itemsets into large 2-itemsets and thereby makes the application of perfect hashing feasible. Our experimental results demonstrate that the new method is also efficient (about three times faster than DHP), and scalable when the database size increases. We also propose another variant of the perfect hash scheme with reduced memory requirements. The properties and performances of several perfect hashing schemes are also investigated and compared.
引用
收藏
页码:168 / 179
页数:12
相关论文
共 50 条
  • [21] Parallel mining of association rules
    Agrawal, R
    Shafer, JC
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1996, 8 (06) : 962 - 969
  • [22] Mining influential association rules
    Zhang, X
    Chen, Z
    Zhu, Q
    PROCEEDINGS OF THE 6TH JOINT CONFERENCE ON INFORMATION SCIENCES, 2002, : 490 - 493
  • [23] A framework for mining association rules
    Luo, J
    Rajasekaran, S
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 4, PROCEEDINGS, 2005, 3684 : 509 - 517
  • [24] Mining Causal Association Rules
    Li, Jiuyong
    Thuc Duy Le
    Liu, Lin
    Liu, Jixue
    Jin, Zhou
    Sun, Bingyu
    2013 IEEE 13TH INTERNATIONAL CONFERENCE ON DATA MINING WORKSHOPS (ICDMW), 2013, : 114 - 123
  • [25] Association rules mining algorithm
    Bhowmik, R
    Proceedings of the ISCA 20th International Conference on Computers and Their Applications, 2005, : 86 - 90
  • [26] Mining association rules in IDS
    Yin Jing-tao
    Lv Meng-ya
    PROCEEDINGS OF 2006 CHINESE CONTROL AND DECISION CONFERENCE, 2006, : 843 - 846
  • [27] Mining Consumption Association Rules
    Yen, Show-Jane
    Wang, Chiu-Kuang
    Lee, Yue-Shi
    JOURNAL OF INFORMATION SCIENCE AND ENGINEERING, 2016, 32 (02) : 271 - 285
  • [28] Mining spatial association rules
    Bembenik, R
    Protaziuk, G
    INTELLIGENT INFORMATION PROCESSING AND WEB MINING, 2004, : 3 - 12
  • [29] Mining weighted association rules
    Lu, Songfeng
    Hu, Heping
    Li, Fan
    Intelligent Data Analysis, 2001, 5 (03) : 211 - 225
  • [30] Private mining of association rules
    Zhan, J
    Matwin, S
    Chang, LW
    INTELLIGENCE AND SECURITY INFORMATICS, PROCEEDINGS, 2005, 3495 : 72 - 80