An Efficient Framework for Mining Association Rules in the Distributed Databases

被引:4
|
作者
Goyal, Lalit Mohan [1 ]
Beg, M. M. Sufyan [2 ]
Ahmad, Tanvir [3 ]
机构
[1] Bharati Vidyapeeths Coll Engn, Comp Sci & Engn, New Delhi, India
[2] Aligarh Muslim Univ, Fac Engn & Technol, Comp Engn, Aligarh, Uttar Pradesh, India
[3] Jamia Millia Islamia, Comp Engn, New Delhi, India
来源
COMPUTER JOURNAL | 2018年 / 61卷 / 05期
关键词
distributed mining; distributed algorithms; distributed database; frequent itemsets;
D O I
10.1093/comjnl/bxx067
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
While mining the association rules in distributed database, overhead increases at each site because of linkage and dependency with other sites. Each site scans database not only for itself but for the neighboring sites also. In the most popular Count Distribution (CD) and Fast Distributed Mining (FDM) algorithms, sites generate and scan the identical candidate itemsets. In the CD algorithm, sites generate candidate k + 1 itemsets using global frequent k-itemsets and in the FDM algorithm, sites generate using its own and neighboring sites heavy frequent k-itemsets. Most of the itemsets scanned by the CD algorithm are infrequent. These infrequent itemsets are not scanned in the FDM algorithm. Anyhow, in the FDM algorithm, some of the itemsets may be found frequent on neither of the sites but scanned on all the sites. In this paper, an efficient framework and an algorithm have been proposed for mining association rules in the distributed database. In the proposed framework, initially, overhead of each site for generating and scanning candidate itemsets for the neighboring sites is reduced. Later, a site either does not scan candidate k-itemset of neighboring site or postpone till its k + 1 itemsets are scanned.
引用
收藏
页码:645 / 657
页数:13
相关论文
共 50 条
  • [31] An efficient graph-based approach to mining association rules for large databases
    Department of Computer Science and Engineering, National Sun Yat-Sen University, No. 70 Lienhai Rd., Kaohsiung 80424, Taiwan
    Int. J. Intell. Inf. Database Syst., 2009, 3 (259-274): : 259 - 274
  • [32] A framework for mining association rules
    Luo, J
    Rajasekaran, S
    KNOWLEDGE-BASED INTELLIGENT INFORMATION AND ENGINEERING SYSTEMS, PT 4, PROCEEDINGS, 2005, 3684 : 509 - 517
  • [33] An Agent-Based Framework for Association Rules Mining of Distributed Data
    Hu, Gongzhu
    Ding, Shaozhen
    SOFTWARE ENGINEERING RESEARCH, MANAGEMENT AND APPLICATIONS 2009, 2009, 253 : 13 - 26
  • [34] Association rules mining in vertically partitioned databases
    Rozenberg, Boris
    Gudes, Ehud
    DATA & KNOWLEDGE ENGINEERING, 2006, 59 (02) : 378 - 396
  • [35] Algorithms for mining association rules in bag databases
    Hsu, PY
    Chen, YL
    Ling, CC
    INFORMATION SCIENCES, 2004, 166 (1-4) : 31 - 47
  • [36] Mining association rules from biological databases
    Rodríguez, A
    Carazo, JM
    Trelles, O
    JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY, 2005, 56 (05): : 493 - 504
  • [37] Mining gene expression databases for association rules
    Creighton, C
    Hanash, S
    BIOINFORMATICS, 2003, 19 (01) : 79 - 86
  • [38] Mining fuzzy association rules in incomplete databases
    Arotaritei, D
    PROCEEDINGS OF THE 2002 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOL 1 & 2, 2002, : 267 - 271
  • [39] Algorithms for mining association rules in image databases
    Gao, Li
    Dai, Shangping
    Zhu, Changwu
    Zheng, Shijue
    DCABES 2007 Proceedings, Vols I and II, 2007, : 805 - 807
  • [40] Mining spatial association rules in image databases
    Lee, Anthony J. T.
    Hong, Ruey-Wen
    Ko, Wei-Min
    Tsao, Wen-Kwang
    Lin, Hsiu-Hui
    INFORMATION SCIENCES, 2007, 177 (07) : 1593 - 1608