An Efficient Framework for Mining Association Rules in the Distributed Databases

被引:4
|
作者
Goyal, Lalit Mohan [1 ]
Beg, M. M. Sufyan [2 ]
Ahmad, Tanvir [3 ]
机构
[1] Bharati Vidyapeeths Coll Engn, Comp Sci & Engn, New Delhi, India
[2] Aligarh Muslim Univ, Fac Engn & Technol, Comp Engn, Aligarh, Uttar Pradesh, India
[3] Jamia Millia Islamia, Comp Engn, New Delhi, India
来源
COMPUTER JOURNAL | 2018年 / 61卷 / 05期
关键词
distributed mining; distributed algorithms; distributed database; frequent itemsets;
D O I
10.1093/comjnl/bxx067
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
While mining the association rules in distributed database, overhead increases at each site because of linkage and dependency with other sites. Each site scans database not only for itself but for the neighboring sites also. In the most popular Count Distribution (CD) and Fast Distributed Mining (FDM) algorithms, sites generate and scan the identical candidate itemsets. In the CD algorithm, sites generate candidate k + 1 itemsets using global frequent k-itemsets and in the FDM algorithm, sites generate using its own and neighboring sites heavy frequent k-itemsets. Most of the itemsets scanned by the CD algorithm are infrequent. These infrequent itemsets are not scanned in the FDM algorithm. Anyhow, in the FDM algorithm, some of the itemsets may be found frequent on neither of the sites but scanned on all the sites. In this paper, an efficient framework and an algorithm have been proposed for mining association rules in the distributed database. In the proposed framework, initially, overhead of each site for generating and scanning candidate itemsets for the neighboring sites is reduced. Later, a site either does not scan candidate k-itemset of neighboring site or postpone till its k + 1 itemsets are scanned.
引用
收藏
页码:645 / 657
页数:13
相关论文
共 50 条
  • [41] An Adaptive Privacy Preserving Framework for Distributed Association Rule Mining in Healthcare Databases
    Kuba, Hasanien K.
    Azzawi, Mustafa A.
    Darwish, Saad M.
    Hassen, Oday A.
    Abdulhussein, Ansam A.
    CMC-COMPUTERS MATERIALS & CONTINUA, 2023, 74 (02): : 4119 - 4133
  • [42] Efficient Mining of Association Rules based on Clustering from Distributed Data
    Bouraoui, Marwa
    Touzi, Amel Grissa
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2019, 10 (04) : 401 - 409
  • [43] Algorithm for Finding Association Rules in Distributed Databases
    Bhatnagar, Surbhi
    2012 2ND IEEE INTERNATIONAL CONFERENCE ON PARALLEL, DISTRIBUTED AND GRID COMPUTING (PDGC), 2012, : 915 - 920
  • [44] Mining of association rules in distributed database
    Li, Shijun
    Zheng, Peng
    Zhou, Dongru
    Wuhan Shuili Dianli Daxue Xuebao/Journal of Wuhan University of Hydraulic and Electric Engineering, 1999, 32 (06): : 91 - 93
  • [45] Mining Association Rules in Distributed System
    Li, Zou
    Xu, Liang
    PROCEEDINGS OF THE FIRST INTERNATIONAL WORKSHOP ON EDUCATION TECHNOLOGY AND COMPUTER SCIENCE, VOL II, 2009, : 1051 - 1054
  • [46] Computationally efficient mining for fuzzy implication-based association rules in quantitative databases
    Chen, GQ
    Yan, P
    Kerre, EE
    INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 2004, 33 (2-3) : 163 - 182
  • [47] An Efficient Approach for Mining Positive and Negative Association Rules from Large Transactional Databases
    Kishor, Peddi
    Porika, Sammulal
    2016 INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT), VOL 1, 2016, : 85 - 89
  • [48] An efficient algorithm for mining quantitative association rules to raise reliance of data in large databases
    Lee, HJ
    Park, WH
    Park, DS
    DESIGN AND APPLICATION OF HYBRID INTELLIGENT SYSTEMS, 2003, 104 : 672 - 681
  • [49] Mining interesting association rules from customer databases and transaction databases
    Tsai, PSM
    Chen, CM
    INFORMATION SYSTEMS, 2004, 29 (08) : 685 - 696
  • [50] Multipass Algorithms for Mining Association Rules in Text Databases
    John D. Holt
    Soon M. Chung
    Knowledge and Information Systems, 2001, 3 (2) : 168 - 183