Adaptive algorithms for join processing in distributed database systems

被引:2
|
作者
Scheuermann, P [1 ]
Chong, EI [1 ]
机构
[1] ORACLE CORP, NEW ENGLAND R&D CTR, NASHUA, NH 03062 USA
基金
美国国家科学基金会;
关键词
distributed query processing; join algorithms; adaptive algorithms; bipartite graphs;
D O I
10.1023/A:1008617911992
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Distributed query processing algorithms usually perform data reduction by using a semijoin program, but the problem with these approaches is that they still require an explicit join of the reduced relations in the final phase. We introduce an efficient algorithm for join processing in distributed database systems that makes use of bipartite graphs in order to reduce data communication costs and local processing costs. The bipartite graphs represent the tuples that can be joined in two relations taking also into account the reduction state of the relations. This algorithm fully reduces the relations at each site. We then present an adaptive algorithm for response time optimization that takes into account the system configuration, i.e., the additional resources available and the data characteristics, in order to select the best strategy for response time minimization. We also report on the results of a set of experiments which show that our algorithms outperform a number of the recently proposed methods for total processing time and response time minimization.
引用
收藏
页码:233 / 269
页数:37
相关论文
共 50 条
  • [21] Distributed Join Algorithms on Thousands of Cores
    Barthels, Claude
    Muller, Ingo
    Schneider, Timo
    Alonso, Gustavo
    Hoefler, Torsten
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (05): : 517 - 528
  • [22] Adaptive Query Processing in Cloud Database Systems
    Costa, Clayton Maciel
    Sousa, Antonio Luis
    2013 IEEE THIRD INTERNATIONAL CONFERENCE ON CLOUD AND GREEN COMPUTING (CGC 2013), 2013, : 201 - +
  • [24] JOIN AND SEMIJOIN ALGORITHMS FOR A MULTIPROCESSOR DATABASE MACHINE
    VALDURIEZ, P
    GARDARIN, G
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 1984, 9 (01): : 133 - 161
  • [25] Practical algorithms for tracking database join sizes
    Ganguly, S
    Kesh, D
    Saba, C
    FSTTCS 2005: FOUNDATIONS OF SOFTWARE TECHNOLOGY AND THEORETICAL COMPUTER SCIENCE, PROCEEDINGS, 2005, 3821 : 297 - 309
  • [26] An Efficient Nested Query Processing for Distributed Database Systems
    Kang, Yu-Jin
    Choi, Chi-Hawn
    Yang, Kyung-En
    Kim, Hun-Gi
    Choi, Wan-Sup
    CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, 2011, 206 : 669 - +
  • [27] Optimizing Distributed Join for Array Database System
    Li, Jing
    Li, Hui
    Chen, Mei
    Zhu, Ming
    2016 8TH INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY IN MEDICINE AND EDUCATION (ITME), 2016, : 640 - 644
  • [28] Scalable Distributed Stream Join Processing
    Lin, Qian
    Ooi, Beng Chin
    Wang, Zhengkui
    Yu, Cui
    SIGMOD'15: PROCEEDINGS OF THE 2015 ACM SIGMOD INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2015, : 811 - 825
  • [29] Fast Distributed Complex Join Processing
    Zhang, Hao
    Qiao, Miao
    Yu, Jeffrey Xu
    Cheng, Hong
    2021 IEEE 37TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2021), 2021, : 2087 - 2092
  • [30] Evolutionary Algorithms for Query Optimization in Distributed Database Systems: A review
    Ali, Zulfiqar
    Kiran, Hafiza Maria
    Shahzad, Waseem
    ADCAIJ-ADVANCES IN DISTRIBUTED COMPUTING AND ARTIFICIAL INTELLIGENCE JOURNAL, 2018, 7 (03): : 115 - 127