From Theory to Practice: Efficient Join Query Evaluation in a Parallel Database System

Cited by: 58
Authors
Chu, Shumo [1]
Balazinska, Magdalena [1]
Suciu, Dan [1]
Affiliations
[1] Univ Washington, Comp Sci & Engn, Seattle, WA 98195 USA
Keywords
DOI
10.1145/2723372.2750545
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Big data analytics often requires processing complex queries using massive parallelism, where the main performance metric is the communication cost incurred during data reshuffling. In this paper, we describe a system that can efficiently compute complex join queries, including queries with cyclic joins, on a massively parallel architecture. We build on two independent lines of work for multi-join query evaluation: a communication-optimal algorithm for distributed evaluation, and a worst-case optimal algorithm for sequential evaluation. We evaluate these algorithms together, then describe novel, practical optimizations for both algorithms.
Pages: 63-78
Page count: 16
Related papers
50 in total
  • [21] Performance evaluation of parallel query processing techniques in object-oriented database
    Wang, YJ
    Wang, YJ
    Hu, SR
    CHINESE JOURNAL OF ELECTRONICS, 2000, 9 (02): : 224 - 228
  • [22] Parallel processing of Multi-Join Expansion_Aggregate data cube query in high performance database systems
    Taniar, D
    Tan, RBN
    I-SPAN'02: INTERNATIONAL SYMPOSIUM ON PARALLEL ARCHITECTURES, ALGORITHMS AND NETWORKS, PROCEEDINGS, 2002, : 51 - 56
  • [23] A parallel query processing system based on graph-based database partitioning
    Nam, Yoon-Min
    Han, Donghyoung
    Kim, Min-Soo
    INFORMATION SCIENCES, 2019, 480 : 237 - 260
  • [24] Join query optimization in the distributed database system using an artificial bee colony algorithm and genetic operators
    Panahi, Vahideh
    Navimipour, Nima Jafari
    CONCURRENCY AND COMPUTATION-PRACTICE & EXPERIENCE, 2019, 31 (17):
  • [25] Energy-Efficient Query Management Scheme for a Wireless Sensor Database System
    Guofang Nan
    Minqiang Li
    EURASIP Journal on Wireless Communications and Networking, 2010
  • [26] Energy-Efficient Query Management Scheme for a Wireless Sensor Database System
    Nan, Guofang
    Li, Minqiang
    EURASIP JOURNAL ON WIRELESS COMMUNICATIONS AND NETWORKING, 2010
  • [27] Scatter-Gather-Merge: An efficient star-join query processing algorithm for data-parallel frameworks
    Han, Hyuck
    Jung, Hyungsoo
    Eom, Hyeonsang
    Yeom, Heon Y.
    CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2011, 14 (02): : 183 - 197
  • [28] Scatter-Gather-Merge: An efficient star-join query processing algorithm for data-parallel frameworks
    Hyuck Han
    Hyungsoo Jung
    Hyeonsang Eom
    Heon Y. Yeom
    Cluster Computing, 2011, 14 : 183 - 197
  • [29] AN AUTOMATED ATHLETE PERFORMANCE EVALUATION SYSTEM From Theory to Practice
    Silva, Hugo
    Martins, Goncalo
    Palma, Susana
    Mil-Homens, Pedro
    Valamatos, Maria
    BIODEVICES 2009: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON BIOMEDICAL ELECTRONICS AND DEVICES, 2009, : 239 - +
  • [30] Design and performance evaluation of the system architecture in a parallel database system: SPAX
    Kim, YK
    Park, YM
    Jin, SI
    Park, J
    INTERNATIONAL SOCIETY FOR COMPUTERS AND THEIR APPLICATIONS 13TH INTERNATIONAL CONFERENCE ON COMPUTERS AND THEIR APPLICATIONS, 1998, : 198 - 201