From Theory to Practice: Efficient Join Query Evaluation in a Parallel Database System

被引:58
|
作者
Chu, Shumo [1 ]
Balazinska, Magdalena [1 ]
Suciu, Dan [1 ]
机构
[1] Univ Washington, Comp Sci & Engn, Seattle, WA 98195 USA
关键词
D O I
10.1145/2723372.2750545
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Big data analytics often requires processing complex queries using massive parallelism, where the main performance metrics is the communication cost incurred during data reshuffling. In this paper, we describe a system that can compute efficiently complex join queries, including queries with cyclic joins, on a massively parallel architecture. We build on two independent lines of work for multi-join query evaluation: a communication-optimal algorithm for distributed evaluation, and a worst-case optimal algorithm for sequential evaluation. We evaluate these algorithms together, then describe novel, practical optimizations for both algorithms.
引用
收藏
页码:63 / 78
页数:16
相关论文
共 50 条
  • [31] PARALLEL PERFORMANCE VISUALIZATION - FROM PRACTICE TO THEORY
    HEATH, MT
    MALONY, AD
    ROVER, DT
    IEEE PARALLEL & DISTRIBUTED TECHNOLOGY, 1995, 3 (04): : 44 - 60
  • [32] FROM EVALUATION THEORY INTO PRACTICE
    MERRIMAN, HO
    JOURNAL OF RESEARCH AND DEVELOPMENT IN EDUCATION, 1970, 3 (04): : 48 - 58
  • [33] Efficient Query Execution Techniques in a 4DIS Video Database System for eLearning
    Chi-wai Fung
    Elvis Wai-chung Leung
    Qing Li
    Multimedia Tools and Applications, 2003, 20 : 25 - 49
  • [34] Efficient query execution techniques in a 4DIS video database system for eLearning
    Fung, CW
    Leung, EWC
    Li, Q
    MULTIMEDIA TOOLS AND APPLICATIONS, 2003, 20 (01) : 25 - 49
  • [35] Three Approximation Algorithms for Energy-Efficient Query Dissemination in Sensor Database System
    Zhang, Zhao
    Gao, Xiaofeng
    Zhang, Xuefei
    Wu, Weili
    Xiong, Hui
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2009, 5690 : 807 - +
  • [36] Reproducible Evaluation of System Efficiency With a Model of Architecture: From Theory to Practice
    Pelcat, Maxime
    Mercat, Alexandre
    Desnos, Karol
    Maggiani, Luca
    Liu, Yanzhou
    Heulot, Julien
    Nezan, Jean-Francois
    Hamidouche, Wassim
    Menard, Daniel
    Bhattacharyya, Shuvra S.
    IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2018, 37 (10) : 2050 - 2063
  • [37] Extracting predicates from mining models for efficient query evaluation
    Chaudhuri, S
    Narasayya, V
    Sarawagi, S
    ACM TRANSACTIONS ON DATABASE SYSTEMS, 2004, 29 (03): : 508 - 544
  • [38] Query Optimization of Distributed Database Based on Parallel Genetic Algorithm and Max-Min Ant System
    Ban, Wenjiao
    Lin, Jiming
    Tong, Jichao
    Li, Shiwen
    2015 8TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2015, : 581 - 585
  • [39] From Database to Diagnosis: 'Intelligent Query', a Tool to Help With Safety Signal Evaluation
    Samal, V.
    Mahaux, O.
    DRUG SAFETY, 2018, 41 (11) : 1160 - 1160
  • [40] Grace: An Efficient Parallel SPARQL Query System over Large-Scale RDF Data
    Kang, Xiang
    Zhao, Yuying
    Yuan, Pingpeng
    Jin, Hai
    PROCEEDINGS OF THE 2021 IEEE 24TH INTERNATIONAL CONFERENCE ON COMPUTER SUPPORTED COOPERATIVE WORK IN DESIGN (CSCWD), 2021, : 769 - 774