Minimizing Communication Cost in Distributed Multi-query Processing

被引:0
|
作者
Li, Jian [1 ]
Deshpande, Amol [1 ]
Khuller, Samir [1 ]
机构
[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Increasing prevalence of large-scale distributed monitoring and computing environments such as sensor networks, scientific federations, Grids etc., has led to a renewed interest in the area of distributed query processing and optimization. In this paper we address a general, distributed multi-query processing problem motivated by the need to minimize the communication cost in these environments. Specifically we address the problem of optimally sharing data movement across the communication edges in a distributed communication network given a set of overlapping queries and query plans for them (specifying the operations to be executed). Most of the problem variations of our general problem can be shown to be NP-Hard by a reduction from the Steiner tree problem. However, we show that the problem can be solved optimally if the communication network is a tree, and present a novel algorithm for finding an optimal data movement plan. For general communication networks, we present efficient approximation algorithms for several variations of the problem. Finally, we present an experimental study over synthetic datasets showing both the need for exploiting the sharing of data movement and the effectiveness of our algorithms at finding such plans.
引用
收藏
页码:772 / 783
页数:12
相关论文
共 50 条
  • [1] Multi-query Optimization for Distributed Similarity Query Processing
    Zhuang, Yi
    Li, Qing
    Chen, Lei
    28TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, VOLS 1 AND 2, PROCEEDINGS, 2008, : 639 - +
  • [2] Leon: A Distributed RDF Engine for Multi-query Processing
    Guo, Xintong
    Gao, Hong
    Zou, Zhaonian
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2019), PT I, 2019, 11446 : 742 - 759
  • [3] A Distributed Engine for Multi-query Processing Based on Predicates with Spark
    Zhang, Bin
    Sun, Ximin
    Bi, Liwei
    Zhao, Changhao
    Chen, Xin
    Li, Xin
    Sun, Lei
    WEB AND BIG DATA, 2021, 1505 : 27 - 36
  • [4] Multi-Query Stream Processing on FPGAs
    Sadoghi, Mohammad
    Javed, Rija
    Tarafdar, Naif
    Singh, Harsh
    Palaniappan, Rohan
    Jacobsen, Hans-Arno
    2012 IEEE 28TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2012, : 1229 - 1232
  • [5] Multi-root, multi-query processing in sensor networks
    Zhang, Zhiguo
    Kshemkalyani, Ajay
    Shatz, Sol M.
    DISTRIBUTED COMPUTING IN SENSOR SYSTEMS, 2008, 5067 : 432 - 450
  • [6] Multi-query processing of XML data streams on multicore
    Soo-Hyung Kim
    Kyong-Ha Lee
    Yoon-Joon Lee
    The Journal of Supercomputing, 2017, 73 : 2339 - 2368
  • [7] Multi-query processing of XML data streams on multicore
    Kim, Soo-Hyung
    Lee, Kyong-Ha
    Lee, Yoon-Joon
    JOURNAL OF SUPERCOMPUTING, 2017, 73 (06): : 2339 - 2368
  • [8] Multi-query optimization for on-line analytical processing
    Kalnis, P
    Papadias, D
    INFORMATION SYSTEMS, 2003, 28 (05) : 457 - 473
  • [9] Minimizing the Make Span of Diagnostic Multi-Query Graphs Using Graph Pruning and Query Merging
    Tabassam, Nadra
    Obermaisser, Roman
    2018 IEEE 23RD INTERNATIONAL CONFERENCE ON EMERGING TECHNOLOGIES AND FACTORY AUTOMATION (ETFA), 2018, : 368 - 375
  • [10] Q-Graph: Preserving Query Locality in Multi-Query Graph Processing
    Mayer, Christian
    Mayer, Ruben
    Grunert, Jonas
    Rothermel, Kurt
    Tariq, Muhammad Adnan
    GRADES-NDA '18: PROCEEDINGS OF THE 1ST ACM SIGMOD JOINT INTERNATIONAL WORKSHOP ON GRAPH DATA MANAGEMENT EXPERIENCES & SYSTEMS (GRADES) AND NETWORK DATA ANALYTICS (NDA) 2018 (GRADES-NDA 2018), 2018,