Minimizing Communication Cost in Distributed Multi-query Processing

被引:0
|
作者
Li, Jian [1 ]
Deshpande, Amol [1 ]
Khuller, Samir [1 ]
机构
[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Increasing prevalence of large-scale distributed monitoring and computing environments such as sensor networks, scientific federations, Grids etc., has led to a renewed interest in the area of distributed query processing and optimization. In this paper we address a general, distributed multi-query processing problem motivated by the need to minimize the communication cost in these environments. Specifically we address the problem of optimally sharing data movement across the communication edges in a distributed communication network given a set of overlapping queries and query plans for them (specifying the operations to be executed). Most of the problem variations of our general problem can be shown to be NP-Hard by a reduction from the Steiner tree problem. However, we show that the problem can be solved optimally if the communication network is a tree, and present a novel algorithm for finding an optimal data movement plan. For general communication networks, we present efficient approximation algorithms for several variations of the problem. Finally, we present an experimental study over synthetic datasets showing both the need for exploiting the sharing of data movement and the effectiveness of our algorithms at finding such plans.
引用
收藏
页码:772 / 783
页数:12
相关论文
共 50 条
  • [41] MUSE: Multi-query Event Trend Aggregation
    Rozet, Allison
    Poppe, Olga
    Lei, Chuan
    Rundensteiner, Elke A.
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 2193 - 2196
  • [42] Directions in multi-query optimization for sensor networks
    Demers, A
    Gehrke, J
    Rajaraman, R
    Trigoni, N
    Yao, Y
    ADVANCES IN PERVASIVE COMPUTING AND NETWORKING, 2005, : 179 - 196
  • [43] Multi-Query Unification for Generating Efficient Big Data Processing Components from a DFD
    Kimura, Kosaku
    Nomura, Yoshihide
    Kurihara, Hidetoshi
    Yamamoto, Koji
    Yamamoto, Rieko
    2013 IEEE SIXTH INTERNATIONAL CONFERENCE ON CLOUD COMPUTING (CLOUD 2013), 2013, : 260 - 268
  • [44] Multi-query Optimization in Federated RDF Systems
    Peng, Peng
    Zou, Lei
    Ozsu, M. Tamer
    Zhao, Dongyan
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2018, PT I, 2018, 10827 : 745 - 765
  • [45] A Vision for SPARQL Multi-Query Optimization on MapReduce
    Anyanwu, Kemafor
    2013 IEEE 29TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOPS (ICDEW), 2013, : 25 - 26
  • [46] Multi-Query Optimization via Common Sub Query Elimination for SPARQL
    Zhou, Xiaoyi
    Luo, Jie
    He, Tao
    2017 10TH INTERNATIONAL SYMPOSIUM ON COMPUTATIONAL INTELLIGENCE AND DESIGN (ISCID), VOL 2, 2017, : 213 - 218
  • [47] An efficient co-operative framework for multi-query processing over compressed XML data
    He, Juzhen
    Ng, Wilfred
    Wang, Xiaoling
    Zhou, Aoying
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PROCEEDINGS, 2006, 3882 : 218 - 232
  • [48] Hierarchical matching and reasoning for multi-query image retrieval
    Ji, Zhong
    Li, Zhihao
    Zhang, Yan
    Wang, Haoran
    Pang, Yanwei
    Li, Xuelong
    NEURAL NETWORKS, 2024, 173
  • [49] Scalable Multi-Query Execution using Reinforcement Learning
    Sioulas, Panagiotis
    Ailamaki, Anastasia
    SIGMOD '21: PROCEEDINGS OF THE 2021 INTERNATIONAL CONFERENCE ON MANAGEMENT OF DATA, 2021, : 1651 - 1663
  • [50] Multi-query optimization for sketch-based estimation
    Dobra, Alin
    Garofalakis, Minos
    Gehrke, Johannes
    Rastogi, Rajeev
    INFORMATION SYSTEMS, 2009, 34 (02) : 209 - 230