Minimizing Communication Cost in Distributed Multi-query Processing

被引:0
|
作者
Li, Jian [1 ]
Deshpande, Amol [1 ]
Khuller, Samir [1 ]
机构
[1] Univ Maryland, Dept Comp Sci, College Pk, MD 20742 USA
关键词
D O I
暂无
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Increasing prevalence of large-scale distributed monitoring and computing environments such as sensor networks, scientific federations, Grids etc., has led to a renewed interest in the area of distributed query processing and optimization. In this paper we address a general, distributed multi-query processing problem motivated by the need to minimize the communication cost in these environments. Specifically we address the problem of optimally sharing data movement across the communication edges in a distributed communication network given a set of overlapping queries and query plans for them (specifying the operations to be executed). Most of the problem variations of our general problem can be shown to be NP-Hard by a reduction from the Steiner tree problem. However, we show that the problem can be solved optimally if the communication network is a tree, and present a novel algorithm for finding an optimal data movement plan. For general communication networks, we present efficient approximation algorithms for several variations of the problem. Finally, we present an experimental study over synthetic datasets showing both the need for exploiting the sharing of data movement and the effectiveness of our algorithms at finding such plans.
引用
收藏
页码:772 / 783
页数:12
相关论文
共 50 条
  • [11] Multi-Query Optimization for Complex Event Processing in SAP ESP
    Zhang, Shuhao
    Hoang Tam Vo
    Dahlmeier, Daniel
    He, Bingsheng
    2017 IEEE 33RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE 2017), 2017, : 1213 - 1224
  • [12] A multi-query optimizer for Monet
    Manegold, S
    Pellenkoft, A
    Kersten, M
    ADVANCES IN DATABASES, 2000, 1832 : 36 - 50
  • [13] Evaluating Multi-Query Sessions
    Kanoulas, Evangelos
    Carterette, Ben
    Clough, Paul D.
    Sanderson, Mark
    PROCEEDINGS OF THE 34TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR'11), 2011, : 1053 - 1062
  • [14] Pipelining in multi-query optimization
    Dalvi, NN
    Sanghai, SK
    Roy, P
    Sudarshan, S
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2003, 66 (04) : 728 - 762
  • [15] SPARQL Multi-Query Optimization
    Chen, Jiaqi
    Zhang, Fan
    Zou, Lei
    2018 17TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (IEEE TRUSTCOM) / 12TH IEEE INTERNATIONAL CONFERENCE ON BIG DATA SCIENCE AND ENGINEERING (IEEE BIGDATASE), 2018, : 1419 - 1425
  • [16] Multi-query Video Retrieval
    Wang, Zeyu
    Wu, Yu
    Narasimhan, Karthik
    Russakovsky, Olga
    COMPUTER VISION - ECCV 2022, PT XIV, 2022, 13674 : 233 - 249
  • [17] Sketch-based multi-query processing over data streams
    Dobra, A
    Garofalakis, M
    Gehrke, J
    Rastogi, R
    ADVANCES IN DATABASE TECHNOLOGY - EDBT 2004, PROCEEDINGS, 2004, 2992 : 551 - 568
  • [18] Fine-Grained Multi-Query Stream Processing on Integrated Architectures
    Zhang, Feng
    Zhang, Chenyang
    Yang, Lin
    Zhang, Shuhao
    He, Bingsheng
    Lu, Wei
    Du, Xiaoyong
    IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2021, 32 (09) : 2303 - 2320
  • [19] On Multi-Query Local Community Detection
    Bian, Yuchen
    Yan, Yaowei
    Cheng, Wei
    Wang, Wei
    Luo, Dongsheng
    Zhang, Xiang
    2018 IEEE INTERNATIONAL CONFERENCE ON DATA MINING (ICDM), 2018, : 9 - 18
  • [20] Multi-query SQL progress indicators
    Luo, Gang
    Naughton, Jeffrey F.
    Yu, Philip S.
    ADVANCES IN DATABASE TECHNOLOGY - EDBT 2006, 2006, 3896 : 921 - 941