Processing Top-k Join Queries

被引:14
|
作者
Wu, Minji [1 ]
Berti-Equille, Laure [2 ]
Marian, Amelie [1 ]
Procopiuc, Cecilia M. [3 ]
Srivastava, Divesh [3 ]
机构
[1] Rutgers State Univ, New Brunswick, NJ USA
[2] Univ Rennes 1, Rennes, France
[3] AT&T Labs Res, Florham Pk, NJ USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2010年 / 3卷 / 01期
关键词
D O I
10.14778/1920841.1920951
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the problem of efficiently finding the top-k answers for join queries over web-accessible databases. Classical algorithms for finding top-k answers use branch-and-bound techniques to avoid computing scores of all candidates in identifying the top-k answers. To be able to apply such techniques, it is critical to efficiently compute (lower and upper) bounds and expected scores of candidate answers in an incremental fashion during the evaluation. In this paper, we describe novel techniques for these problems. The first contribution of this paper is a method to efficiently compute bounds for the score of a query result when tuples in tables from the "FROM" clause are discovered incrementally, through either sorted or random access. Our second contribution is an algorithm that, given a set of partially evaluated candidate answers, determines a good order in which to access the tables to minimize wasted efforts in the computation of top-k answers. We evaluate our algorithms on a variety of queries and data sets and demonstrate the significant benefits they provide.
引用
收藏
页码:860 / 870
页数:11
相关论文
共 50 条
  • [31] Approximate distributed top-k queries
    Boaz Patt-Shamir
    Allon Shafrir
    Distributed Computing, 2008, 21 : 1 - 22
  • [32] Top-k Combinatorial Skyline Queries
    Su, I-Fang
    Chung, Yu-Chi
    Lee, Chiang
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT II, PROCEEDINGS, 2010, 5982 : 79 - +
  • [33] Evaluating top-k selection queries
    Chaudhuri, S
    Gravano, L
    PROCEEDINGS OF THE TWENTY-FIFTH INTERNATIONAL CONFERENCE ON VERY LARGE DATA BASES, 1999, : 399 - 410
  • [34] A top-k spatial join querying processing algorithm based on spark
    Qiao, Baiyou
    Hu, Bing
    Zhu, Junhai
    Wu, Gang
    Giraud-Carrier, Christophe
    Wang, Guoren
    INFORMATION SYSTEMS, 2020, 87
  • [35] Top-k spatial preference queries
    Yiu, Man Lung
    Dai, Xiangyuan
    Mamoulis, Nikos
    Vaitis, Michail
    2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2007, : 1051 - +
  • [36] Optimizing Distributed Top-k Queries
    Neumann, Thomas
    Bender, Matthias
    Michel, Sebastian
    Schenkel, Ralf
    Triantafillou, Peter
    Weikum, Gerhard
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2008, PROCEEDINGS, 2008, 5175 : 337 - +
  • [37] Top-k queries on temporal data
    Li, Feifei
    Yi, Ke
    Le, Wangchao
    VLDB JOURNAL, 2010, 19 (05): : 715 - 733
  • [38] Top-k Dominating Queries: an introduction
    Manolopoulos, Yannis
    2015 12th IEEE International Conference on Programming and Systems (ISPS), 2015,
  • [39] Top-k queries on RDF graphs
    Wang, Dong
    Zou, Lei
    Zhao, Dongyan
    INFORMATION SCIENCES, 2015, 316 : 201 - 217
  • [40] Top-k queries on temporal data
    Feifei Li
    Ke Yi
    Wangchao Le
    The VLDB Journal, 2010, 19 : 715 - 733