Optimizing top-k selection queries over multimedia repositories

被引:36
|
作者
Chaudhuri, S [1 ]
Gravano, L
Marian, A
机构
[1] Microsoft Corp, Res, 1 Microsoft Way, Redmond, WA 98052 USA
[2] Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
关键词
top-k query processing; multimedia databases; information search; information retrieval;
D O I
10.1109/TKDE.2004.30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Repositories of multimedia objects having multiple types of attributes ( e. g., image, text) are becoming increasingly common. A query on these attributes will typically request not just a set of objects, as in the traditional relational query model ( filtering), but also a grade of match associated with each object, which indicates how well the object matches the selection condition ( ranking). Furthermore, unlike in the relational model, users may just want the k top-ranked objects for their selection queries for a relatively small k. In addition to the differences in the query model, another peculiarity of multimedia repositories is that they may allow access to the attributes of each object only through indexes. In this paper, we investigate how to optimize the processing of top-k selection queries over multimedia repositories. The access characteristics of the repositories and the above query model lead to novel issues in query optimization. In particular, the choice of the indexes used to search the repository strongly influences the cost of processing the filtering condition. We define an execution space that is search-minimal, i.e., the set of indexes searched is minimal. Although the general problem of picking an optimal plan in the search-minimal execution space is NP-hard, we present an efficient algorithm that solves the problem optimally with respect to our cost model and execution space when the predicates in the query are independent. We also show that the problem of optimizing top-k selection queries can be viewed, in many cases, as that of evaluating more traditional selection conditions. Thus, both problems can be viewed together as an extended filtering problem to which techniques of query processing and optimization may be adapted.
引用
收藏
页码:992 / 1009
页数:18
相关论文
共 50 条
  • [41] Approximate distributed top-k queries
    Patt-Shamir, Boaz
    Shafrir, Allon
    DISTRIBUTED COMPUTING, 2008, 21 (01) : 1 - 22
  • [42] Answering Top-k Queries Over a Mixture of Attractive and Repulsive Dimensions
    Ranu, Sayan
    Singh, Ambuj K.
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2011, 5 (03): : 169 - 180
  • [43] Evaluating Top-k queries over web-accessible Databases
    Bruno, N
    Gravano, L
    Marian, A
    18TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2002, : 369 - +
  • [44] SAP: Improving Continuous Top-K Queries Over Streaming Data
    Zhu, Rui
    Wang, Bin
    Yang, Xiaochun
    Zheng, Baihua
    Wang, Guoren
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2017, 29 (06) : 1310 - 1328
  • [45] A Cost-Based Range Estimation for Mapping Top-k Selection Queries over Relational Databases
    Ayanso, Anteneh
    Goes, Paulo B.
    Mehta, Kumar
    JOURNAL OF DATABASE MANAGEMENT, 2009, 20 (04) : 1 - 25
  • [46] FedTopK: Top-K Queries Optimization over Federated RDF Systems
    Ge, Ningchao
    Qin, Zheng
    Peng, Peng
    Zou, Lei
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT III, 2021, 12683 : 595 - 599
  • [47] Distributed probabilistic top-k dominating queries over uncertain databases
    Niranjan Rai
    Xiang Lian
    Knowledge and Information Systems, 2023, 65 : 4939 - 4965
  • [48] Distributed probabilistic top-k dominating queries over uncertain databases
    Rai, Niranjan
    Lian, Xiang
    KNOWLEDGE AND INFORMATION SYSTEMS, 2023, 65 (11) : 4939 - 4965
  • [49] Answering Top-k Queries over Outsourced Sensitive Data in the Cloud
    Mahboubi, Sakina
    Akbarinia, Reza
    Valduriez, Patrick
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, DEXA 2018, PT I, 2018, 11029 : 218 - 231
  • [50] Top-k Closest Pair Queries over Spatial Knowledge Graph
    Wu, Fangwei
    Xie, Xike
    Shi, Jieming
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS (DASFAA 2021), PT I, 2021, 12681 : 625 - 640