Optimizing top-k selection queries over multimedia repositories

被引:36
|
作者
Chaudhuri, S [1 ]
Gravano, L
Marian, A
机构
[1] Microsoft Corp, Res, 1 Microsoft Way, Redmond, WA 98052 USA
[2] Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
关键词
top-k query processing; multimedia databases; information search; information retrieval;
D O I
10.1109/TKDE.2004.30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Repositories of multimedia objects having multiple types of attributes ( e. g., image, text) are becoming increasingly common. A query on these attributes will typically request not just a set of objects, as in the traditional relational query model ( filtering), but also a grade of match associated with each object, which indicates how well the object matches the selection condition ( ranking). Furthermore, unlike in the relational model, users may just want the k top-ranked objects for their selection queries for a relatively small k. In addition to the differences in the query model, another peculiarity of multimedia repositories is that they may allow access to the attributes of each object only through indexes. In this paper, we investigate how to optimize the processing of top-k selection queries over multimedia repositories. The access characteristics of the repositories and the above query model lead to novel issues in query optimization. In particular, the choice of the indexes used to search the repository strongly influences the cost of processing the filtering condition. We define an execution space that is search-minimal, i.e., the set of indexes searched is minimal. Although the general problem of picking an optimal plan in the search-minimal execution space is NP-hard, we present an efficient algorithm that solves the problem optimally with respect to our cost model and execution space when the predicates in the query are independent. We also show that the problem of optimizing top-k selection queries can be viewed, in many cases, as that of evaluating more traditional selection conditions. Thus, both problems can be viewed together as an extended filtering problem to which techniques of query processing and optimization may be adapted.
引用
收藏
页码:992 / 1009
页数:18
相关论文
共 50 条
  • [21] SRJA:A Research on Optimizing Top-k Join Queries Based on Spark
    Ren, Hui
    Fu, Haidong
    Xu, Fangfang
    Gu, Jinguang
    Zhao, Di
    PROCEEDINGS OF THE 2017 12TH IEEE CONFERENCE ON INDUSTRIAL ELECTRONICS AND APPLICATIONS (ICIEA), 2017, : 1000 - 1005
  • [22] Evaluating continuous top-k queries over document streams
    Weixiong Rao
    Lei Chen
    Shudong Chen
    Sasu Tarkoma
    World Wide Web, 2014, 17 : 59 - 83
  • [23] Evaluating Top-k Skyline queries over relational databases
    Brando, Carmen
    Goncalves, Marlene
    Gonzalez, Vanessa
    DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2007, 4653 : 254 - +
  • [24] Supporting Various Top-k Queries over Uncertain Datasets
    LI Wenfeng
    FU Zufa
    WANG Liwei
    LI Deyi
    PENG Zhiyong
    Wuhan University Journal of Natural Sciences, 2014, 19 (01) : 84 - 92
  • [25] Evaluating continuous top-k queries over document streams
    Rao, Weixiong
    Chen, Lei
    Chen, Shudong
    Tarkoma, Sasu
    WORLD WIDE WEB-INTERNET AND WEB INFORMATION SYSTEMS, 2014, 17 (01): : 59 - 83
  • [26] Upsortable: Programming Top-K Queries Over Data Streams
    Subercaze, Julien
    Gravier, Christophe
    Gillani, Syed
    Kammoun, Abderrahmen
    Laforest, Frederique
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (12): : 1873 - 1876
  • [27] Top-K data source selection for keyword queries over multiple XML data sources
    Khanh Nguyen
    Cao, Jinli
    JOURNAL OF INFORMATION SCIENCE, 2012, 38 (02) : 156 - 175
  • [28] Processing Top-k Join Queries
    Wu, Minji
    Berti-Equille, Laure
    Marian, Amelie
    Procopiuc, Cecilia M.
    Srivastava, Divesh
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2010, 3 (01): : 860 - 870
  • [29] Approximate distributed top-k queries
    Boaz Patt-Shamir
    Allon Shafrir
    Distributed Computing, 2008, 21 : 1 - 22
  • [30] Top-k Combinatorial Skyline Queries
    Su, I-Fang
    Chung, Yu-Chi
    Lee, Chiang
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT II, PROCEEDINGS, 2010, 5982 : 79 - +