Optimizing top-k selection queries over multimedia repositories

被引:36
|
作者
Chaudhuri, S [1 ]
Gravano, L
Marian, A
机构
[1] Microsoft Corp, Res, 1 Microsoft Way, Redmond, WA 98052 USA
[2] Columbia Univ, Dept Comp Sci, New York, NY 10027 USA
关键词
top-k query processing; multimedia databases; information search; information retrieval;
D O I
10.1109/TKDE.2004.30
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Repositories of multimedia objects having multiple types of attributes ( e. g., image, text) are becoming increasingly common. A query on these attributes will typically request not just a set of objects, as in the traditional relational query model ( filtering), but also a grade of match associated with each object, which indicates how well the object matches the selection condition ( ranking). Furthermore, unlike in the relational model, users may just want the k top-ranked objects for their selection queries for a relatively small k. In addition to the differences in the query model, another peculiarity of multimedia repositories is that they may allow access to the attributes of each object only through indexes. In this paper, we investigate how to optimize the processing of top-k selection queries over multimedia repositories. The access characteristics of the repositories and the above query model lead to novel issues in query optimization. In particular, the choice of the indexes used to search the repository strongly influences the cost of processing the filtering condition. We define an execution space that is search-minimal, i.e., the set of indexes searched is minimal. Although the general problem of picking an optimal plan in the search-minimal execution space is NP-hard, we present an efficient algorithm that solves the problem optimally with respect to our cost model and execution space when the predicates in the query are independent. We also show that the problem of optimizing top-k selection queries can be viewed, in many cases, as that of evaluating more traditional selection conditions. Thus, both problems can be viewed together as an extended filtering problem to which techniques of query processing and optimization may be adapted.
引用
收藏
页码:992 / 1009
页数:18
相关论文
共 50 条
  • [31] Top-k spatial preference queries
    Yiu, Man Lung
    Dai, Xiangyuan
    Mamoulis, Nikos
    Vaitis, Michail
    2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2007, : 1051 - +
  • [32] Top-k queries on temporal data
    Li, Feifei
    Yi, Ke
    Le, Wangchao
    VLDB JOURNAL, 2010, 19 (05): : 715 - 733
  • [33] Top-k Dominating Queries: an introduction
    Manolopoulos, Yannis
    2015 12th IEEE International Conference on Programming and Systems (ISPS), 2015,
  • [34] Top-k queries on RDF graphs
    Wang, Dong
    Zou, Lei
    Zhao, Dongyan
    INFORMATION SCIENCES, 2015, 316 : 201 - 217
  • [35] Top-k queries on temporal data
    Feifei Li
    Ke Yi
    Wangchao Le
    The VLDB Journal, 2010, 19 : 715 - 733
  • [36] Top-k Sequenced Route Queries
    Ohsawa, Yutaka
    Htoo, Htoo
    2017 18TH IEEE INTERNATIONAL CONFERENCE ON MOBILE DATA MANAGEMENT (IEEE MDM 2017), 2017, : 320 - 323
  • [37] Continuous Top-k Dominating Queries
    Kontaki, Maria
    Papadopoulos, Apostolos N.
    Manolopoulos, Yannis
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2012, 24 (05) : 840 - 853
  • [38] Probabilistic Reverse Top-k Queries
    Jin, Cheqing
    Zhang, Rong
    Kang, Qiangqiang
    Zhang, Zhao
    Zhou, Aoying
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2014, PT I, 2014, 8421 : 406 - 419
  • [39] Top-k Dominating Queries: a Survey
    Tiakas, Eleftherios
    Papadopoulos, Apostolos N.
    Manolopoulos, Yannis
    2015 12TH IEEE INTERNATIONAL CONFERENCE ON PROGRAMMING AND SYSTEMS (ISPS), 2015,
  • [40] Geometric Approaches for Top-k Queries
    Mouratidis, Kyriakos
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2017, 10 (12): : 1985 - 1987