Optimised phrase querying and browsing of large text databases

被引:2
|
作者
Bahle, D [1 ]
Williams, HE [1 ]
Zobel, J [1 ]
机构
[1] RMIT Univ, Dept Comp Sci, Melbourne, Vic 3001, Australia
关键词
D O I
10.1109/ACSC.2001.906618
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Most search systems for querying large document collections-for example, web search engines-are based on well-understood information retrieval principles. These systems are both efficient and effective in finding answers to many user information needs, expressed through informal ranked or structured Boolean queries. Phrase querying and browsing are additional techniques that can augment or replace conventional querying tools. In this paper we propose optimisations for phrase querying with a nextword index, an efficient structure for phrase-based searching. We show that careful consideration of which search terms are evaluated in a query plan and optimisation of the order of evaluation of the plan can reduce query evaluation costs by more than a factor of five. We conclude that, for phrase querying and browsing with nextword indexes, an ordered query plan should be used for all browsing and querying. Moreover, we show that optimised phrase querying is practical on large text collections.
引用
收藏
页码:11 / 19
页数:9
相关论文
共 50 条
  • [1] Integrated browsing and querying for image databases
    Santini, S
    Jain, R
    IEEE MULTIMEDIA, 2000, 7 (03) : 26 - 39
  • [2] Image and video databases: Visual browsing, querying and retrieval
    DelBimbo, A
    JOURNAL OF VISUAL LANGUAGES AND COMPUTING, 1996, 7 (04): : 353 - 359
  • [3] Faceted browsing over large databases of text-annotated objects
    Dakka, Wisam
    Ipeirotis, Panagiotis G.
    Wood, Kenneth R.
    2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2007, : 1464 - +
  • [4] Querying Large Graph Databases
    Ke, Yiping
    Cheng, James
    Yu, Jeffrey Xu
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, PT II, PROCEEDINGS, 2010, 5982 : 487 - +
  • [5] Querying text databases for efficient information extraction
    Agichtein, E
    Gravano, L
    19TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2003, : 113 - 124
  • [6] Interactive Browsing of Large Image Databases
    Schaefer, Gerald
    2016 SIXTH INTERNATIONAL CONFERENCE ON DIGITAL INFORMATION PROCESSING AND COMMUNICATIONS (ICDIPC), 2016, : 168 - 170
  • [7] Visual Browsing of Large Image Databases
    Schaefer, Gerald
    ADVANCED MACHINE LEARNING TECHNOLOGIES AND APPLICATIONS, AMLTA 2014, 2014, 488 : 531 - 539
  • [8] Visualizing and browsing large image databases
    Frigui, H
    IKE '04: PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE ENGNINEERING, 2004, : 68 - 74
  • [9] Hierarchical browsing and search of large image databases
    Chen, JY
    Bouman, CA
    Dalton, JC
    IEEE TRANSACTIONS ON IMAGE PROCESSING, 2000, 9 (03) : 442 - 455
  • [10] Efficient phrase querying with common phrase index
    Chang, Matthew
    Poon, Chung Keung
    ADVANCES IN INFORMATION RETRIEVAL, 2006, 3936 : 61 - 71