Optimised phrase querying and browsing of large text databases

被引:2
|
作者
Bahle, D [1 ]
Williams, HE [1 ]
Zobel, J [1 ]
机构
[1] RMIT Univ, Dept Comp Sci, Melbourne, Vic 3001, Australia
关键词
D O I
10.1109/ACSC.2001.906618
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
Most search systems for querying large document collections-for example, web search engines-are based on well-understood information retrieval principles. These systems are both efficient and effective in finding answers to many user information needs, expressed through informal ranked or structured Boolean queries. Phrase querying and browsing are additional techniques that can augment or replace conventional querying tools. In this paper we propose optimisations for phrase querying with a nextword index, an efficient structure for phrase-based searching. We show that careful consideration of which search terms are evaluated in a query plan and optimisation of the order of evaluation of the plan can reduce query evaluation costs by more than a factor of five. We conclude that, for phrase querying and browsing with nextword indexes, an ordered query plan should be used for all browsing and querying. Moreover, we show that optimised phrase querying is practical on large text collections.
引用
收藏
页码:11 / 19
页数:9
相关论文
共 50 条
  • [21] Querying Databases with Taxonomies
    Martinenghi, Davide
    Torlone, Riccardo
    CONCEPTUAL MODELING - ER 2010, 2010, 6412 : 377 - +
  • [22] On querying ontologies and databases
    Bulskov, H
    Knappe, R
    Andreasen, T
    FLEXIBLE QUERY ANSWERING SYSTEMS, PROCEEDINGS, 2004, 3055 : 191 - 202
  • [23] Querying multidimensional databases
    Cabibbo, L
    Torlone, R
    DATABASE PROGRAMMING LANGUAGES, 1998, 1369 : 319 - 335
  • [24] Fast phrase querying with combined indexes
    Williams, HE
    Zobel, J
    Bahle, D
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2004, 22 (04) : 573 - 594
  • [25] QUERYING OBJECT DATABASES
    LOOMIS, MES
    JOURNAL OF OBJECT-ORIENTED PROGRAMMING, 1994, 7 (03): : 56 - &
  • [26] Querying XML Databases
    de Sousa, AA
    Pereira, JL
    Carvalho, JA
    XXII INTERNATIONAL CONFERENCE OF THE CHILEAN COMPUTER SCIENCE SOCIETY, PROCEEDINGS, 2002, : 142 - 150
  • [27] QUERYING LOGICAL DATABASES
    VARDI, MY
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1986, 33 (02) : 142 - 160
  • [28] Browsing and querying multimedia report collections
    Consorti, F
    Merialdo, P
    Sindoni, G
    MEDICAL INFORMATICS EUROPE '97: PARTS A & B, 1997, 43 : 401 - 405
  • [29] Querying inconsistent databases
    Greco, S
    Zumpano, E
    LOGIC FOR PROGRAMMING AND AUTOMATED REASONING, PROCEEDINGS, 2000, 1955 : 308 - 325
  • [30] QUERYING INDEPENDENT DATABASES
    BUNEMAN, OP
    DAVIDSON, SB
    WATTERS, A
    INFORMATION SCIENCES, 1990, 52 (01) : 1 - 34