Distributed query processing using partitioned inverted files

被引:33
|
作者
Badue, C [1 ]
Ribeiro-Neto, B [1 ]
Baeza-Yates, R [1 ]
Ziviani, N [1 ]
机构
[1] Univ Fed Minas Gerais, Dept Comp Sci, Belo Horizonte, MG, Brazil
关键词
D O I
10.1109/SPIRE.2001.989733
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we study query processing in a distributed text database. The novelty is a real distributed architecture implementation that offers concurrent query service. The distributed system adopts a network of workstations model and the client-server paradigm. The document collection is indexed with an inverted file. We adopt two distinct strategies of index partitioning in the distributed system, namely local index partitioning arid global index partitioning. In both strategies, documents are ranked using the vector space model along with a document filtering technique for fast ranking. We evaluate and compare the impact of the two index partitioning strategies on query processing performance. Experimental results on retrieval efficiency show that, within our framework, the global index partitioning outperforms the local index partitioning.
引用
收藏
页码:10 / 20
页数:11
相关论文
共 50 条
  • [1] Query processing for selection and projection using inverted partitioned indexes
    Wah, TY
    Meng, YK
    IEEE 2000 TENCON PROCEEDINGS, VOLS I-III: INTELLIGENT SYSTEMS AND TECHNOLOGIES FOR THE NEW MILLENNIUM, 2000, : 406 - 409
  • [2] Parallel search using partitioned inverted files
    MacFarlane, A
    McCann, JA
    Robertson, SE
    SPIRE 2000: SEVENTH INTERNATIONAL SYMPOSIUM ON STRING PROCESSING AND INFORMATION RETRIEVAL - PROCEEDINGS, 2000, : 209 - 220
  • [3] Load balancing distributed inverted files: Query ranking
    Gomez-Pantoja, Carlos
    Marin, Mauricio
    PROCEEDINGS OF THE 16TH EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, 2008, : 329 - 333
  • [4] Adapting partitioned continuous query processing in distributed systems
    Zhu, Yali
    Rundensteiner, Elke A.
    2007 IEEE 23RD INTERNATIONAL CONFERENCE ON DATA ENGINEERING WORKSHOP, VOLS 1-2, 2007, : 594 - 603
  • [5] Parallel methods for the update of partitioned inverted files
    MacFarlane, A.
    McCann, J. A.
    Robertson, S. E.
    ASLIB PROCEEDINGS, 2007, 59 (4-5): : 367 - 396
  • [6] Parallel methods for the generation of partitioned inverted files
    MacFarlane, A
    McCann, JA
    Robertson, SE
    ASLIB PROCEEDINGS, 2005, 57 (05): : 434 - 459
  • [7] Scheduling intersection queries in term partitioned inverted files
    Marin, Mauricio
    Gomez-Pantoja, Carlos
    Gonzalez, Senen
    Gil-Costa, Veronica
    EURO-PAR 2008 PARALLEL PROCESSING, PROCEEDINGS, 2008, 5168 : 434 - 443
  • [8] Inverted file partitioning for distributed query processing in information retrieval systems
    Srisawat, J
    Alexandridis, N
    OConnell, M
    PARALLEL AND DISTRIBUTED COMPUTING SYSTEMS - PROCEEDINGS OF THE ISCA 9TH INTERNATIONAL CONFERENCE, VOLS I AND II, 1996, : 738 - 743
  • [9] Query Processing of Pre-partitioned Data Using Sandwich Operators
    Baumann, Stephan
    Boncz, Peter
    Sattler, Kai-Uwe
    ENABLING REAL-TIME BUSINESS INTELLIGENCE, VLDB 2012, 2013, 154 : 76 - 92
  • [10] Distributed query processing using suffix arrays
    Marín, M
    Navarro, G
    STRING PROCESSING AND INFORMATION RETRIEVAL, PROCEEDINGS, 2003, 2857 : 311 - 325