DIASPORA: A highly distributed web-query processing system

被引:0
|
作者
Ramanath M. [1 ]
Haritsa J.R. [1 ,2 ]
机构
[1] Supercomputer Education and Research Centre, Indian Institute of Science, Bangalore
[2] Database Systems Research Department, Lucent Bell Labs, 600 Mountain Avenue, Murray Hill, 07974, NJ
关键词
Query Language; Query Processing; Query Processor; Result Graph; Semistructured Data;
D O I
10.1023/A:1019233713818
中图分类号
学科分类号
摘要
Current proposals for web querying systems have assumed a centralized processing architecture wherein data is shipped from the remote sites to the user's site. We present here the design and implementation of DIASPORA, a highly distributed query processing system for the web. It is based on the premise that several web applications are more naturally processed in a distributed manner, opening up possibilities of significant reductions in network traffic and user response times. DIASPORA is built over an expressive graph-based data model that utilizes simple heuristics and lends itself to automatic generation. The model captures both the content of web documents and the hyperlink structural framework of a web site. Distributed queries on the model are expressed through a declarative language that permits users to explicitly specify navigation. DIASPORA implements a query-shipping model wherein queries are autonomously forwarded from one web-site to another, without requiring much coordination from the query originating site. Its design addresses a variety of interesting issues that arise in the distributed web context including determining query completion, handling query rewriting, supporting query termination and preventing multiple computations of a query at a site due to the same query arriving through different paths in the hyperlink framework. The DIASPORA system is currently operational and is undergoing testing on our campus network. In this paper we describe the design of the system and report initial performance results that indicate significant performance improvements over comparable centralized approaches. © 2000, Kluwer Academic Publishers.
引用
收藏
页码:111 / 124
页数:13
相关论文
共 50 条
  • [1] Improving web-query processing through semantic knowledge
    Conesa, Jordi
    Storey, Veda C.
    Sugumaran, Vijayan
    DATA & KNOWLEDGE ENGINEERING, 2008, 66 (01) : 18 - 34
  • [2] Query enrichment for Web-query classification
    Shen, Dou
    Pan, Rong
    Sun, Jian-Tao
    Pan, Jeffrey Junfeng
    Wu, Kangheng
    Yin, Jie
    Yang, Qiang
    ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2006, 24 (03) : 320 - 352
  • [3] Query Processing in Highly Distributed Environments
    Kawaguchi, Akira
    Ha, Nguyen Viet
    Tsuru, Masato
    Mowshowitz, Abbe
    Shibata, Masahiro
    ADVANCES IN INTELLIGENT NETWORKING AND COLLABORATIVE SYSTEMS (INCOS-2021), 2022, 312 : 283 - 294
  • [4] Web-based distributed XML query processing
    Smiljanic, M
    Feng, L
    Jonker, W
    INTELLIGENT SEARCH ON XML DATA: APPLICATIONS, LANGUAGES, MODELS IMPLEMENTATIONS AND BENCHMARKS, 2003, 2818 : 207 - 216
  • [5] Distributed System for Query Processing with Grid Authentication
    Atanassov, E. I.
    Georgiev, D.
    Gurov, T.
    Karaivanova, A.
    Nikolova, Y.
    LARGE-SCALE SCIENTIFIC COMPUTING, LSSC 2013, 2014, 8353 : 467 - 475
  • [6] A Study of the Impact of Index Updates on Distributed Query Processing for Web Search
    Sarigiannis, Charalampos
    Plachouras, Vassilis
    Baeza-Yates, Ricardo
    ADVANCES IN INFORMATION RETRIEVAL, PROCEEDINGS, 2009, 5478 : 595 - +
  • [7] FACT: A learning based web query processing system
    Chen, ST
    Diao, YL
    Lu, HJ
    Tian, ZP
    SIGMOD RECORD, 2000, 29 (02) : 587 - 587
  • [8] Algorithms for query processing in a distributed knowledge integration system
    Goczyla, Krzysztof
    Zawadzka, Teresa
    PROCEEDINGS OF THE 2008 1ST INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, 2008, : 27 - 30
  • [9] DISTRIBUTED QUERY-PROCESSING IN A MULTIPLE DATABASE SYSTEM
    CHEN, ALP
    BRILL, D
    TEMPLETON, M
    YU, CT
    IEEE JOURNAL ON SELECTED AREAS IN COMMUNICATIONS, 1989, 7 (03) : 390 - 398
  • [10] A query processing algorithm for a system of heterogeneous distributed databases
    Egyhazy, CJ
    Triantis, KP
    Bhasker, B
    DISTRIBUTED AND PARALLEL DATABASES, 1996, 4 (01) : 49 - 79