Scaling XML query processing: distribution, localization and pruning

被引:0
|
作者
Patrick Kling
M. Tamer Özsu
Khuzaima Daudjee
机构
[1] University of Waterloo,Cheriton School of Computer Science
来源
关键词
Distributed; XML; Localization; Pruning;
D O I
暂无
中图分类号
学科分类号
摘要
Distributing data collections by fragmenting them is an effective way of improving the scalability of a database system. While the distribution of relational data is well understood, the unique characteristics of the XML data and query model present challenges that require different distribution techniques. In this paper, we show how XML data can be fragmented horizontally and vertically. Based on this, we propose solutions to two of the problems encountered in distributed query processing and optimization on XML data, namely localization and pruning. Localization takes a fragmentation-unaware query plan and converts it to a distributed query plan that can be executed at the sites that hold XML data fragments in a distributed system. We then show how the resulting distributed query plan can be pruned so that only those sites are accessed that can contribute to the query result. We demonstrate that our techniques can be integrated into a real-life XML database system and that they significantly improve the performance of distributed query execution.
引用
收藏
页码:445 / 490
页数:45
相关论文
共 50 条
  • [31] A query processing architecture for an XML data warehouse
    Wiwatwattana, Nuwee
    Jagadish, H. V.
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1513 - +
  • [32] A Method of XML Twig Query Processing based on XML Document Schema
    Yu, Yi
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON MECHANICAL, ELECTRONIC, CONTROL AND AUTOMATION ENGINEERING (MECAE 2017), 2017, 61 : 172 - 175
  • [33] Improved Centralized XML Query Processing Using Distributed Query Workload
    Subramaniam, Samini
    Haw, Su-Cheng
    Soon, Lay-Ki
    IEEE ACCESS, 2021, 9 : 29127 - 29142
  • [34] Methods for XML Phrase Query Processing over a Collection of XML Documents
    Paul, T.
    Kabir, M. H.
    Prodhan, M. A. T.
    Chowdhury, M. W.
    2011 IEEE PACIFIC RIM CONFERENCE ON COMMUNICATIONS, COMPUTERS AND SIGNAL PROCESSING (PACRIM), 2011, : 376 - 381
  • [35] Hybrid Dynamic Pruning for Efficient and Effective Query Processing
    Fang, Wenxiu
    Marbach, Trent G.
    Wang, Gang
    Liu, Xiaoguang
    CIKM '20: PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON INFORMATION & KNOWLEDGE MANAGEMENT, 2020, : 2013 - 2016
  • [36] Estimating communication costs for distributed XML query processing
    Park, Jong-Hyun
    Kang, Ji-Hoon
    IET COMMUNICATIONS, 2013, 7 (08) : 766 - 773
  • [37] Attribute grammars for scalable query processing on XML streams
    Christoph Koch
    Stefanie Scherzinger
    The VLDB Journal, 2007, 16 : 317 - 342
  • [38] An Efficient Indexing and Compressing Scheme for XML Query Processing
    Liao, I-En
    Hsu, Wen-Chiao
    Chen, Yu-Lin
    NETWORKED DIGITAL TECHNOLOGIES, PT 1, 2010, 87 : 70 - 84
  • [39] Attribute grammars for scalable query processing on XML streams
    Koch, C
    Scherzinger, S
    DATABASE PROGRAMMING LANGUAGES, 2004, 2921 : 233 - 256
  • [40] Attribute grammars for scalable query processing on XML streams
    Koch, Christoph
    Scherzinger, Stefanie
    VLDB JOURNAL, 2007, 16 (03): : 317 - 342