XCpaqs: Compression of XML document with XPath query support

被引:0
|
作者
Wang, HZ [1 ]
Li, JZ [1 ]
Luo, JZ [1 ]
He, ZY [1 ]
机构
[1] Harbin Inst Technol, Harbin, Peoples R China
关键词
XML; compression; query process; XPath; compressor;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Information in XML format has obvious redundancy that wastes disk space, bandwidth and I/O when querying XML data. For the efficiency of storage and query XML,it is necessary to compress XML data. In this paper, XCpaqs, a compression technology of XML, is presented. XCpaqs separates XML document into structure and context information. At the same time, it keeps homomorphism relation between compressed and original XML document. XCpaqs encodes tag and path respectively. It makes parts of XPath query could be processed in main memory. XCpaqs can recognize data types and uses different encode strategy to compress data with different type. This feature makes the technology support XML documents without schema information. Therefore, XCpaqs is adaptive for XML warehouse, which stores XML documents gathered from internet with various schemas. The technology of query execution on XML data compressed by XCpaqs is also presented.
引用
收藏
页码:354 / 358
页数:5
相关论文
共 50 条
  • [1] Nested XPath Query Optimization for XML Structured Document Database
    Senthilkumar, Radha
    Rakesh, G. B.
    Sasikala, N.
    Gowrishankar, M.
    Kannan, A.
    ADCOM: 2008 16TH INTERNATIONAL CONFERENCE ON ADVANCED COMPUTING AND COMMUNICATIONS, 2008, : 422 - +
  • [2] The complexity of XPath query evaluation and XML typing
    Gottlob, G
    Koch, C
    Pichler, R
    JOURNAL OF THE ACM, 2005, 52 (02) : 284 - 335
  • [3] XML document clustering using common XPath
    Leung, HP
    Chung, FL
    Chan, SCF
    Luk, R
    INTERNATIONAL WORKSHOP ON CHALLENGES IN WEB INFORMATION RETRIEVAL AND INTEGRATION, PROCEEDINGS, 2005, : 91 - 96
  • [4] An XML/XPath query language and XMark performance study
    Davis, KC
    Zhan, YS
    Davis, RB
    2003 SYMPOSIUM ON APPLICATIONS AND THE INTERNET, PROCEEDINGS, 2003, : 422 - 427
  • [5] Indexing XML documents for XPath query processing in external memory
    Chen, Qun
    Lim, Andrew
    Ong, Kian Win
    Tang, Jiqing
    DATA & KNOWLEDGE ENGINEERING, 2006, 59 (03) : 681 - 699
  • [6] XPath Query Technology of XML Data Stream Based on Structure Index
    Wei, Xianmin
    ADVANCED MATERIALS SCIENCE AND TECHNOLOGY, PTS 1-2, 2011, 181-182 : 103 - 108
  • [7] Schema-aware XPath filtering on XML document streams
    Lee, Daewook
    Kwon, Joonho
    Yang, Weidong
    Shin, Hyoseop
    Kwak, Jae-min
    Lee, Sukho
    JOURNAL OF INTELLIGENT MANUFACTURING, 2009, 20 (03) : 273 - 282
  • [8] Schema-aware XPath filtering on XML document streams
    Daewook Lee
    Joonho Kwon
    Weidong Yang
    Hyoseop Shin
    Jae-min Kwak
    Sukho Lee
    Journal of Intelligent Manufacturing, 2009, 20 : 273 - 282
  • [9] Concurrency control in XML document databases: XPath locking protocol
    Jea, KF
    Chen, SY
    Wang, SH
    NINTH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS, PROCEEDINGS, 2002, : 551 - 556
  • [10] Early nested word automata for XPath query answering on XML streams
    Debarbieux, Denis
    Gauwin, Olivier
    Niehren, Joachim
    Sebastian, Tom
    Zergaoui, Mohamed
    THEORETICAL COMPUTER SCIENCE, 2015, 578 : 100 - 125