Schema-aware XPath filtering on XML document streams

被引:0
|
作者
Daewook Lee
Joonho Kwon
Weidong Yang
Hyoseop Shin
Jae-min Kwak
Sukho Lee
机构
[1] Seoul National University,School of Electrical Engineering and Computer Science
[2] Advanced Institutes of Convergence Technology,Department of Computing and Information Technology
[3] Fudan University,Department of Advanced Technology Fusion
[4] School of Internet and Multimedia Engineering,undefined
[5] Konkuk University,undefined
[6] SoC Research Center,undefined
[7] Korea Electronics Technology Institute,undefined
来源
关键词
XML; Filtering; Document Type Definition (DTD); XML schema; XML stream; XPath simplification; XPath optimization;
D O I
暂无
中图分类号
学科分类号
摘要
The XML stream filtering is gaining widespread attention from the research community in recent years. There have been many efforts to improve the performance of the XML filtering system by utilizing XML schema information. In this paper, we design and implement an XML stream filtering system, SFilter, which uses DTD or XML schema information for improving the performance. We propose the simplification and two kinds of optimization, one is static and the other is dynamic optimization. The Simplification and static optimization transform the XPath queries to make automata as an index structure for the filtering. The dynamic optimization are done in runtime at the filtering time. We developed five kinds of static optimization and two kinds of dynamic optimization. We present the novel filtering algorithm for the resulting transformed XPath queries and runtime optimizing. The experimental result shows that our system filters the XML streams efficiently.
引用
收藏
页码:273 / 282
页数:9
相关论文
共 50 条
  • [31] Formal Framework of XML Document Schema Design
    Zainol, Zurinahni
    Wang, Bing
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2012, 2 (01) : 21 - 64
  • [32] Entropy as a Measure of Quality of XML Schema Document
    Basci, Dilek
    Misra, Sanjay
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2011, 8 (01) : 75 - 83
  • [33] Schema extraction for multimedia XML document retrieval
    Yoon, JP
    Kim, S
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING, VOL II, 2000, : 113 - 120
  • [34] Decompose, Enrich, and Extract! Schema-aware Event Extraction using LLMs.
    Shiri, Fatemeh
    Moghimifar, Farhad
    Haffari, Reza
    Li, Yuan-Fang
    Van Nguyen
    Yoo, John
    2024 27TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, FUSION 2024, 2024,
  • [35] Schema-Aware Hyper-Relational Knowledge Graph Embeddings for Link Prediction
    Lu, Yuhuan
    Yang, Dingqi
    Wang, Pengyang
    Rosso, Paolo
    Cudre-Mauroux, Philippe
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (06) : 2614 - 2628
  • [36] BLAST: a Loosely Schema-aware Meta-blocking Approach for Entity Resolution
    Simonini, Giovanni
    Bergamaschi, Sonia
    Jagadish, H. V.
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (12): : 1173 - 1184
  • [37] Early nested word automata for XPath query answering on XML streams
    Debarbieux, Denis
    Gauwin, Olivier
    Niehren, Joachim
    Sebastian, Tom
    Zergaoui, Mohamed
    THEORETICAL COMPUTER SCIENCE, 2015, 578 : 100 - 125
  • [38] A Method of XML Twig Query Processing based on XML Document Schema
    Yu, Yi
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON MECHANICAL, ELECTRONIC, CONTROL AND AUTOMATION ENGINEERING (MECAE 2017), 2017, 61 : 172 - 175
  • [39] XML-Document-Filtering Automaton
    Silvasti, Panu
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (02): : 1666 - 1671
  • [40] Schema-aware Reference as Prompt Improves Data-Efficient Knowledge Graph Construction
    Yao, Yunzhi
    Mao, Shengyu
    Zhang, Ningyu
    Chen, Xiang
    Deng, Shumin
    Chen, Xi
    Chen, Huajun
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 911 - 921