Schema-aware XPath filtering on XML document streams

被引:0
|
作者
Lee, Daewook [1 ]
Kwon, Joonho [2 ]
Yang, Weidong [3 ]
Shin, Hyoseop [4 ]
Kwak, Jae-min [5 ]
Lee, Sukho [1 ]
机构
[1] Seoul Natl Univ, Sch Elect Engn & Comp Sci, Seoul 151742, South Korea
[2] Adv Inst Convergence Technol, Suwon 443270, Gyeonggi Do, South Korea
[3] Fudan Univ, Dept Comp & Informat Technol, Shanghai 200433, Peoples R China
[4] Konkuk Univ, Dept Adv Technol Fus, Sch Internet & Multimedia Engn, Seoul 143701, South Korea
[5] Korea Elect Technol Inst, SoC Res Ctr, Gyeonggi Do 463816, South Korea
关键词
XML; Filtering; Document Type Definition (DTD); XML schema; XML stream; XPath simplification; XPath optimization; PERFORMANCE;
D O I
10.1007/s10845-008-0218-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The XML stream filtering is gaining widespread attention from the research community in recent years. There have been many efforts to improve the performance of the XML filtering system by utilizing XML schema information. In this paper, we design and implement an XML stream filtering system, SFilter, which uses DTD or XML schema information for improving the performance. We propose the simplification and two kinds of optimization, one is static and the other is dynamic optimization. The Simplification and static optimization transform the XPath queries to make automata as an index structure for the filtering. The dynamic optimization are done in runtime at the filtering time. We developed five kinds of static optimization and two kinds of dynamic optimization. We present the novel filtering algorithm for the resulting transformed XPath queries and runtime optimizing. The experimental result shows that our system filters the XML streams efficiently.
引用
收藏
页码:273 / 282
页数:10
相关论文
共 50 条
  • [31] Formal Framework of XML Document Schema Design
    Zainol, Zurinahni
    Wang, Bing
    INTERNATIONAL JOURNAL OF INFORMATION RETRIEVAL RESEARCH, 2012, 2 (01) : 21 - 64
  • [32] Entropy as a Measure of Quality of XML Schema Document
    Basci, Dilek
    Misra, Sanjay
    INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY, 2011, 8 (01) : 75 - 83
  • [33] Schema extraction for multimedia XML document retrieval
    Yoon, JP
    Kim, S
    PROCEEDINGS OF THE FIRST INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS ENGINEERING, VOL II, 2000, : 113 - 120
  • [34] Decompose, Enrich, and Extract! Schema-aware Event Extraction using LLMs.
    Shiri, Fatemeh
    Moghimifar, Farhad
    Haffari, Reza
    Li, Yuan-Fang
    Van Nguyen
    Yoo, John
    2024 27TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION, FUSION 2024, 2024,
  • [35] Schema-Aware Hyper-Relational Knowledge Graph Embeddings for Link Prediction
    Lu, Yuhuan
    Yang, Dingqi
    Wang, Pengyang
    Rosso, Paolo
    Cudre-Mauroux, Philippe
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (06) : 2614 - 2628
  • [36] BLAST: a Loosely Schema-aware Meta-blocking Approach for Entity Resolution
    Simonini, Giovanni
    Bergamaschi, Sonia
    Jagadish, H. V.
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2016, 9 (12): : 1173 - 1184
  • [37] Early nested word automata for XPath query answering on XML streams
    Debarbieux, Denis
    Gauwin, Olivier
    Niehren, Joachim
    Sebastian, Tom
    Zergaoui, Mohamed
    THEORETICAL COMPUTER SCIENCE, 2015, 578 : 100 - 125
  • [38] A Method of XML Twig Query Processing based on XML Document Schema
    Yu, Yi
    PROCEEDINGS OF THE 2017 INTERNATIONAL CONFERENCE ON MECHANICAL, ELECTRONIC, CONTROL AND AUTOMATION ENGINEERING (MECAE 2017), 2017, 61 : 172 - 175
  • [39] XML-Document-Filtering Automaton
    Silvasti, Panu
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2008, 1 (02): : 1666 - 1671
  • [40] Schema-aware Reference as Prompt Improves Data-Efficient Knowledge Graph Construction
    Yao, Yunzhi
    Mao, Shengyu
    Zhang, Ningyu
    Chen, Xiang
    Deng, Shumin
    Chen, Xi
    Chen, Huajun
    PROCEEDINGS OF THE 46TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, SIGIR 2023, 2023, : 911 - 921