Schema-aware XPath filtering on XML document streams

被引:0
|
作者
Lee, Daewook [1 ]
Kwon, Joonho [2 ]
Yang, Weidong [3 ]
Shin, Hyoseop [4 ]
Kwak, Jae-min [5 ]
Lee, Sukho [1 ]
机构
[1] Seoul Natl Univ, Sch Elect Engn & Comp Sci, Seoul 151742, South Korea
[2] Adv Inst Convergence Technol, Suwon 443270, Gyeonggi Do, South Korea
[3] Fudan Univ, Dept Comp & Informat Technol, Shanghai 200433, Peoples R China
[4] Konkuk Univ, Dept Adv Technol Fus, Sch Internet & Multimedia Engn, Seoul 143701, South Korea
[5] Korea Elect Technol Inst, SoC Res Ctr, Gyeonggi Do 463816, South Korea
关键词
XML; Filtering; Document Type Definition (DTD); XML schema; XML stream; XPath simplification; XPath optimization; PERFORMANCE;
D O I
10.1007/s10845-008-0218-x
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The XML stream filtering is gaining widespread attention from the research community in recent years. There have been many efforts to improve the performance of the XML filtering system by utilizing XML schema information. In this paper, we design and implement an XML stream filtering system, SFilter, which uses DTD or XML schema information for improving the performance. We propose the simplification and two kinds of optimization, one is static and the other is dynamic optimization. The Simplification and static optimization transform the XPath queries to make automata as an index structure for the filtering. The dynamic optimization are done in runtime at the filtering time. We developed five kinds of static optimization and two kinds of dynamic optimization. We present the novel filtering algorithm for the resulting transformed XPath queries and runtime optimizing. The experimental result shows that our system filters the XML streams efficiently.
引用
收藏
页码:273 / 282
页数:10
相关论文
共 50 条
  • [41] Efficient Filtering of XML Documents with XPath Expressions Containing Ancestor Axis
    Ning, Bo
    Liu, Chengfei
    Wang, Guoren
    WEB-AGE INFORMATION MANAGEMENT, PROCEEDINGS, 2010, 6184 : 551 - +
  • [42] Semantic Annotation of XML-Schema for Document Transformations
    Koepke, Julius
    Eder, Johann
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2010 WORKSHOPS, 2010, 6428 : 219 - 228
  • [43] RETA: A Schema-Aware, End-to-End Solution for Instance Completion in Knowledge Graphs
    Rosso, Paolo
    Yang, Dingqi
    Ostapuk, Natalia
    Cudre-Mauroux, Philippe
    PROCEEDINGS OF THE WORLD WIDE WEB CONFERENCE 2021 (WWW 2021), 2021, : 845 - 856
  • [44] Schema Extraction and Integration of Heterogeneous XML Document Collections
    Janga, Prudhvi
    Davis, Karen C.
    MODEL AND DATA ENGINEERING, MEDI 2013, 2013, 8216 : 176 - 187
  • [45] Worst-case optimal algorithm for XPath evaluation over XML streams
    Ramanan, Prakash
    JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2009, 75 (08) : 465 - 485
  • [46] Normalization of the Forward XPath for Efficient Query Evaluation over XML Data Streams
    Qiao, Lixiang
    Yang, Zhimin
    Yang, Chi
    Ren, Kaijun
    Liu, Chang
    JCPC: 2009 JOINT CONFERENCE ON PERVASIVE COMPUTING, 2009, : 365 - +
  • [47] Validating key constraints over XML document using XPath and structure checking
    Liu, YF
    Yang, DQ
    Tang, SW
    Wang, TJ
    Gao, J
    FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2005, 21 (04): : 583 - 595
  • [48] Embedding XML Schema constraints in search-based intersection tests for XPath query optimization
    Böttcher, S
    Steinmetz, R
    SIXTEENTH INTERNATIONAL WORKSHOP ON DATABASE AND EXPERT SYSTEMS APPLICATIONS, PROCEEDINGS, 2005, : 842 - 846
  • [49] Enhancing WS-BPEL Dynamic Invariant Generation Using XML Schema and XPath Information
    Palomo-Duarte, Manuel
    Garcia-Dominguez, Antonio
    Medina-Bulo, Inmaculada
    WEB ENGINEERING, PROCEEDINGS, 2009, 5648 : 469 - 472
  • [50] Satisfiability-test, rewriting and refinement of users' XPath queries according to XML schema definitions
    Groppe, Jinghua
    Groppe, Sven
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS, PROCEEDINGS, 2006, 4152 : 22 - 38