Fast and efficient computation of reachability queries over linked XML documents' graphs

被引:2
|
作者
Sayed, Awny [1 ]
机构
[1] Menia Univ, Comp Sci Dept, Al Minya, Egypt
关键词
Extensible markup language; Databases; Query Languages;
D O I
10.1108/17440080910947312
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Purpose - The increasing popularity of XML has generated a lot of interest in query processing over graph-structured data. To support efficient evaluation of path expressions structured indexes have been proposed. Extending the proposed indexes to work with large XML graphs and to support intraor inter-document links requires a lot of computing power for the creation process and a lot of space to store the indexes. Moreover, the efficient evaluation of ancestors-descendants queries over arbitrary graphs with long paths is a severe problem. This paper aims to propose a scalable path index which is based on the concept of 2-hop covers as introduced by Cohen et al. Design/methodology/approach - The problem of efficiently managing and querying XML documents poses interesting challenges on database research. The proposed algorithm for index creation scales down the original graph size substantially. As a result a directed acyclic graph with a smaller number of nodes and edges will emerge. This reduces the number of computing steps required for building the index. Thus, computing time and space will be reduced as well. The index also permits ancestors-descendants relationships to be efficiently evaluated. Moreover, the proposed index has a nice property in comparison to most other work; it is optimized for descendants-or-self queries on arbitrary graphs with link relationships. Findings - In this paper, a scalable path index is proposed. It can efficiently address the problem of querying large XML documents that contain links and have cycles. Cycles in the graph stress path-indexing algorithms. An overview about 2-hop cover and the algorithms that used to build the index are given. Research limitations/implications - This paper works on the updating problem. Since the construction of the index is quite complex its construction make sense for some time. However, this means it is currently dealing with the problem of updating XML-documents. Originality/value - This paper presents an efficient path index that can test the reachability between two nodes and evaluate ancestors-descendants queries over arbitrary graphs with long paths.
引用
收藏
页码:56 / +
页数:22
相关论文
共 50 条
  • [21] The space complexity of processing XML twig queries over indexed documents
    Shalem, Mirit
    Bar-Yossef, Ziv
    2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 824 - +
  • [22] Evaluation of XPath Queries Over XML Documents Using SparkSQL Framework
    Hricov, Radoslav
    Senk, Adam
    Kroha, Petr
    Valenta, Michal
    BEYOND DATABASES, ARCHITECTURES AND STRUCTURES: TOWARDS EFFICIENT SOLUTIONS FOR DATA ANALYSIS AND KNOWLEDGE REPRESENTATION, 2017, 716 : 28 - 41
  • [23] Efficient Processing of Queries over Recursive XML Data
    Alghamdi, Norah Saleh
    Rahayu, Wenny
    Pardede, Eric
    2015 IEEE 29th International Conference on Advanced Information Networking and Applications (IEEE AINA 2015), 2015, : 134 - 142
  • [24] Interval-Index: A Scalable and Fast Approach for Reachability Queries in Large Graphs
    Li, Fangxu
    Yuan, Pingpeng
    Jin, Hai
    KNOWLEDGE SCIENCE, ENGINEERING AND MANAGEMENT, KSEM 2015, 2015, 9403 : 224 - 235
  • [25] I/O Efficient Label-Constrained Reachability Queries in Large Graphs
    Yuan, Long
    Li, Xia
    Chen, Zi
    Lin, Xuemin
    Zhao, Xiang
    Zhang, Wenjie
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2024, 17 (10): : 2590 - 2602
  • [26] Efficient In-Memory Evaluation of Reachability Graph Pattern Queries on Data Graphs
    Wu, Xiaoying
    Theodoratos, Dimitri
    Skoutas, Dimitrios
    Lan, Michael
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, DASFAA 2022, PT I, 2022, : 55 - 71
  • [27] Efficient reachability test on directed graphs and its application to large XML data
    Nakamura, Yuusaku
    Maita, Tetsuya
    Sakamoto, Hiroshi
    2007 IEEE INTERNATIONAL WORKSHOP ON DATABASES FOR NEXT GENERATION RESEARCHERS, 2007, : 25 - +
  • [28] Fast SLCA and ELCA Computation for XML Keyword Queries based on Set Intersection
    Zhou, Junfeng
    Bao, Zhifeng
    Wang, Wei
    Ling, Tok Wang
    Chen, Ziyang
    Lin, Xudong
    Guo, Jingfeng
    2012 IEEE 28TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING (ICDE), 2012, : 905 - 916
  • [29] DLCR : Efficient Indexing for Label-Constrained Reachability Queries on Large Dynamic Graphs
    Chen, Xin
    Peng, You
    Wang, Sibo
    Yu, Jeffrey Xu
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2022, 15 (08): : 1645 - 1657
  • [30] Horton+: A Distributed System for Processing Declarative Reachability Queries over Partitioned Graphs
    Sarwat, Mohamed
    Elnikety, Sameh
    He, Yuxiong
    Mokbel, Mohamed F.
    PROCEEDINGS OF THE VLDB ENDOWMENT, 2013, 6 (14): : 1918 - 1929