Xandy: Detecting changes on large unordered XML documents using relational Databases

被引:0
|
作者
Leonardi, E [1 ]
Bhowmick, SS
Madria, S
机构
[1] Nanyang Technol Univ, Sch Comp Engn, Singapore 2263, Singapore
[2] Univ Missouri, Dept Comp Sci, Rolla, MO 65409 USA
关键词
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Previous works in change detection on XML documents are not suitable for detecting the changes to large XML documents as it requires a lot of memory to keep the two versions of XML documents in the memory. In this paper, we take a more conservative yet novel approach of using traditional relational database engines for detecting the changes to large unordered XML documents. We elaborate how we detect the changes on unordered XML documents by using relational database. To this end, we have implemented a prototype system called XANDY that converts XML documents into relational tuples and detects the changes from these tuples by using SQL queries. Our experimental results show that the relational approach has better scalability compared to published algorithms like X-Diff. The result quality of our approach is comparable to the one of X-Diff.
引用
收藏
页码:711 / 723
页数:13
相关论文
共 50 条
  • [41] Generating Free Redundancy XML Documents from Non Normalized Relational Views Using A Statistically Approach
    Nasser, Mohammed
    Ibrahim, Hamidah
    Mamat, Ali
    Sulaiman, Md Nasir
    2008 3RD INTERNATIONAL CONFERENCE ON INFORMATION AND COMMUNICATION TECHNOLOGIES: FROM THEORY TO APPLICATIONS, VOLS 1-5, 2008, : 1233 - 1239
  • [42] XML2HBase: Storing and querying large collections of XML documents using a NoSQL database system
    Bao, Liang
    Yang, Jin
    Wu, Chase Q.
    Qi, Haiyang
    Zhang, Xin
    Cai, Shunda
    JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2022, 161 : 83 - 99
  • [43] A hyper-box approach using relational databases for large scale machine learning
    Papadakis, Stelios E.
    Stykas, Vangelis A.
    Mastorakis, George
    Mavromoustakis, Constandinos X.
    2014 INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND MULTIMEDIA (TEMU), 2014, : 69 - 73
  • [44] XAncestor: An efficient mapping approach for storing and querying XML documents in relational database using path-based technique
    Qtaish, Amjad
    Ahmad, Kamsuriah
    KNOWLEDGE-BASED SYSTEMS, 2016, 114 : 167 - 192
  • [45] Achieving k-anonymity Using Improved Greedy Heuristics for Very Large Relational Databases
    Babu, Korra Sathya
    Reddy, Nithin
    Kumar, Nitesh
    Elliot, Mark
    Jena, Sanjay Kumar
    TRANSACTIONS ON DATA PRIVACY, 2013, 6 (01) : 1 - 17
  • [46] Practical implications of using non-relational databases to store large genomic data files and novel phenotypes
    Souza, Andre Moreira
    Santos Weigert, Rodrigo de Andrade
    Machado de Sousa, Elaine Parros
    Andrietta, Lucas Tassoni
    Ventura, Ricardo Vieira
    JOURNAL OF ANIMAL BREEDING AND GENETICS, 2022, 139 (01) : 100 - 112
  • [47] Threat Analysis Using Topic Models in Large-Scale Vulnerability Databases and Security Incident Case Documents
    Koyanagi, Hiroki
    Takaragi, Kazuo
    Wohlgemuth, Sven
    Umezawa, Katsuyuki
    2021 IEEE VIRTUAL IEEE INTERNATIONAL SYMPOSIUM ON TECHNOLOGIES FOR HOMELAND SECURITY, 2021,
  • [48] Efficient evaluation of linear path expressions on large-scale heterogeneous XML documents using information retrieval techniques
    Park, YH
    Whang, KY
    Lee, BS
    Han, WS
    JOURNAL OF SYSTEMS AND SOFTWARE, 2006, 79 (02) : 180 - 190
  • [49] The analysis of long-term changes in plant communities using large databases: The effect of stratified resampling
    Haveman, Rense
    Janssen, John A. M.
    JOURNAL OF VEGETATION SCIENCE, 2008, 19 (03) : 355 - U97
  • [50] Detecting Connectivity Changes in Autism Spectrum Disorder Using Large-Scale Granger Causality
    Abidin, Anas Z.
    Dsouza, Adora M.
    Wismueller, Axel
    MEDICAL IMAGING 2019: IMAGE PROCESSING, 2019, 10949