Similarity Join of XML Documents Stored in File System

Cited by: 0
Authors
Bocassanta, F. [1]
Dorneles, C. F. [1]
Affiliations
[1] Univ Fed Santa Catarina, BR-88040900 Florianopolis, SC, Brazil
Keywords
Similarity join; similarity function; similarity join tool
DOI
10.1109/TLA.2010.5688101
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Subject classification code
0812
Abstract
Joining XML documents in a data integration environment is not a trivial task: besides the data being stored in several representations (abbreviated, incomplete, or misspelled), XML data are usually organized as collections of values, which requires a different implementation of the join operation. In this paper, we present two similarity join operators for XML documents stored in a file system. The operators have been implemented in a tool called SimJoiX, which assists in the task of joining data stored in XML files.
Pages: 722-727
Number of pages: 6