Similarity Join of XML Documents Stored in File System

被引:0
|
作者
Bocassanta, F. [1 ]
Dorneles, C. F. [1 ]
机构
[1] Univ Fed Santa Catarina, BR-88040900 Florianopolis, SC, Brazil
关键词
Similarity join; similarity function; similarity join tool;
D O I
10.1109/TLA.2010.5688101
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Joining XML documents, in a data integration environment, is not a trivial task because besides data are stored in several representations (abbreviated, incomplete, or misspelled), XML data are usually organized as collection of values, which requires a different implementation of the join operation. In this paper, we present two similarity join operators, which are used over XML documents stored in file system. The operators have been implemented in a tool, called SimJoiX, which assists in the task of joining data stored in XML files.
引用
收藏
页码:722 / 727
页数:6
相关论文
共 50 条
  • [41] Concurrent design versioning system, based on XML file
    Delinchant, B
    Gerbaud, L
    Wurtz, F
    Ateinza, E
    IECON-2002: PROCEEDINGS OF THE 2002 28TH ANNUAL CONFERENCE OF THE IEEE INDUSTRIAL ELECTRONICS SOCIETY, VOLS 1-4, 2002, : 2485 - 2490
  • [42] A novel method for measuring similarity of XML documents based on extended adjacency matrix
    Zhang, Xueliang
    Yang, Ting
    Fan, Baoquan
    Wang, Xu
    Wei, Jinmao
    Journal of Computational Information Systems, 2011, 7 (07): : 2555 - 2565
  • [43] Clustering XML Documents Using Closed Frequent Subtrees: A Structural Similarity Approach
    Kutty, Sangeetha
    Tran, Tien
    Nayak, Richi
    Li, Yuefeng
    FOCUSED ACCESS TO XML DOCUMENTS, 2008, 4862 : 183 - 194
  • [44] Approximate top-k structural similarity search over XML documents
    Xie, T
    Sha, CF
    Wang, XL
    Zhou, AY
    FRONTIERS OF WWW RESEARCH AND DEVELOPMENT - APWEB 2006, PROCEEDINGS, 2006, 3841 : 319 - 330
  • [45] An intelligent XML publishing system for scientific documents -: word2XML
    Akudi, E
    Lu, J
    8TH WORLD MULTI-CONFERENCE ON SYSTEMICS, CYBERNETICS AND INFORMATICS, VOL II, PROCEEDINGS: COMPUTING TECHNIQUES, 2004, : 156 - 161
  • [46] A Survey on Tree Edit Distance Lower Bound Estimation Techniques for Similarity Join on XML Data
    Li, Fei
    Wang, Hongzhi
    Li, Jianzhong
    Gao, Hong
    SIGMOD RECORD, 2013, 42 (04) : 29 - 39
  • [47] PIX:: A system for phrase matching in XML documents:: A demonstration
    Amer-Yahia, S
    Fernández, M
    Srivastava, D
    Xu, Y
    19TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2003, : 774 - 776
  • [48] Study and development of the DTD generation system for XML documents
    Leonov, AV
    Khusnutdinov, RR
    PROGRAMMING AND COMPUTER SOFTWARE, 2005, 31 (04) : 197 - 210
  • [49] XAS: A system for accessing componentized, virtual XML documents
    Lo, ML
    Chen, SK
    Padmanabhan, S
    Chung, JY
    PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, 2001, : 493 - 502
  • [50] Study and Development of the DTD Generation System for XML Documents
    A. V. Leonov
    R. R. Khusnutdinov
    Programming and Computer Software, 2005, 31 : 197 - 210