Similarity Join of XML Documents Stored in File System

Citations: 0
Authors
Bocassanta, F. [1]
Dorneles, C. F. [1]
Affiliation
[1] Univ Fed Santa Catarina, BR-88040900 Florianopolis, SC, Brazil
Keywords
Similarity join; similarity function; similarity join tool;
DOI
10.1109/TLA.2010.5688101
Chinese Library Classification number
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
Joining XML documents in a data integration environment is not a trivial task: besides data being stored in several representations (abbreviated, incomplete, or misspelled), XML data are usually organized as collections of values, which requires a different implementation of the join operation. In this paper, we present two similarity join operators that work over XML documents stored in a file system. The operators have been implemented in a tool called SimJoiX, which assists in the task of joining data stored in XML files.
Pages: 722-727
Number of pages: 6
Related papers
50 in total
  • [31] Similarity Evaluation of XML Documents Based on Weighted Element Tree Model
    Wang, Chenying
    Yuan, Xiaojie
    Ning, Hua
    Lian, Xin
    ADVANCED DATA MINING AND APPLICATIONS, PROCEEDINGS, 2009, 5678 : 680 - 687
  • [32] A Prufer Sequence Based Approach to Measure Structural Similarity of XML Documents
    Periakaruppan, Ramanathan
    Nadarajan, Rethinaswamy
    ON THE MOVE TO MEANINGFUL INTERNET SYSTEMS: OTM 2013 WORKSHOPS, 2013, 8186 : 639 - 648
  • [33] Evaluate structure similarity in XML documents with merge-edit-distance
    Zhou, Chong
    Lu, Yansheng
    Zou, Lei
    Hu, Rong
    EMERGING TECHNOLOGIES IN KNOWLEDGE DISCOVERY AND DATA MINING, 2007, 4819 : 301 - 311
  • [34] A METHODOLOGY FOR USING EDGES TO MEASURE STRUCTURAL AND SEMANTIC SIMILARITY OF XML DOCUMENTS
    Qiu, Hong-Jun
    Yu, Wen-Jing
    PROCEEDINGS OF 2009 INTERNATIONAL CONFERENCE ON MACHINE LEARNING AND CYBERNETICS, VOLS 1-6, 2009, : 1653 - +
  • [35] Multimedia file forensics system exploiting file similarity search
    Min-Ja Kim
    Chuck Yoo
    Young-Woong Ko
    Multimedia Tools and Applications, 2019, 78 : 5233 - 5254
  • [36] A Bloom Filter Based Approach for Evaluating Structural Similarity of XML Documents
    Peng, Dunlu
    Hou, Huan
    Lu, Jing
    WEB INFORMATION SYSTEMS AND MINING, PROCEEDINGS, 2009, 5854 : 242 - 251
  • [37] Multimedia file forensics system exploiting file similarity search
    Kim, Min-Ja
    Yoo, Chuck
    Ko, Young-Woong
    MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (05) : 5233 - 5254
  • [38] Similarity search for office XML documents based on style and structure data
    Watanabe, Yousuke
    Kamigaito, Hidetaka
    Yokota, Haruo
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2013, 9 (02) : 100 - 116
  • [39] RAX System to Rank Arabic XML Documents
    Elzentani, Hesham
    Veinovic, Mladen
    Simic, Goran
    INTERNATIONAL JOURNAL OF ADVANCED COMPUTER SCIENCE AND APPLICATIONS, 2016, 7 (12) : 179 - 190
  • [40] An Efficient Duplicate Detection System for XML Documents
    Lwin, Thandar
    Nyunt, Thi Thi Soe
    2010 SECOND INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATIONS: ICCEA 2010, PROCEEDINGS, VOL 2, 2010, : 178 - 182