Similarity Join of XML Documents Stored in File System

被引:0
|
作者
Bocassanta, F. [1 ]
Dorneles, C. F. [1 ]
机构
[1] Univ Fed Santa Catarina, BR-88040900 Florianopolis, SC, Brazil
关键词
Similarity join; similarity function; similarity join tool;
D O I
10.1109/TLA.2010.5688101
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Joining XML documents, in a data integration environment, is not a trivial task because besides data are stored in several representations (abbreviated, incomplete, or misspelled), XML data are usually organized as collection of values, which requires a different implementation of the join operation. In this paper, we present two similarity join operators, which are used over XML documents stored in file system. The operators have been implemented in a tool, called SimJoiX, which assists in the task of joining data stored in XML files.
引用
收藏
页码:722 / 727
页数:6
相关论文
共 50 条
  • [1] Engineering documents into XML file formats
    Chiang, Chia-Chu
    INTERNATIONAL CONFERENCE ON INFORMATION TECHNOLOGY, PROCEEDINGS, 2007, : 610 - 615
  • [2] Computing similarity between XML documents for XML mining
    Lee, JW
    Park, SS
    ENGINEERING KNOWLEDGE IN THE AGE OF THE SEMANTIC WEB, PROCEEDINGS, 2004, 3257 : 492 - 493
  • [3] An implementation of XML documents search system based on similarity in structure and semantics
    Park, U
    Seo, Y
    INTERNATIONAL WORKSHOP ON CHALLENGES IN WEB INFORMATION RETRIEVAL AND INTEGRATION, PROCEEDINGS, 2005, : 97 - 102
  • [4] Similarity computation for XML documents by XML element sequence patterns
    Zhang, Haiwei
    Yuan, Xiaojie
    Yang, Na
    Liu, Zhongqi
    PROGRESS IN WWW RESEARCH AND DEVELOPMENT, PROCEEDINGS, 2008, 4976 : 227 - 232
  • [5] An approach for XML similarity join using tree serialization
    Wen, Lianzi
    Amagasa, Toshiyuki
    Kitagawa, Hiroyuki
    DATABASE SYSTEMS FOR ADVANCED APPLICATIONS, 2008, 4947 : 562 - 570
  • [6] Structural similarity between XML documents and DTDs
    Ng, PKL
    Ng, VTY
    COMPUTATIONAL SICENCE - ICCS 2003, PT III, PROCEEDINGS, 2003, 2659 : 412 - 421
  • [7] Using structural similarity for clustering XML documents
    Aitelhadj, Ali
    Boughanem, Mohand
    Mezghiche, Mohamed
    Souam, Fatiha
    KNOWLEDGE AND INFORMATION SYSTEMS, 2012, 32 (01) : 109 - 139
  • [8] Structure and Content Similarity for Clustering XML Documents
    Zhang, Lijun
    Li, Zhanhuai
    Chen, Qun
    Li, Ning
    WEB-AGE INFORMATION MANAGEMENT, 2010, 6185 : 116 - 124
  • [9] Using structural similarity for clustering XML documents
    Ali Aïtelhadj
    Mohand Boughanem
    Mohamed Mezghiche
    Fatiha Souam
    Knowledge and Information Systems, 2012, 32 : 109 - 139
  • [10] Semantic Structural Similarity for Clustering XML Documents
    Kim, Tae-Soon
    Lee, Ju-Hong
    Song, Jae-Won
    ICHIT 2008: INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, PROCEEDINGS, 2008, : 552 - 557