Similarity measurement of XML documents based on structure and contents

被引:0
|
作者
Kim, Tae-Soon [1 ]
Lee, Ju-Hong [1 ]
Song, Jae-Won [1 ]
Kim, Deok-Hwan [2 ]
机构
[1] Inha Univ, Dept Comp Sci & Informat Engn, Incheon, South Korea
[2] Inha Univ, Dept Elect Engn, Incheon, South Korea
关键词
D O I
暂无
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Researches on the similarity measure between XML documents are being progressed in order to effectively control and retrieve various XNIL documents. Previous works mostly suggest similarity-measuring methods focusing only on the tag structure of XML documents. However, they have a problem of incorrectly calculating the semantic similarity of XML contents. In this paper, we propose a new similarity measurement method considering not only the structural information of tags in XML documents but also the semantic information of tags and text content information related with the tags. Our experiments demonstrate that our proposed method improves the accuracy of similarity, compared to the previous works.
引用
收藏
页码:902 / +
页数:2
相关论文
共 50 条
  • [31] Clustering XML Documents by Structure
    Lesniewska, Anna
    ADVANCES IN DATABASES AND INFORMATION SYSTEMS, 2010, 5968 : 238 - 246
  • [32] A novel method for measuring similarity of XML documents based on extended adjacency matrix
    Zhang, Xueliang
    Yang, Ting
    Fan, Baoquan
    Wang, Xu
    Wei, Jinmao
    Journal of Computational Information Systems, 2011, 7 (07): : 2555 - 2565
  • [33] Similarity Join of XML Documents Stored in File System
    Bocassanta, F.
    Dorneles, C. F.
    IEEE LATIN AMERICA TRANSACTIONS, 2010, 8 (06) : 722 - 727
  • [34] Semantic Structural Similarity Measure for Clustering XML Documents
    Song, Ling
    Ma, Jun
    Lei, Jingsheng
    Zhang, Dongmei
    Wang, Zhen
    WEB INFORMATION SYSTEMS AND MINING, PROCEEDINGS, 2009, 5854 : 232 - +
  • [35] Measuring the structural similarity among XML documents and DTDs
    Elisa Bertino
    Giovanna Guerrini
    Marco Mesiti
    Journal of Intelligent Information Systems, 2008, 30 : 55 - 92
  • [36] Measuring the structural similarity among XML documents and DTDs
    Bertino, Elisa
    Guerrini, Giovanna
    Mesiti, Marco
    JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2008, 30 (01) : 55 - 92
  • [37] Structural similarity evaluation between XML documents and DTDs
    Tekli, Joe
    Chbeir, Richard
    Yetongnon, Kokou
    WEB INFORMATION SYSTEMS ENGINEERING - WISE 2007, PROCEEDINGS, 2007, 4831 : 196 - 211
  • [38] Unification of XML DTD for XML documents with similar structure
    Yoo, CS
    Woo, SM
    Kim, YS
    COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2005, PT 3, 2005, 3482 : 954 - 963
  • [39] XCLSC: Structure and Content-based Clustering of XML Documents
    Bessine, Karima
    Nehar, Attia
    Cherroun, Hadda
    Moussaoui, Abdelouahab
    2015 12TH IEEE INTERNATIONAL CONFERENCE ON PROGRAMMING AND SYSTEMS (ISPS), 2015, : 221 - 227
  • [40] Implementation of index schema for XML documents based on structure of database
    Song, Youngrok
    Choo, Kyonam
    Woo, Yoseop
    Min, Hongki
    WEBIST 2007: PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON WEB INFORMATION SYSTEMS AND TECHNOLOGIES, VOL IT: INTERNET TECHNOLOGY, 2007, : 402 - +