Similarity measurement of XML documents based on structure and contents

Cited by: 0
Authors
Kim, Tae-Soon [1 ]
Lee, Ju-Hong [1 ]
Song, Jae-Won [1 ]
Kim, Deok-Hwan [2 ]
Affiliations
[1] Inha Univ, Dept Comp Sci & Informat Engn, Incheon, South Korea
[2] Inha Univ, Dept Elect Engn, Incheon, South Korea
Keywords
DOI
Not available
Chinese Library Classification number
TP301 [Theory, Methods];
Discipline classification code
081202;
Abstract
Research on measuring the similarity between XML documents aims to support the effective management and retrieval of diverse XML documents. Most previous work proposes similarity measures that consider only the tag structure of XML documents, and therefore miscalculates the semantic similarity of their contents. In this paper, we propose a new similarity measure that considers not only the structural information of tags in XML documents but also the semantic information of the tags and the text content associated with them. Our experiments show that the proposed method measures similarity more accurately than previous methods.
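The abstract does not give the formula, so the following is only an illustrative sketch of the general idea of combining a structural score with a content score. It is not the authors' method. The function names, the choice of Jaccard similarity over tag paths, cosine similarity over word counts, and the weight `alpha` are all assumptions made for illustration. The sketch also leaves out the semantic comparison of tag names (for example, treating `author` and `writer` as related) that the abstract mentions.

```python
import math
import xml.etree.ElementTree as ET
from collections import Counter


def tag_paths(root):
    """Collect the set of root-to-node tag paths, e.g. 'book/title'."""
    paths = set()

    def walk(node, prefix):
        path = f"{prefix}/{node.tag}" if prefix else node.tag
        paths.add(path)
        for child in node:
            walk(child, path)

    walk(root, "")
    return paths


def text_terms(root):
    """Count lowercase whitespace-separated tokens in all element text."""
    counts = Counter()
    for text in root.itertext():
        counts.update(text.lower().split())
    return counts


def jaccard(a, b):
    """Jaccard similarity of two sets (1.0 when both are empty)."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def cosine(c1, c2):
    """Cosine similarity of two term-count vectors (0.0 if either is empty)."""
    dot = sum(c1[t] * c2[t] for t in c1.keys() & c2.keys())
    norm = math.sqrt(sum(v * v for v in c1.values())) * math.sqrt(
        sum(v * v for v in c2.values())
    )
    return dot / norm if norm else 0.0


def xml_similarity(xml_a, xml_b, alpha=0.5):
    """Weighted mix of structural and content similarity.

    alpha is a hypothetical weight: 1.0 uses structure only, 0.0 content only.
    """
    root_a, root_b = ET.fromstring(xml_a), ET.fromstring(xml_b)
    structure = jaccard(tag_paths(root_a), tag_paths(root_b))
    content = cosine(text_terms(root_a), text_terms(root_b))
    return alpha * structure + (1 - alpha) * content
```

With this sketch, two documents that share the same tag layout but different text receive a high structural score and a low content score, which is the case a structure-only measure would misjudge.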
Pages: 902 / +
Page count: 2
Related papers
50 in total
  • [1] Classifying XML documents based on Structure/Content similarity
    Xing, Guangming
    Guo, Jinhua
    Xia, Zhonghang
    COMPARATIVE EVALUATION OF XML INFORMATION RETRIEVAL SYSTEMS, 2007, 4518 : 444 - 457
  • [2] Structure and Content Similarity for Clustering XML Documents
    Zhang, Lijun
    Li, Zhanhuai
    Chen, Qun
    Li, Ning
    WEB-AGE INFORMATION MANAGEMENT, 2010, 6185 : 116 - 124
  • [3] An implementation of XML documents search system based on similarity in structure and semantics
    Park, U
    Seo, Y
    INTERNATIONAL WORKSHOP ON CHALLENGES IN WEB INFORMATION RETRIEVAL AND INTEGRATION, PROCEEDINGS, 2005, : 97 - 102
  • [4] Similarity search for office XML documents based on style and structure data
    Watanabe, Yousuke
    Kamigaito, Hidetaka
    Yokota, Haruo
    INTERNATIONAL JOURNAL OF WEB INFORMATION SYSTEMS, 2013, 9 (02) : 100 - 116
  • [5] Clustering XML documents based on structural similarity
    Xing, Guangming
    Xia, Zhonghang
    Guo, Jinhua
    ADVANCES IN DATABASES: CONCEPTS, SYSTEMS AND APPLICATIONS, 2007, 4443 : 905 - +
  • [6] A methodology for measuring structure similarity of fuzzy XML documents
    Zhen Zhao
    Zongmin Ma
    Computing, 2017, 99 : 493 - 506
  • [7] A methodology for measuring structure similarity of fuzzy XML documents
    Zhao, Zhen
    Ma, Zongmin
    COMPUTING, 2017, 99 (05) : 493 - 506
  • [8] XML document similarity measure in terms of the structure and contents
    Kim, Woosaeng
    PROCEEDINGS OF THE 2ND WSEAS INTERNATIONAL CONFERENCE ON COMPUTER ENGINEERING AND APPLICATIONS: MODERN TOPICS OF COMPUTER SCIENCE, 2008, : 205 - 212
  • [9] Extraction of partial XML documents using IR-based structure and contents analysis
    Hatano, K
    Kinutani, H
    Yoshikawa, M
    Uemura, S
    CONCEPTUAL MODELING FOR NEW INFORMATION SYSTEMS TECHNOLOGIES, 2002, 2465 : 334 - 347
  • [10] An Efficient Structure Similarity Measure Method for XML documents based on Vector Space Model
    Yan, Hongcan
    Li, Minqiang
    Jin, Dianchuan
    Zhou, Dazhuo
    Yan, Shaohong
    DCABES 2008 PROCEEDINGS, VOLS I AND II, 2008, : 345 - +