A Novel Method for Measuring Structure and Semantic Similarity of XML Documents Based on Extended Adjacency Matrix

被引:2
|
作者
Zhang, Xue-Liang [1 ]
Yang, Ting [1 ]
Fan, Bao-Quan [1 ]
Wang, Xu [1 ]
Wei, Jin-Mao [1 ]
机构
[1] Nankai Univ, Coll Informat Tech Sci, Tianjin 300071, Peoples R China
关键词
similarity; XML; semantic; structure; adjacency matrix;
D O I
10.1016/j.phpro.2012.02.215
中图分类号
T [工业技术];
学科分类号
08 ;
摘要
Similarity measurement of XML documents is crucial to meet various needs of approximate searches and document classifications in XML-oriented applications. Some methods have been proposed for this purpose. Nevertheless, few methods can be elegantly exploited to depict structure and semantic information and hence to effectively measure the similarity of XML documents. In this paper, we present a new method of computing the structure and semantic similarity of XML documents based on extended adjacency matrix(EAM). Different from a general adjacency matrix, in an EAM, the structure information of not only the adjacent layers but also the ancestor-descendant layers can be stored. For measuring the similarity of two XML documents, the proposed method firstly stores the structure and semantic information in two extended adjacency matrices (M-1,M-2). Then it computes similarity of the two documents through cos(M-1,M-2). Experimental results on bench-mark data show that the method holds high efficiency and accuracy. (C) 2011 Published by Elsevier B.V. Selection and/or peer-review under responsibility of ICAPIE Organization Committee.
引用
收藏
页码:1452 / 1461
页数:10
相关论文
共 50 条
  • [1] A novel method for measuring similarity of XML documents based on extended adjacency matrix
    Zhang, Xueliang
    Yang, Ting
    Fan, Baoquan
    Wang, Xu
    Wei, Jinmao
    Journal of Computational Information Systems, 2011, 7 (07): : 2555 - 2565
  • [2] A novel method for measuring semantic similarity for XML schema matching
    Jeong, Buhwan
    Lee, Damon
    Cho, Hyunbo
    Lee, Jaewook
    EXPERT SYSTEMS WITH APPLICATIONS, 2008, 34 (03) : 1651 - 1658
  • [3] A methodology for measuring structure similarity of fuzzy XML documents
    Zhen Zhao
    Zongmin Ma
    Computing, 2017, 99 : 493 - 506
  • [4] A methodology for measuring structure similarity of fuzzy XML documents
    Zhao, Zhen
    Ma, Zongmin
    COMPUTING, 2017, 99 (05) : 493 - 506
  • [5] A kernel method for measuring structural similarity between XML documents
    Jeong, Buhwan
    Lee, Daewon
    Cho, Hyunbo
    Kulvatunyou, Boonserm
    NEW TRENDS IN APPLIED ARTIFICIAL INTELLIGENCE, PROCEEDINGS, 2007, 4570 : 572 - +
  • [6] Semantic Structural Similarity for Clustering XML Documents
    Kim, Tae-Soon
    Lee, Ju-Hong
    Song, Jae-Won
    ICHIT 2008: INTERNATIONAL CONFERENCE ON CONVERGENCE AND HYBRID INFORMATION TECHNOLOGY, PROCEEDINGS, 2008, : 552 - 557
  • [7] Classifying XML documents based on Structure/Content similarity
    Xing, Guangming
    Guo, Jinhua
    Xia, Zhonghang
    COMPARATIVE EVALUATION OF XML INFORMATION RETRIEVAL SYSTEMS, 2007, 4518 : 444 - 457
  • [8] Similarity measures for XML documents based on kernel matrix learning
    Institute of Computer Science and Technology, Peking University, Beijing 100871, China
    不详
    Ruan Jian Xue Bao, 2006, 5 (991-1000):
  • [9] Similarity measurement of XML documents based on structure and contents
    Kim, Tae-Soon
    Lee, Ju-Hong
    Song, Jae-Won
    Kim, Deok-Hwan
    COMPUTATIONAL SCIENCE - ICCS 2007, PT 3, PROCEEDINGS, 2007, 4489 : 902 - +
  • [10] Semantic Structural Similarity Measure for Clustering XML Documents
    Song, Ling
    Ma, Jun
    Lei, Jingsheng
    Zhang, Dongmei
    Wang, Zhen
    WEB INFORMATION SYSTEMS AND MINING, PROCEEDINGS, 2009, 5854 : 232 - +